The MetaSieve Blog

May 15, 2010

Natural Language Processing in Groovy: A Primer – Using Groovy for processing and analysing textual data

Filed under: Uncategorized — Tags: , , — Björn Wilmsmann @ 1:00 am

The May issue of GroovyMag, an online-only magazine for everything Groovy and Grails has been published today.

I’ve contributed an article about doing natural language processing with Groovy. Here’s a teaser:

Given that most Internet content in way or another contains natural language, then it’s no surprise natural language processing (NLP) and text mining have become a vital aspect of many Internet applications – from large scale search engines to social media apps. Making sense of the content that is stored in your application can make all the difference. Groovy’s regular expression, text processing abilities and readily available NLP libraries for Java make it a natural match for processing large amounts of text.

Since its inception Perl, the language that arguably can be considered the first scripting language in modern terms, had the features necessary to conveniently process large amounts of text built right into the language. Fortunately, most scripting languages that followed Perl borrowed and inherited many of these useful features. This is why things like methods for reading texts and regular expressions are commonplace in Python, Ruby and, of course, Groovy too.

[ … ]

Read the rest of this article at GroovyMag.

Related links:

Leave a Comment »

No comments yet.

RSS feed for comments on this post. TrackBack URI

Leave a comment

Blog at WordPress.com.