Skip to content

guidj/spotlight

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spotlight

Annotating Movie Reviews with UIMA

Using UIMA to annotate names of cinematographers, actors and characters in movie reviews

Data Set

Movie review data set by Bo Pang and Lillian Lee. Can be downloaded here

Download the data files, and place the positive review files under the folder:

resources/data/text/pos/

Annotation

To execute the annotators with UIMA tools, one needs to configure a few variables regarding the location of descriptor files and other resources.

  • modelMEFilePath: location of the OpenNLP POS tag pack for English (EN)

  • sentenceMEFilePath: location of the OpenNLP sentence identification pack for English (EN)

You can edit these variables in the PersonDescriptor.xml file.

Running

You need to have UIMA properly set up in your environment.

Use use the documentAnalyser tool to annotate documents:

$UIMA_HOME/bin/documentAnalyzer.sh

And the annotationViewer to view the annotations:

$UIMA_HOME/bin/annotationViewer.sh

Building

mvn clean compile assembly:single

About

NLP: Annotating Movie Reviews with UIMA

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages