An extraction-based text summarization algorithm. This Python script was written to experiment with NLTK and to implement a summarization algorithm for a given text. It is adapted from an article on The Tokenizer and the following gist.
Warning: Some Unicode issues persist.
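To make "extraction-based" concrete, here is a minimal sketch of one common approach: score each sentence by the average frequency of its words and keep the top-scoring sentences in their original order. This is not necessarily how summarizer.py works. The function names (`split_sentences`, `tokenize`, `summarize`), the tiny stopword list, and the regex-based splitting are all assumptions made for illustration. The sketch uses only the standard library, so the regexes stand in for NLTK's tokenizers, and it only handles ASCII words.

```python
from __future__ import division

import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; NLTK ships a fuller one).
STOPWORDS = frozenset(
    "a an and are as at be by for from has in is it its of on or that the "
    "this to was were with".split()
)


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Return lowercase ASCII word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z0-9']+", sentence.lower())
            if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, in their original order."""
    sentences = split_sentences(text)
    # Word frequencies over the whole text drive the sentence scores.
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        words = tokenize(sentence)
        # Average word frequency, so long sentences are not favoured.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore document order
    return [sentences[i] for i in keep]


if __name__ == "__main__":
    sample = "Python is popular. Python loves Python. Weather today."
    for line in summarize(sample, max_sentences=2):
        print(line)
```

Averaging the word frequencies rather than summing them is a design choice in this sketch: a plain sum would tend to favour long sentences regardless of how relevant their words are.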
Navigate to the script's folder on the command line and run:
python summarizer.py <url>
Example:
python summarizer.py http://en.wikipedia.org/wiki/Nodejs
This script has been tested on Python 2.7 and has the following dependencies:
- NLTK
- goose
- numpy