LexRank

LexRank is the extractive generic text summarization system proposed by Erkan and Radev. This algoritm is the stochastic graph based method to find important sentences of text to create meaningful summarizations in multi-document systems for Natural Language Processing. The main idea of this process was similar sentences in a cluster are the more central to subject and to find the most important sentences they benefit from eigenvector centrality of representations of sentences in a graph. There are different ways of defining similarity between two sentences, however in this algorithm one of the most popular similarity measure which is cosine distance similarity metric is used.

Representation of documents modeled as vectors (with TF-IDF counts) in a vector space and similarity between different documents in this space represented by cosine similarity matrix. According to this method, every node represents one sentence and then graph is constructed and cosine similarity determines edges between nodes. Pagerank algorithm is used to compute the centrality of sentences over whole text. Sentences which have a high rank are more central to the topic.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LexRank.R		LexRank.R
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LexRank

About

Releases

Packages

Languages

bguvenc/LexRank

Folders and files

Latest commit

History

Repository files navigation

LexRank

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages