Skip to content

dale2456/google_ngrams_Version-20200217

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Method For 1-grams

get all 1 grams converted to lower case

choose those with:

no part with postag = _NOUN,_VERB,_ADJ,_ADV,_PRON,_DET,_ADP,_NUM,_CONJ,_PRT,_ROOT,_START,_END,_.,_X

less than 15 runes

apostrophe words with rune lengths 2'1 and 3'1

words in cicadawords dictionary

words in wolfram english dictionary

words in scrabble dictionary

words in nltk.corpus.brown.words() dictionary

words in nltk.corpus.inaugural.words() dictionary

words in nltk.corpus.words.words() dictionary

Method For N-grams

For all files found in below links select ngrams that,

only contain words in pre-computed 1-gram lists

AND

have a rune word length that matches words in an LP sentence.

http://storage.googleapis.com/books/ngrams/books/20200217/eng/eng-2-ngrams_exports.html http://storage.googleapis.com/books/ngrams/books/20200217/eng/eng-3-ngrams_exports.html http://storage.googleapis.com/books/ngrams/books/20200217/eng/eng-4-ngrams_exports.html http://storage.googleapis.com/books/ngrams/books/20200217/eng/eng-5-ngrams_exports.html

About

private

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Mathematica 100.0%