txtcrawl This is material pertaining to crawling the web for text for the purpose of compiling a dataset for experimentation in language model training.