The objective of this research is to accurately classify what forum a Reddit post came from, given the text of the post. To predict which forum a post came from, we trained two variations of Naive Bayes classifiers- Multinomial and Bernoulli.
Code is in redditForums.py, report is redditForums.pdf.
r/WorldNews https://www.kaggle.com/rootuser/worldnews-on-reddit
r/Jokes https://www.kaggle.com/cuddlefish/reddit-rjokes
r/Coronavirus https://www.kaggle.com/manish1578/rcoronavirus-dataset-from-redditcom
r/AndroidDev https://www.kaggle.com/viktorarsovski/randroiddev-data-20152019