Skip to content

Boby024/fake_news_topic_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

fake_news_topic_detection

To allow the classifier to be better in classification (fake news detection and Topical Domain Classification):

  • Combining columns "headlne" and "content" from dataset and exclude row without content if these exist
  • data cleaning ( removing email address, hyperlinks, numbers, special characters and duplicate)
  • After testing Decision Tree, Random Forest and Multinomial Naive Bayes (with parameter such as unigram, bigram), we decide to use Multinomial Naive Bayes algorithm (using Scikit-learn library to classify text cleaned) based on its result

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages