Write a Python program that learns a TF-IDF model from spam_train.csv and uses that model to compute TF-IDF vectors for spam_test.csv.
Your program will then train an SVM model (LinearSVC in sklearn) on the TF-IDF vectors of spam_train.csv and predict spam or ham for each entry in spam_test.csv.
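The two steps above can be sketched as follows. This is a minimal illustration, not a reference solution: the function name train_and_predict and the toy messages are made up for the example, and the default settings of TfidfVectorizer and LinearSVC are an assumption (the assignment does not specify any parameters). The key point it shows is that the TF-IDF vocabulary is fit on the training text only and then reused to transform the test text.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC


def train_and_predict(train_texts, train_labels, test_texts):
    """Fit TF-IDF and LinearSVC on the training data, then label the test texts."""
    vectorizer = TfidfVectorizer()  # default settings are an assumption
    # Learn the vocabulary and IDF weights from the training text only.
    train_vectors = vectorizer.fit_transform(train_texts)
    # Reuse the learned model for the test text (transform, not fit_transform).
    test_vectors = vectorizer.transform(test_texts)

    classifier = LinearSVC()
    classifier.fit(train_vectors, train_labels)
    return list(classifier.predict(test_vectors))


# Toy data invented for illustration; the real input comes from the CSV files.
toy_train_texts = [
    "win a free prize now",
    "free cash prize, call now",
    "are we still meeting for lunch",
    "see you at the meeting tomorrow",
]
toy_train_labels = ["spam", "spam", "ham", "ham"]
toy_test_texts = ["call now to claim your free prize", "lunch meeting tomorrow"]

toy_predictions = train_and_predict(toy_train_texts, toy_train_labels, toy_test_texts)
print(toy_predictions)
```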
Your model should achieve at least 95% accuracy to get full points.
Your program takes two files, spam_train.csv and spam_test.csv, as input and outputs the predicted labels along with their accuracy on spam_test.csv.
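Reading the labeled files and computing accuracy could look like the sketch below. The column layout of the CSV files is not described here, so the helper read_labeled_csv assumes (hypothetically) that the label is in the first column and the message text in the second, with no header row; adjust label_col and text_col to match the actual files. Both helper names are illustrative.

```python
import csv
import os
import tempfile


def read_labeled_csv(path, label_col=0, text_col=1):
    """Return (labels, texts) from a CSV file.

    The column positions are an assumption about the file layout.
    """
    labels, texts = [], []
    with open(path, newline="", encoding="utf-8") as handle:
        for row in csv.reader(handle):
            if len(row) <= max(label_col, text_col):
                continue  # skip blank or short rows
            labels.append(row[label_col])
            texts.append(row[text_col])
    return labels, texts


def accuracy(true_labels, predicted_labels):
    """Fraction of predictions that match the true labels."""
    if not true_labels:
        raise ValueError("cannot compute accuracy on an empty label list")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Small demonstration with a temporary file standing in for spam_test.csv.
with tempfile.NamedTemporaryFile(
    "w", suffix=".csv", delete=False, newline="", encoding="utf-8"
) as tmp:
    csv.writer(tmp).writerows(
        [["ham", "see you at lunch"], ["spam", "win a free prize, call now"]]
    )
    demo_path = tmp.name

demo_labels, demo_texts = read_labeled_csv(demo_path)
os.remove(demo_path)

# Pretend the classifier predicted "ham" for both rows: one of two is correct.
demo_accuracy = accuracy(demo_labels, ["ham", "ham"])
print(f"accuracy: {demo_accuracy:.2%}")  # prints "accuracy: 50.00%"
```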
Directories: /afs/cad/courses/ccs/s20/cs/675/002/. For example, if your UCID is abc12, copy your programs into /afs/cad/courses/ccs/s20/cs/675/002/abc12.
Your completed program is due before 1 pm on May 11th, 2020.
Run the following command:
python3 project4.py spam_train.csv spam_test.csv
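One way to accept the two file names in the order used by the command above is with argparse from the standard library. This is only a sketch: the function name parse_args and the argument names train_csv and test_csv are illustrative choices, and reading sys.argv directly would work equally well.

```python
import argparse


def parse_args(argv=None):
    """Parse the two positional file arguments.

    Passing argv=None makes argparse read the real command line (sys.argv[1:]).
    """
    parser = argparse.ArgumentParser(
        description="TF-IDF + LinearSVC spam classifier"
    )
    parser.add_argument("train_csv", help="training file, e.g. spam_train.csv")
    parser.add_argument("test_csv", help="test file, e.g. spam_test.csv")
    return parser.parse_args(argv)


# Simulate: python3 project4.py spam_train.csv spam_test.csv
demo_args = parse_args(["spam_train.csv", "spam_test.csv"])
print(demo_args.train_csv, demo_args.test_csv)
```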