This repository consists of a Machine Learning class final project collaboratively prepared by myself and three other classmates (Max Eddy, Matthew Weglicki, Julio Soldevilla). The goal of the project was to use Machine Learning models from the course, both supervised and unsupervised, to classify the top 5 senders of Hillary Clinton's released emails after the enactment o the Freedom of Information Act.
About:
• MasterFile.R: All the code needed to do processing of data
• FunctionsFile.R: User-defined functions to help prepare data, functions we use in the statistical analysis of Random Forests, SVM and K-means cluster analysis and additional functions that aid us in the creation of illustrative plots.
• Project Report WriteUp.pdf: Containts a holistic report of the results of our R code, machine learning algorithms, and training and test set analysis and observations.