Constructed an end-to-end predictive analytics pipeline using Python, Spark, and logistic regression to classify movies by genre. Combined Word2Vec embeddings with the classifier and achieved an F-score of 0.99. Represented in Kaggle ’20.