In this project, we build a plagiarism detector that examines a text file and performs binary classification, labeling the file as either plagiarized or not depending on how similar it is to a provided source text.
This project uses the following software and Python libraries:
- Python
- NumPy
- Pandas
- scikit-learn
- Amazon SageMaker
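
To make the idea of classifying a file by its similarity to a source text concrete, here is a minimal sketch. It is not necessarily the feature set or model this project uses. It assumes word n-gram containment as the similarity measure and a fixed cutoff in place of a trained classifier. The names `containment`, `is_plagiarized`, and the `threshold` value are hypothetical and chosen only for illustration.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer


def containment(answer_text: str, source_text: str, n: int = 1) -> float:
    """Fraction of the answer's word n-grams that also appear in the source.

    Assumption: containment is one possible similarity feature; the project
    may use different or additional features.
    """
    vectorizer = CountVectorizer(analyzer="word", ngram_range=(n, n))
    counts = vectorizer.fit_transform([answer_text, source_text]).toarray()
    answer_counts, source_counts = counts[0], counts[1]

    total = answer_counts.sum()
    if total == 0:
        # The answer contributed no n-grams, so there is nothing to compare.
        return 0.0

    # Shared n-grams: element-wise minimum of the two count vectors.
    shared = np.minimum(answer_counts, source_counts).sum()
    return float(shared / total)


def is_plagiarized(answer_text: str, source_text: str, threshold: float = 0.5) -> bool:
    """Binary label from a fixed cutoff (hypothetical stand-in for a trained model)."""
    return containment(answer_text, source_text) >= threshold


if __name__ == "__main__":
    source = "the quick brown fox"
    print(containment("the quick red fox", source))
    print(is_plagiarized("the quick red fox", source))
```

In practice, a feature like this would more likely be computed for several n-gram sizes and passed to a classifier trained on labeled examples, rather than compared against a hand-picked threshold.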