The model is used to classify a comment into multiple classes.
The dataset used: https://www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge
The output generated using the code achieves the following scores: Private Score: 0.97747 Public Score: 0.97584
The output files and other resources can be found here: https://drive.google.com/drive/folders/14yZO6mAVwBk0vqPN2KJC4ME4OKrpE_CN?usp=share_link
Helpful Resources:
https://machinelearningmastery.com/clean-text-machine-learning-python/
https://www.analyticsvidhya.com/blog/2022/01/text-cleaning-methods-in-nlp/