This repository contains several models that analyze the opinions left by travelers on Twitter. The data comes from a Kaggle competition run by a Spanish airline. The data has been preprocessed, and the following techniques have been tried for classifying it:
- Bag of Words (TF-IDF): a baseline pipeline sketch follows this list.
- Random Forest
- GaussianNB
- XGBoost
- Word embeddings (GloVe)
- CNN with kernel size = 1: there is a video explaining this technique (see also the PyTorch sketch after this list).
- FastText: a simple and efficient model for text classification.
- BETO: a BERT model pretrained on Spanish.
- GRUs: gated recurrent units.
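To make the classical approach concrete, here is a minimal sketch of the Bag-of-Words pipeline: TF-IDF features fed into one of the classifiers listed above. The example tweets and hyperparameters are illustrative assumptions, not values taken from the repository.

```python
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.ensemble import RandomForestClassifier

# Toy tweets and sentiment labels (1 = positive, 0 = negative); the real data
# comes from the Kaggle competition and is preprocessed beforehand.
tweets = [
    "El vuelo salió a tiempo, muy buena atención",
    "Perdieron mi maleta, pésimo servicio",
]
labels = [1, 0]

# TF-IDF bag of words followed by a classifier; GaussianNB or XGBoost can be
# swapped in for the Random Forest (GaussianNB needs dense features).
model = Pipeline([
    ("tfidf", TfidfVectorizer(ngram_range=(1, 2), max_features=20000)),
    ("clf", RandomForestClassifier(n_estimators=300, random_state=42)),
])
model.fit(tweets, labels)
print(model.predict(["Excelente experiencia con la aerolínea"]))
```

The CNN with kernel size = 1 can be sketched in PyTorch as a learned per-token projection followed by global max pooling. The layer sizes below are assumptions, not the configuration actually used in the repository.

```python
import torch
import torch.nn as nn

class KernelOneCNN(nn.Module):
    """Sentiment classifier: embeddings -> Conv1d(kernel_size=1) -> max pool -> linear."""

    def __init__(self, vocab_size, embed_dim=100, num_filters=128, num_classes=2):
        super().__init__()
        # The embedding layer could be initialized with pretrained GloVe vectors.
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # A kernel of size 1 acts as a per-token projection shared across positions.
        self.conv = nn.Conv1d(embed_dim, num_filters, kernel_size=1)
        self.fc = nn.Linear(num_filters, num_classes)

    def forward(self, token_ids):
        x = self.embedding(token_ids)      # (batch, seq_len, embed_dim)
        x = x.transpose(1, 2)              # (batch, embed_dim, seq_len)
        x = torch.relu(self.conv(x))       # (batch, num_filters, seq_len)
        x = x.max(dim=2).values            # global max pooling over the sequence
        return self.fc(x)                  # class logits

# Example forward pass with random token ids
model = KernelOneCNN(vocab_size=20000)
logits = model(torch.randint(1, 20000, (8, 40)))  # batch of 8 tweets, 40 tokens each
```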
All models have undergone a fine-tuning process to get the best performance out of them.
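As an illustration of what such a tuning step can look like for the classical models, here is a sketch using GridSearchCV over a TF-IDF + XGBoost pipeline; the search space below is an assumption, not the grid actually used in the repository.

```python
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from xgboost import XGBClassifier

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", XGBClassifier()),
])

# Hypothetical search space; the grids used in the repository may differ.
param_grid = {
    "tfidf__ngram_range": [(1, 1), (1, 2)],
    "clf__max_depth": [3, 6],
    "clf__n_estimators": [200, 400],
}

search = GridSearchCV(pipeline, param_grid, cv=5, scoring="accuracy", n_jobs=-1)
# search.fit(tweets, labels)   # tweets/labels: the preprocessed texts and sentiment labels
# print(search.best_params_, search.best_score_)
```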
Figure 1: Results of the experiment for a balanced Dataset
Figure 2: Results of the experiment for an unbalanced Dataset
As can be seen in the figures, the connectionist approach (deep learning) gives better results on both datasets, and with balanced data the models reach 80% accuracy. CNN and FastText are fast and effective methods; despite being more powerful, the transformers fall below these two. I suspect the reason is that transformers are designed for large volumes of data, so with as few samples as here (only 7,000), they still give good results, but not as impressive as in other applications.
- Python 🐍
- Sklearn 🧮
- PyTorch ❤️
- Hugging Face 🤖