GitHub - alexantoniogonzalez2/predicting-twitter-fake-connections: A machine learning model to predict fake connections in Twitter data.

Who are my friends?

A machine learning model to predict fake connections in Twitter data.

Requirements

The complete version of this project is available in this GitHub repository.
An environment that can run a Jupyter Notebook, for example the one provided by the Jupyter Project. Jupyter Notebook itself requires Python.

Compatibility

The libraries needed for this project are specified in the Jupyter Notebooks. The main versions used are:

Python: 3.7.5
TensorFlow: 2.0.0
Keras: 2.2.4-tf
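
To check whether a local environment matches the versions listed above, a small script along these lines may help. It is only a sketch: the names `EXPECTED`, `installed_versions` and `version_mismatches` are illustrative and not part of the repository, and it assumes that TensorFlow exposes its bundled Keras version as `tf.keras.__version__`, as TensorFlow 2.0 does.

```python
import sys

# Versions copied from the list above; treated here as the expected environment.
EXPECTED = {"python": "3.7.5", "tensorflow": "2.0.0", "keras": "2.2.4-tf"}


def installed_versions():
    """Return the versions found in the current environment (None if missing)."""
    found = {"python": ".".join(str(part) for part in sys.version_info[:3])}
    try:
        import tensorflow as tf  # imported lazily so the check works without it
        found["tensorflow"] = tf.__version__
        # Assumption: the bundled Keras reports its version on tf.keras.
        found["keras"] = getattr(tf.keras, "__version__", None)
    except ImportError:
        found["tensorflow"] = None
        found["keras"] = None
    return found


def version_mismatches(found, expected=EXPECTED):
    """Map each differing component to an (expected, found) pair."""
    return {
        name: (wanted, found.get(name))
        for name, wanted in expected.items()
        if found.get(name) != wanted
    }
```

Calling `version_mismatches(installed_versions())` returns an empty dictionary when the environment matches exactly. Newer versions may well work too; the comparison is deliberately strict and only reports differences.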

Structure

'data' folder: Contains the raw data available for this project. The file 'train.txt' is not included; it is available from this Kaggle competition.
'data_processing': Contains the main files to preprocess the raw data and generate files with a proper structure.
'data_generated': Contains the files generated in data preprocessing.
'data_models': Contains the dataset ready to be used by the machine learning models.
'predictions': Contains the predictions made by the different models.
A research report is available here.
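
Because 'train.txt' has to be downloaded separately, it can be useful to verify the layout above before running the notebooks. The following is a minimal sketch, assuming 'train.txt' belongs in the 'data' folder alongside the other raw data; `EXPECTED_FOLDERS` and `check_layout` are hypothetical names, not files or functions from this repository.

```python
from pathlib import Path

# Folder names taken from the structure described above.
EXPECTED_FOLDERS = [
    "data",
    "data_processing",
    "data_generated",
    "data_models",
    "predictions",
]


def check_layout(repo_root):
    """Return (missing_folders, has_train_file) for a local checkout."""
    root = Path(repo_root)
    missing = [name for name in EXPECTED_FOLDERS if not (root / name).is_dir()]
    # Assumption: the Kaggle file is placed at data/train.txt.
    has_train_file = (root / "data" / "train.txt").is_file()
    return missing, has_train_file
```

For example, `check_layout(".")` run from the repository root reports which folders are absent and whether the Kaggle file has been placed yet.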

Some key files

data_models/dataset7.csv contains the most up-to-date dataset.
models/neural_netwoks contains one of the most relevant models we obtained.
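
To get a first look at the dataset file in 'data_models' without loading it fully, something like the sketch below could be used. The column names of the dataset are not documented here, so the code only reports whatever header it finds; it assumes the file is comma-separated with a header row, and `preview_csv` is an illustrative helper rather than part of the project.

```python
import csv
from itertools import islice


def preview_csv(path, n_rows=5):
    """Return the column names and the first n_rows rows of a CSV file."""
    with open(path, newline="") as handle:
        # Assumption: the first line of the file is a header row.
        reader = csv.DictReader(handle)
        rows = list(islice(reader, n_rows))
        return reader.fieldnames, rows
```

The function takes the path as an argument, so it can be pointed at any of the generated files, and it reads only the requested number of rows.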

Further Analysis

A full report detailing our findings can be found here.

Context

This work is part of the subject COMP90051 Statistical Machine Learning, Semester 2, 2020, at The University of Melbourne. Our group consisted of Alex González, Yuqing Xiao and Yee Hean Chuah. Our team name in the Kaggle competition was 50 cents.

Folders and files (42 commits)

data
data_generated
data_models
data_processing
models
predictions
.gitignore
Ego Network Analysis Research Report.pdf
README.md
StatsML Project1 v1.ipynb
Untitled.ipynb
Yuqing_nn.ipynb
preprocess.ipynb
preprocess.py
prioritydict.py
regression.ipynb
test_data 2.csv
test_data.csv
yuqing_model.ipynb
