- Project Overview
- Project Outline
- Training Dataset
- Machine Learning Model
- File Structure
- Requirements
- Running Process
- Conclusion
- Acknowledgements
Disaster response reduces potential losses from disasters and provides appropriate help to disaster victims. It is a continuous process in which governments, corporations, and civil society prepare for and mitigate the effects of catastrophes. Appropriate action at every disaster stage results in higher readiness, better warnings, and decreased susceptibility.
Social media applications are among the best sources for a quick overview of what is going on around the world, but it is difficult to sift through everything posted online. This project aims to help governments filter millions of social media messages into categories using a supervised machine learning model. The model is trained on the Figure Eight dataset to sort messages into the right categories so that governments can respond to disasters quickly.
This section explains all three parts of this project, from cleaning the data to deploying the model in the Flask app.
The Extract, Transform and Load (ETL) pipeline prepares the dataset for the machine learning pipeline. It works as follows (a sketch appears after the list):
- Extract the messages and their categories from the CSV files
- Clean and merge the messages and categories into one data frame
- Save the data frame to an SQLite database
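The actual implementation lives in data/process_data.py; the snippet below is only a minimal sketch of these three steps, assuming the Figure Eight category encoding (strings like `related-1;request-0;...`) and a hypothetical table name `messages`:

```python
import pandas as pd
from sqlalchemy import create_engine

# Extract: load the two CSV files and merge them on their shared id column
messages = pd.read_csv("disaster_messages.csv")
categories = pd.read_csv("disaster_categories.csv")
df = messages.merge(categories, on="id")

# Transform: split the single "categories" string into one binary column per class
cats = df["categories"].str.split(";", expand=True)
cats.columns = cats.iloc[0].str.rsplit("-", n=1).str[0]
for col in cats.columns:
    cats[col] = cats[col].str.rsplit("-", n=1).str[1].astype(int)
df = pd.concat([df.drop(columns="categories"), cats], axis=1).drop_duplicates()

# Load: save the cleaned data frame into an SQLite database
engine = create_engine("sqlite:///DisasterResponse.db")
df.to_sql("messages", engine, index=False, if_exists="replace")
```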
The Machine Learning (ML) pipeline creates the machine learning model. It works as follows (see the sketch after the list):
- Load the dataset from the SQLite database
- Create a machine learning pipeline that tokenizes the messages and trains an SVM model on the training dataset
- Evaluate the model on the testing dataset
- Save the model as a pickle file
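train_classifier.py implements these steps; the following is a minimal sketch under the same assumptions as the ETL sketch above (a `messages` table whose label columns start at index 4), not the exact production code:

```python
import pickle

import pandas as pd
from sqlalchemy import create_engine
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import Pipeline
from sklearn.svm import LinearSVC

# Load the cleaned dataset produced by the ETL pipeline
engine = create_engine("sqlite:///../data/DisasterResponse.db")
df = pd.read_sql_table("messages", engine)
X = df["message"]
Y = df.iloc[:, 4:]  # the 36 binary label columns

X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.2, random_state=42)

# Vectorize the text (tokenization + TF-IDF), then train one linear SVM per label
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", MultiOutputClassifier(LinearSVC())),
])
pipeline.fit(X_train, Y_train)
Y_pred = pipeline.predict(X_test)

# Save the trained model as a pickle file
with open("classifier.pkl", "wb") as f:
    pickle.dump(pipeline, f)
```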
The Flask web app deploys the trained machine learning model on a website and lets the user make predictions with it.
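The real run.py renders the master.html and go.html templates; purely as an illustration of the idea, here is a stripped-down sketch with a hypothetical `/predict` endpoint:

```python
import pickle

from flask import Flask, jsonify, request

app = Flask(__name__)

# Load the trained pipeline saved by train_classifier.py
with open("../models/classifier.pkl", "rb") as f:
    model = pickle.load(f)

@app.route("/predict")
def predict():
    # Example: /predict?q=we+need+water+and+food
    message = request.args.get("q", "")
    labels = model.predict([message])[0].tolist()
    return jsonify({"message": message, "labels": labels})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=3001)
```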
The cleaned training dataset contains more than 26K labeled messages spanning 36 classes, such as related, offer, food, water, and electricity. The following image shows the classes in the dataset:
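To reproduce the per-class counts shown there, a quick sketch (reusing the assumed `messages` table from the ETL sketch above):

```python
import pandas as pd
from sqlalchemy import create_engine

# Count how many messages are tagged with each of the 36 classes
engine = create_engine("sqlite:///data/DisasterResponse.db")
df = pd.read_sql_table("messages", engine)
print(df.iloc[:, 4:].sum().sort_values(ascending=False))
```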
The machine learning model was built using LinearSVC from the scikit-learn library. The model's accuracy, calculated with NumPy's mean, was 95%.
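Concretely, that calculation can be reproduced from the ML sketch above by comparing predictions to the test labels element-wise and averaging (again only a sketch, reusing `Y_pred` and `Y_test` from that snippet):

```python
import numpy as np

# Fraction of individual label predictions that match the ground truth,
# averaged over all test messages and all 36 classes
accuracy = np.mean(Y_pred == Y_test.values)
print(f"Accuracy: {accuracy:.2%}")
```

Note that averaging over all 36 binary labels tends to produce high values, since most labels are 0 for most messages.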
```
├── app                          # Website folder
│   ├── run.py                   # Responsible for running the website
│   └── templates
│       ├── go.html              # Responsible for showing the results
│       └── master.html          # The main page
│
├── data
│   ├── disaster_categories.csv  # Categories dataset
│   ├── disaster_messages.csv    # Messages dataset
│   ├── DisasterResponse.db      # The cleaned dataset as an SQLite database
│   └── process_data.py          # Responsible for preparing the dataset
│
├── models
│   ├── classifier.pkl           # The SVM model
│   └── train_classifier.py     # Responsible for creating the machine learning model
│
├── readme_images                # All images for this README
│   ├── dataset.png
│   └── website_example.png
└── README.md                    # Readme file
```
In order to run this project, you must have Python 3 installed on your machine, along with all the libraries listed in requirements.txt. Run the following command to install them:

```
pip3 install -r requirements.txt
```
This section explains how to run each part of this project from the command prompt or terminal.

You must be inside the `data` directory to run this command:

```
python3 process_data.py disaster_messages.csv disaster_categories.csv <database_name>.db
```
You must be inside the `models` directory to run this command:

```
python3 train_classifier.py ../data/<database_name>.db <model_name>.pkl
```
You must be inside the `app` directory to run this command:

```
python3 run.py
```

The website will then be available at http://0.0.0.0:3001.
In conclusion, catastrophes are horrible if we are not properly prepared to deal with them, so a system that consistently delivers correct warnings helps provide early notice of a potential disaster and reduces potential losses. This system was built using scikit-learn and achieved 95% accuracy, but that does not mean it is the best model. Testing different approaches, such as a recurrent neural network, may give better results, so feel free to fork this repository and experiment with other solutions.
I would like to express my appreciation to Misk Academy and Udacity for their amazing work on the data science course and the support they gave us while building this project.