This is a training project that analyses disaster data from Figure Eight and builds a model that classifies disaster messages. The project consists of three stages (an ETL pipeline, an ML pipeline, and a web application) that load and clean the raw data, train a classifier on it, and serve the results as a web app.
The code in this repository is written in HTML and Python 3 and requires the following Python packages: json, plotly, pandas, nltk, flask, sklearn, sqlalchemy, sys, numpy, re, pickle, warnings.
-
Run the following commands in the project's root directory to set up your database and model.
- To run the ETL pipeline that cleans the data and stores it in a database:
python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db
- To run the ML pipeline that trains the classifier and saves it as a pickle file:
python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl
-
Run the following command in the app's directory to run your web app.
python run.py
-
Go to http://0.0.0.0:3001/
Actions: load and parse the datasets, clean the data, and store it in an SQLite database.
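The ETL stage can be sketched as follows. This is a minimal illustration, not the actual process_data.py: the column names ("id", "message", "categories") and the "related-1;request-0" encoding of the categories column are assumptions based on the Figure Eight dataset layout.

```python
# Hedged sketch of the ETL stage: merge, clean, and store in SQLite.
import pandas as pd
from sqlalchemy import create_engine

def clean_data(messages, categories):
    df = messages.merge(categories, on="id")
    # Split the single "categories" string into one column per category.
    cats = df["categories"].str.split(";", expand=True)
    cats.columns = [c.split("-")[0] for c in cats.iloc[0]]
    for col in cats.columns:
        cats[col] = cats[col].str[-1].astype(int)  # keep the 0/1 flag
    df = pd.concat([df.drop(columns=["categories"]), cats], axis=1)
    return df.drop_duplicates()

# Toy stand-ins for disaster_messages.csv and disaster_categories.csv.
messages = pd.DataFrame({"id": [1], "message": ["we need water"]})
categories = pd.DataFrame({"id": [1], "categories": ["related-1;request-0"]})
df = clean_data(messages, categories)

# Store the cleaned data (the real script writes to DisasterResponse.db).
engine = create_engine("sqlite:///:memory:")
df.to_sql("messages", engine, index=False, if_exists="replace")
```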
Actions: load the data from the SQLite database, split it into training and test sets, build a text-processing and classification pipeline, train and tune a model using GridSearchCV, and export the final model as a pickle file.
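A condensed sketch of that pipeline is shown below. The tokenizer, estimator, and parameter grid in train_classifier.py may differ; the toy data and the RandomForest/TfidfVectorizer choices here are illustrative assumptions.

```python
# Hedged sketch of the ML stage: build a pipeline, tune it with
# GridSearchCV, and pickle the best estimator.
import pickle
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multioutput import MultiOutputClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Toy corpus with two categories (the real data has 36).
X = ["we need water", "fire downtown", "send food", "flood here"] * 5
Y = [[1, 0], [0, 1], [1, 0], [0, 1]] * 5

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", MultiOutputClassifier(RandomForestClassifier(random_state=0))),
])
params = {"clf__estimator__n_estimators": [10, 20]}
cv = GridSearchCV(pipeline, params, cv=2)

X_train, X_test, y_train, y_test = train_test_split(X, Y, random_state=0)
cv.fit(X_train, y_train)

# Export the tuned model, as the real script does for models/classifier.pkl.
with open("classifier.pkl", "wb") as f:
    pickle.dump(cv.best_estimator_, f)
```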
Actions: display the visualizations (the app accepts messages from users and returns classification results for 36 categories of disaster events).
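The classification endpoint can be sketched with Flask as below. The route name, query parameter, and stand-in classifier are assumptions; the real app in run.py loads the pickled pipeline and renders Plotly visualizations as well.

```python
# Minimal sketch of the web stage: accept a message, return category labels.
from flask import Flask, request

app = Flask(__name__)

CATEGORIES = ["related", "request"]  # the real app covers 36 categories

def classify(message):
    # Stand-in for model.predict(); the real app uses the trained pipeline
    # loaded from classifier.pkl.
    return {c: 0 for c in CATEGORIES}

@app.route("/go")
def go():
    query = request.args.get("query", "")
    return {"query": query, "labels": classify(query)}
```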
The datasets are highly imbalanced, with very few positive examples for some message categories. This results in low recall despite high accuracy.
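A small example makes the accuracy/recall gap concrete (the 5% positive rate is illustrative, not taken from the actual data):

```python
# On an imbalanced category, a model that always predicts "negative"
# scores high accuracy yet catches none of the positives.
from sklearn.metrics import accuracy_score, recall_score

y_true = [1] * 5 + [0] * 95   # 5 positive messages out of 100
y_pred = [0] * 100            # degenerate "always negative" classifier

acc = accuracy_score(y_true, y_pred)   # 0.95 despite being useless
rec = recall_score(y_true, y_pred)     # 0.0: no positives recovered
```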
This app should not be used for real-world prediction unless more data is collected.
Additionally, the model training time can be improved.
This app was developed as part of the Udacity Data Scientist Nanodegree.