udacity-datascience-disaster-response-pipeline

Description

This project in Collaboration with Figure Eight is part of the fullfillment of the Udacity Data Science NanoDegree. The dataset here is provided by Figure Eight and contains pre-labelled tweet and messages from real-life disaster events. This project is aim is to build a Natural Language Processing (NLP) model to categorize messages according to predefined Categories.

The Project is seperated into 3 main parts

ETL Pipeline for gathering the datasets and preparing them for the Maching learning Modeling step
ML Pipeline for building a NLP model and exporting saving into a database
Flask App for providing end user interactivity with the model and visualizations.

Dependencies

- Python 3.x.x+
- Machine Learning & ELT: Pandas, Numpy, Sciki-Learn
- Natural Language Processes: nltk
- Database: SQLalchemy (SQLite Database)
- Model Persistence: Pickle
- Web App and Data Visualization: Flask, Plotly

Installation

1. Clone the Repository 

    
2. Run ETL Pipeline
python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasteResponse.db

3. Run ML Pipeline

```python

python models/train_classifier.py data/DisasterResponse.db data/classifier.pkl

```

4. Run Web App
 ```python
 
 cd app
 python run.py
 
 ```
 5. Access webapp on ```http://0.0.0.0:3001/``` on your browser

File Descriptions

There are 3 main parts

data folder: Data
- ETL Pipeline Preparation.ipynb: ETL pipeline notebook
- process_data.py: Contain ETL pipeline python code for preparing data for ML pipeline
models folder : Contains machine learning files
- ML Pipeline Preparation.ipynb: ML pipeline notebook
- train_classifier.py: Contain python code for running ML Pipelinne
app: Web App and Visualizations run.py: Main Falsk app template folder: contain templates files

Results

A Model to run prediction on, after following installatin above the model is persisted to a pickle file in the data founder 'classifier.pkl'
After the was ran an average f1-score .94 was obtained, 94%
A web interface to test the model, inferfaces below

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
app		app
data		data
models		models
.gitignore		.gitignore
README.md		README.md
distribute_categories.png		distribute_categories.png
home_screen.png		home_screen.png
predict_result.png		predict_result.png
question_screen.png		question_screen.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents

Description

Dependencies

Installation

File Descriptions

Results

Fig1 - Home Screen

Fig2 - Enter Question

Fig3 - Predicted Results

Fig4 - Distributed Categories

Licensing, Authors, Acknowledgements

Author

udacity-datascience-disaster-response-pipeline

About

Releases

Packages

Languages

austin047/udacity-datascience-disaster-res-pipeline

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Description

Dependencies

Installation

File Descriptions

Results

Fig1 - Home Screen

Fig2 - Enter Question

Fig3 - Predicted Results

Fig4 - Distributed Categories

Licensing, Authors, Acknowledgements

Author

udacity-datascience-disaster-response-pipeline

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages