Titanic Kaggle Challenge

This project aims to predict passenger survival on the Titanic using machine learning. The dataset is part of the Titanic: Machine Learning from Disaster competition on Kaggle.

Project Overview

In this project, we use the Titanic dataset to predict whether a passenger survived or not based on various features like age, sex, class, and the number of siblings/spouses aboard. The dataset consists of both categorical and numerical data, which we preprocess before training a machine learning model.

Key Steps:

Data Preprocessing: Clean and prepare the data by handling missing values, encoding categorical variables, and splitting data for training and testing.
Exploratory Data Analysis (EDA): Visualize and explore the dataset to gain insights into the relationships between the features and survival rate.
Model Training: Train a Random Forest Classifier model to predict survival based on the features in the dataset.
Evaluation: Evaluate the model's performance using accuracy score and confusion matrix.

Requirements

To run this project, you need the following Python libraries:

numpy
pandas
seaborn
matplotlib
scikit-learn

You can install the required libraries using the following command:

pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
README.md		README.md
requirements.txt		requirements.txt
titanic.ipynb		titanic.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Titanic Kaggle Challenge

Project Overview

Key Steps:

Requirements

About

Releases

Packages

Languages

hwong511/kaggle-titanic

Folders and files

Latest commit

History

Repository files navigation

Titanic Kaggle Challenge

Project Overview

Key Steps:

Requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages