Churn Analysis (Churn analysis summary information)

Churn analysis is an analysis method used to determine the churn rate of a business and understand its causes. It is often used in industries such as subscription-based services, telecommunications, banking, and retail.
Churn analysis calculates the rate at which existing customers abandon the business over a given period (usually one year). This analysis is used to understand the reasons for customer churn and why customers leave the business. This knowledge helps businesses develop strategies to increase customer loyalty.
Methods used for Churn analysis may include analysis of data on customer behavior, customer satisfaction surveys, customer feedback, and use of customer relationship management data

Project Overview

Purpose of the project I did: I developed a model with XGBoost by performing churn analysis so that banks can predict whether their customers will abandon them. And the accuracy score value of the model I developed is 0.90.
The variables in the dataset I used for the project are as follows:
- CreditScore : Customer's credit score
- Geography : Customer's country of residence
- Gender : Gender of the customer
- Age : Customer's Age
- Tenure : Time of working with the customer bank (in years)
- Balance : Balance in the customer's account
- NumOfProducts : Number of products used by the customer from the bank
- HasCrCard : Whether the customer has a credit card
- IsActiveMember : Whether the customer is an active customer of the bank (0: not, 1: yes)
- EstimatedSalary : Client's estimated salary
- Exited : Whether the customer has left the bank (0: no, 1: yes)
Built a client facing API using streamlit

Code and Resources Used:

Python Version : 3.10.9
Packages : matplotlib ,seaborn,pandas , imblearn,sklearn , joblib , lightgbm, xgboost and lazypredict

Data Cleaning

I removed the following columns that I thought were unnecessary: "RowNumber","CustomerId","Surname"
I used get_dummies function for categorical variables
I used a pie chart to visualize the churn and non-churn data proportions in the dataset.

Since there is an 80 - 20 ratio among the data, I will produce synthetic data with SMOTEC to eliminate this skewed distribution.

I created a retirement column based on the retirement ages of the countries
I grouped the Age and CreditScore values
I created new variables. The new variables are:
- EstimatedSalary / Age
- CreditScore / Age
- NumOfProducts / Tenure
- EstimatedSalary / CreditScore
- EstimatedSalary / Balance
- EstimatedSalary / Tenure
- EstimatedSalary / NumOfProducts
- CreditScore / Tenure
I had the boxplot plot drawn to detect outliers in the variables.

I applied the ROBUSTSCALER operation for variables with outliers.

EDA

I used a pie chart to visualize the percentages of bank customers' genders

I had a pie chart drawn to get information about the age distribution of the customers in the bank.

I had a pie chart drawn to get information about the credit scores of the customers in the bank.

I used a pie chart to see what percentage of customers at the bank have credit cards

I used a pie chart to visualize the percentages of customers in the bank by country

I calculated the number of churns of women and men in countries with the Groupby method.

I use groupby to calculate which age women and men are the most churn

Model Building

I used the lazypredict library to find the model with the highest success in the shortest time

I used the XGBoost algorithm in line with the graphic above.

APP

Since it performs many operations through the parameters entered during the application, I created a file called Module.py that contains all the methods that perform these operations. And this library is imported into the app application, and the entered data is processed to be converted into a format suitable for the model

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
App.py		App.py
Churn_Predictions.csv		Churn_Predictions.csv
Data Cleaning.ipynb		Data Cleaning.ipynb
EDA Data Cleaning.csv		EDA Data Cleaning.csv
EDA.ipynb		EDA.ipynb
Model Building Data Cleaning.csv		Model Building Data Cleaning.csv
Model Building.ipynb		Model Building.ipynb
Module.py		Module.py
README.md		README.md
age_pie.png		age_pie.png
app.png		app.png
boxplot.png		boxplot.png
class_report.png		class_report.png
creditscore_pie.png		creditscore_pie.png
gender_pie.png		gender_pie.png
geo_pie.png		geo_pie.png
groupby_1.png		groupby_1.png
groupby_2.png		groupby_2.png
hascrcard_pie.png		hascrcard_pie.png
model.joblib		model.joblib
models.png		models.png
pie_after.png		pie_after.png
pie_before.png		pie_before.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Churn Analysis (Churn analysis summary information)

Project Overview

Code and Resources Used:

Data Cleaning

EDA

Model Building

APP

About

Releases

Packages

Languages

gamzeaslan/Customer_Churn_Analysis_App

Folders and files

Latest commit

History

Repository files navigation

Churn Analysis (Churn analysis summary information)

Project Overview

Code and Resources Used:

Data Cleaning

EDA

Model Building

APP

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages