GitHub - Prarabdha14/Customer-Segmentation-with-K-Means-Clustering-and-Silhouette-Analysis: This repository explores customer segmentation on the Mall Customer Dataset using the K-Means clustering algorithm. Silhouette analysis is employed to determine the optimal number of clusters for the given dataset.

Advantages of Silhouette Analysis:

Optimal Cluster Number: Helps determine the optimal number of clusters for a given dataset by evaluating the quality of the clustering. Cluster Validity: Provides a measure of how well-separated the clusters are and how well-defined each data point is within its assigned cluster. Model Selection: Guides the selection of the most appropriate number of clusters for K-Means and other clustering algorithms.

Key Features:

Data Loading and Preprocessing: Includes data loading, handling missing values, and feature scaling.

K-Means Clustering: Implements K-Means clustering with varying numbers of clusters.

Silhouette Analysis: Calculates the Silhouette score for different cluster numbers to identify the optimal number of clusters.

Visualization: Visualizes the clusters using scatter plots and explores the characteristics of each customer segment.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Silhouette_score_analysis.ipynb		Silhouette_score_analysis.ipynb
dataset		dataset

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

Prarabdha14/Customer-Segmentation-with-K-Means-Clustering-and-Silhouette-Analysis

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages