Skip to content

This project applies K-means clustering to group cryptocurrencies based on 24-hour and 7-day price changes. It also investigates the impact of dimensionality reduction using PCA on clustering outcomes.

Notifications You must be signed in to change notification settings

Asalvs/CryptoClustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crypto Clustering

Overview

The Crypto Clustering project aims to predict if cryptocurrencies are affected by 24-hour or 7-day price changes using unsupervised learning techniques, specifically K-means clustering. Additionally, the project explores the impact of dimensionality reduction using Principal Component Analysis (PCA) on clustering.

Steps

  1. Load and preprocess the data.
  2. Scale the data using StandardScaler.
  3. Find the best value for k using the elbow method.
  4. Cluster cryptocurrencies with K-means using the original scaled data.
  5. Perform PCA to reduce the features to three principal components.
  6. Find the best value for k using the PCA data.
  7. Cluster cryptocurrencies with K-means using the PCA data.
  8. Visualize and compare the results using hvPlot.

Results

The project includes the following visualizations:

  1. Elbow curve for the original data.

bokeh_plot (5)

  1. Elbow curve for the PCA data.

bokeh_plot (6)

  1. Scatter plot of cryptocurrency clusters based on the original data.

bokeh_plot (8)

  1. Scatter plot of cryptocurrency clusters based on the PCA data.

bokeh_plot (9)

Conclusion

The project analyzes the impact of using fewer features on clustering the data using K-means. Comparing the clustering results of the original data and the PCA data helps to understand the effect of dimensionality reduction on the clustering process.

Dependencies

  • Python
  • pandas
  • NumPy
  • scikit-learn
  • hvPlot

About

This project applies K-means clustering to group cryptocurrencies based on 24-hour and 7-day price changes. It also investigates the impact of dimensionality reduction using PCA on clustering outcomes.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published