Mood Classification of Songs Based on Lyrics

This project uses natural language processing (NLP) and machine learning techniques to classify songs into different mood categories based on their lyrics. The Jupyter Notebook contains all steps from data preprocessing to model evaluation, making it easy to follow and replicate.

Introduction

Song lyrics often reflect the mood or sentiment of a piece. This project analyzes song lyrics to classify them into predefined mood categories such as happy, sad, energetic, or calm. It leverages advanced NLP techniques and machine learning algorithms to achieve accurate predictions.

Features

Text Preprocessing: Includes tokenization, stop-word removal, stemming, and lemmatization.
Feature Extraction: Utilizes TF-IDF (Term Frequency-Inverse Document Frequency) and word embeddings.
Machine Learning Models: Implements classifiers like Logistic Regression, Support Vector Machines (SVM), and Neural Networks.
Interactive Visualizations: Confusion matrices and performance metrics are plotted for better understanding.

Dataset

The dataset includes song lyrics paired with mood labels.
Preprocessed to remove noise and ensure compatibility with NLP techniques.
Source: Publicly available lyric datasets or user-curated collections.

Technologies Used

Languages: Python (Jupyter Notebook)
Libraries: Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, NLTK, TensorFlow/Keras
Tools: Jupyter Notebook for interactive data exploration and analysis.

Notebook Structure

Data Preprocessing:
- Cleans and prepares the dataset for feature extraction.
Feature Extraction:
- Uses TF-IDF for traditional machine learning models.
- Implements word embeddings (e.g., Word2Vec, GloVe) for deep learning models.
Model Training:
- Trains Logistic Regression, SVM, and Neural Network models on the dataset.
Evaluation:
- Calculates accuracy, precision, recall, and F1-score.
- Visualizes results using confusion matrices.

Usage

Clone the repository:

git clone https://github.com/rajvi-patel-22/Mood-classification-of-songs-based-on-lyrics.git

Run the cells sequentially to preprocess data, train models, and evaluate results.

Results

Achieved high accuracy in mood classification using SVM with TF-IDF.
Neural networks demonstrated strong performance when using word embeddings for feature extraction.
Visualizations of confusion matrices provide detailed insights into model performance.

Future Scope

Extend the dataset to include multilingual song lyrics.
Integrate transformer-based models like BERT for improved contextual analysis.
Develop a web-based interface for real-time mood classification.

References

Natural Language Toolkit (NLTK) Documentation
Scikit-learn User Guide
TensorFlow/Keras Documentation
Research papers on sentiment analysis and mood classification.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Data		Data
CLASSIFICATION.ipynb		CLASSIFICATION.ipynb
Data_Preprocessing.ipynb		Data_Preprocessing.ipynb
MyClassification_artist.ipynb		MyClassification_artist.ipynb
README.md		README.md
stopwords.txt		stopwords.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mood Classification of Songs Based on Lyrics

Table of Contents

Introduction

Features

Dataset

Technologies Used

Notebook Structure

Usage

Results

Future Scope

References

About

Releases

Packages

Contributors 2

Languages

rajvi-patel-22/Moodify

Folders and files

Latest commit

History

Repository files navigation

Mood Classification of Songs Based on Lyrics

Table of Contents

Introduction

Features

Dataset

Technologies Used

Notebook Structure

Usage

Results

Future Scope

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages