The objective of this project is to demonstrate binary classification using various algorithms and explore their performance in determining whether short text messages are related to mental health or not. Additionally, we aim to gain insights into the types of mistakes made by these models. We leverage tools such as confusion matrices to analyze these errors and evaluate the effectiveness of our models.
Clone this repository and install the required dependencies using the following command:
git clone https://github.com/ManuelEspejo/NLP-Sentiment-Analysis-Mental-Health.git
- Open the
sentiment_analysis.ipynb
Jupyter Notebook. - Follow the step-by-step instructions provided in the notebook to perform the analysis.
- Modify the code or experiment with different models and techniques to enhance the analysis.
- Execute the notebook cells to see the results and insights.
The dataset used in this project consists of short text messages labeled as either mental health-related (1) or not considered (0). The dataset has been downloaded from kaggle. You can download it from here.
We experiment with various machine learning models, including Dense model, Convolutional Neural Networks and LSTM, to perform sentiment analysis. The notebook explores the implementation and comparison of these models for the text binary classification task.
To evaluate the performance of the models, we use common evaluation metrics such as accuracy, precision, recall, and F1-score.
The results of the sentiment analysis demonstrate the effectiveness of the applied machine learning models. We discuss the achieved accuracy and provide visualizations of evaluation metrics for a comprehensive understanding of the analysis.
Contributions to this project are welcome. If you have any suggestions or would like to enhance the analysis, please feel free to submit a pull request.
At the end of the notebook there is also a "next steps" section, in wich I have talked about some approaches that we could take with this notebook for further improve our models performance. Anyone is welcome to try one of those approaches to see the potential improvements, it would be great to see that.
This project is licensed under the MIT License.