Hate Speech Classification

Problem Statement:

The rapid evolution of AI has brought challenges around bias, privacy, and security. We have observed users requesting sensitive information from consumer chatbot applications, so measures are needed to keep harmful or sensitive information from leaking to the public. Because the large language models (LLMs) behind these chatbots are trained on vast amounts of data from diverse sources, they may have encountered hate speech or sensitive information during training and can reproduce it. A safeguard is therefore needed that classifies whether a user's prompt could lead to the generation of hate speech.

Solution:

An LLM could solve this problem, but its deployment cost is hard to justify for a task as narrow as flagging hateful or harmful language. This project therefore focuses on a simpler, more cost-effective solution: a compact LSTM network that is easy to implement and needs far less compute than an LLM while still solving the problem effectively.
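
To give a rough sense of what "compact" means here, the sketch below counts the trainable parameters of a single-layer LSTM classifier (embedding, one LSTM layer, one sigmoid output unit). The vocabulary size, embedding width, and number of LSTM units are illustrative assumptions, not the values used in this project.

```python
def lstm_classifier_params(vocab_size: int, embed_dim: int, lstm_units: int) -> dict:
    """Count trainable parameters of Embedding -> LSTM -> Dense(1)."""
    # Embedding table: one vector of width embed_dim per vocabulary entry.
    embedding = vocab_size * embed_dim
    # An LSTM has 4 gates (input, forget, cell, output); each gate has an
    # input kernel, a recurrent kernel, and a bias vector.
    lstm = 4 * (embed_dim * lstm_units + lstm_units * lstm_units + lstm_units)
    # Single sigmoid output unit: one weight per LSTM unit plus a bias.
    dense = lstm_units + 1
    return {
        "embedding": embedding,
        "lstm": lstm,
        "dense": dense,
        "total": embedding + lstm + dense,
    }


# Assumed sizes, chosen only for illustration.
counts = lstm_classifier_params(vocab_size=50_000, embed_dim=100, lstm_units=100)
print(counts)
```

With these assumed sizes the model has about 5 million parameters, most of them in the embedding table. That is more than a thousand times fewer than a 7-billion-parameter LLM.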

Applications include:

Social Media Moderation: filter out hate speech in comments, posts, and private messages on social media platforms.
Customer Support: flag hate speech in messages that customers send to support chatbots.
Content Recommendation Systems: avoid recommending content that contains hate speech.
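
In each of these applications the classifier acts as a gate in front of some downstream action. A minimal sketch of such a gate follows. The function name `moderate`, the `score_fn` callback, the 0.5 threshold, and the `dummy_score` stand-in are all hypothetical and are not part of this repository; a real deployment would pass in the trained model's prediction function instead.

```python
from typing import Callable


def moderate(text: str, score_fn: Callable[[str], float], threshold: float = 0.5) -> dict:
    """Decide whether to block text, given a hate-speech probability."""
    score = score_fn(text)
    if not 0.0 <= score <= 1.0:
        raise ValueError("score_fn must return a probability in [0, 1]")
    # Block when the score reaches the threshold; otherwise let it through.
    return {"text": text, "score": score, "blocked": score >= threshold}


# Stand-in scorer for demonstration only (hypothetical). It reacts to a
# placeholder token rather than real language.
def dummy_score(text: str) -> float:
    return 0.9 if "<slur>" in text.lower() else 0.1


print(moderate("have a nice day", dummy_score))
print(moderate("you <slur>", dummy_score))
```

The threshold is the main tuning knob: lowering it blocks more borderline content at the cost of more false positives.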

Training and Prediction Pipeline:

Note: Code flow diagrams for each pipeline component are in the flowcharts folder.

Project Workflows

constants
config_entity
artifact_entity
components
pipeline
app.py
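
One plausible reading of this workflow is that each pipeline step takes a config entity as input and returns an artifact entity as output, with the pipeline chaining the steps together. The sketch below illustrates that pattern for a single step. Every name and value in it (`DataIngestionConfig`, `DataIngestionArtifact`, `DataIngestion`, `run_training_pipeline`, the paths) is hypothetical and chosen for illustration, not taken from this repository's code.

```python
from dataclasses import dataclass

# constants: fixed names and defaults shared across the project.
ARTIFACTS_DIR = "artifacts"      # hypothetical value
RAW_DATA_FILE = "raw_data.csv"   # hypothetical value


# config entity: the inputs a component needs.
@dataclass(frozen=True)
class DataIngestionConfig:
    artifacts_dir: str = ARTIFACTS_DIR
    raw_data_file: str = RAW_DATA_FILE


# artifact entity: the outputs a component produces.
@dataclass(frozen=True)
class DataIngestionArtifact:
    raw_data_path: str


# component: one pipeline step, config in, artifact out.
class DataIngestion:
    def __init__(self, config: DataIngestionConfig):
        self.config = config

    def run(self) -> DataIngestionArtifact:
        # A real component would download or copy data here; this sketch
        # only computes where the data would be written.
        path = f"{self.config.artifacts_dir}/{self.config.raw_data_file}"
        return DataIngestionArtifact(raw_data_path=path)


# pipeline: chains components and passes artifacts along.
def run_training_pipeline() -> DataIngestionArtifact:
    return DataIngestion(DataIngestionConfig()).run()


print(run_training_pipeline())
```

In this layout, app.py would be the entry point that triggers the pipeline, for example in response to a request.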

How to run?

conda create -n hate python=3.8 -y

conda activate hate

pip install -r requirements.txt

python app.py

gcloud CLI

Download and run the Google Cloud SDK installer (Windows):

https://dl.google.com/dl/cloudsdk/channels/rapid/GoogleCloudSDKInstaller.exe

gcloud init

Deployment

Set up CircleCI
Enable the self-hosted runner
Create the project
Configure EC2
config.yml
Environment variables
