You can access the competition via the following link: Kaggle.

We are the team named **meow**.

The report is available in the file NLP_report.pdf, following this link.
- Rayane Bouaita
- Erwan David
- Pierre El Anati
- Guillaume Faynot
- Gabriel Trier
Text classification with sparsely represented training data is not a trivial task. We present our solution, which uses large language models (LLMs) to classify texts in almost 390 different languages. After studying the data provided to us, we experimented with several approaches based on pre-trained transformer models (XLM-RoBERTa and BERT). Our final model achieved an accuracy of 88.0%, placing our team in the top 10 of the leaderboard.
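To give a feel for the language-identification task itself, here is a toy, self-contained character-trigram baseline in plain Python. This is only an illustrative sketch, not the XLM-RoBERTa/BERT approach the project actually uses; the class and parameter names (`NgramLanguageClassifier`, `profile_size`) are hypothetical.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Extract overlapping character n-grams from a text."""
    text = text.lower()
    return [text[i:i + n] for i in range(len(text) - n + 1)]


class NgramLanguageClassifier:
    """Toy nearest-profile language identifier: each language is
    represented by its most frequent character trigrams, and a text
    is assigned to the language whose profile it overlaps most."""

    def __init__(self, n=3, profile_size=300):
        self.n = n
        self.profile_size = profile_size
        self.profiles = {}

    def fit(self, texts, labels):
        # Accumulate trigram counts per language label.
        per_lang = {}
        for text, label in zip(texts, labels):
            per_lang.setdefault(label, Counter()).update(char_ngrams(text, self.n))
        # Keep only the most frequent trigrams as each language's profile.
        for label, counts in per_lang.items():
            self.profiles[label] = {g for g, _ in counts.most_common(self.profile_size)}
        return self

    def predict(self, text):
        # Pick the language whose profile shares the most trigrams with the text.
        grams = set(char_ngrams(text, self.n))
        return max(self.profiles, key=lambda lab: len(grams & self.profiles[lab]))
```

Real systems covering hundreds of languages need far more than trigram overlap, which is precisely why a pre-trained multilingual transformer such as XLM-RoBERTa is the natural choice here.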
To install the required packages, you can run the following command:

```shell
pip install -r requirements.txt
```
To train the model, you can run the following command from the root directory:

```shell
python models/roberta.py
```
You can also use the model.ipynb notebook to train the model.