PFE Rapids

End of Studies projects at CentraleSupelec. The goal is to discover and test the Rapids ecosystem, focusing on the librairy cuML.

Installation

Follow the following steps:

Install Conda
Create working environment for Rapids using the following explainer: link. It will install all the require depencie to run cuml.
Activate the environment conda activate env_name.
Install the others depencies the will be listed in for each sub-project.

Data

You can download the data used using the following links::

Digit Recognizer
Detecting Malicious URLs using the svm light version.

Hello Scripts

Clustering Benchmark

Script to benchmark different clustering algorithms. Code adapted from here.

kNN

Test cuml kNN on real data. Code adapted from here

Kmeans 101

A script showing to most basic use of cuML Kmeans implementation.

Image App

Test of a integration of cuML in a Flask API.
Run the app with: python app.py
You can pretrained the model with the model_maker.py script, make sure to properly set the dataset path.

Urls Classification

Set the rigth dataset path and launch the script using python trainer_standalone.py

Full Stream

Set the right dataset path in this script.
Be sure to have Kafka running you can follow this
Launch the mock producer python src/mock_producer/main.py
Launch the trainer python src/trainer/main.py
Launch the metric collector python src/metrics_garbage/main.py

Metrics Analysis

Notebooks to plot metrics analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
data/digit-recognizer		data/digit-recognizer
full_stream		full_stream
hello_scripts		hello_scripts
img_app		img_app
metrics_analysis		metrics_analysis
url_class		url_class
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PFE Rapids

Installation

Data

Hello Scripts

Clustering Benchmark

kNN

Kmeans 101

Image App

Urls Classification

Full Stream

Metrics Analysis

About

Releases

Packages

Languages

GaspardBT/pfe_rapids

Folders and files

Latest commit

History

Repository files navigation

PFE Rapids

Installation

Data

Hello Scripts

Clustering Benchmark

kNN

Kmeans 101

Image App

Urls Classification

Full Stream

Metrics Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages