This is a fork of the original implementation at https://github.com/munhouiani/Deep-Packet.
- SMOTE implementation: `create_train_test_set.py`
- Train/test data reporting: `data_reports.py`
- Test and collect metrics to evaluate model performance: `test_cnn.py`
- Precision-Recall curves: `ml/metrics.py`
## Create an environment via conda

For Linux (CUDA 11.6):

```bash
conda env create -f env_linux_cuda116.yaml
```
## Download the pre-processed dataset

Download the pre-processed small dataset, create a directory called `processed_small`, and extract the contents of the downloaded archive into it:

```bash
mkdir processed_small
tar -xvzf processed_small.tar.gz -C processed_small
```
## Create the train and test set

With undersampling:

```bash
python create_train_test_set.py --source ~/datasets/processed_small --train ~/datasets/undersampled_train_split --test ~/datasets/test_split --class_balancing under_sampling
```
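The `under_sampling` option balances classes by discarding samples from the larger ones. As a minimal sketch of the idea in plain NumPy (not the repository's implementation, which operates on the parquet dataset), every class can be randomly downsampled to the size of the rarest class:

```python
import numpy as np

def undersample(X, y, seed=0):
    """Randomly keep n_min samples per class, where n_min is the
    size of the rarest class, so all classes end up balanced."""
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(y, return_counts=True)
    n_min = counts.min()
    keep = np.concatenate([
        rng.choice(np.flatnonzero(y == c), size=n_min, replace=False)
        for c in classes
    ])
    return X[keep], y[keep]
```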
With SMOTE combined with undersampling, the following parameters were used:

- Minority classes (c): 2
- Nearest neighbors (k): 5
- Amount of SMOTE (n): 1, 2, 3, 4, 5

For example, with c=2, n=2, k=5:

```bash
python create_train_test_set.py --source ~/datasets/processed_small --train ~/datasets/smote_c2_n2_k5_train_split --test ~/datasets/test_split --class_balancing SMOTE+under_sampling -c 2 -n 2 -k 5 -t app --skip_test 1
```
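SMOTE (Chawla et al.) creates synthetic minority samples by interpolating between a minority point and one of its `k` nearest minority-class neighbours, `n` times per point. A minimal NumPy sketch of the technique — not the code in `create_train_test_set.py`:

```python
import numpy as np

def smote(X, n=2, k=5, seed=0):
    """Generate n synthetic samples per minority point by interpolating
    toward one of its k nearest minority neighbours (SMOTE)."""
    rng = np.random.default_rng(seed)
    # pairwise distances within the minority class
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)           # a point is not its own neighbour
    nn = np.argsort(d, axis=1)[:, :k]     # k nearest neighbours per point
    synthetic = []
    for i in range(len(X)):
        for _ in range(n):
            j = rng.choice(nn[i])         # pick a random neighbour
            gap = rng.random()            # interpolation factor in [0, 1)
            synthetic.append(X[i] + gap * (X[j] - X[i]))
    return np.asarray(synthetic)
```

Each synthetic point lies on the line segment between a real sample and one of its neighbours, which is why the new samples stay inside the minority-class region instead of being plain duplicates.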
## Train the model

Application classification:

```bash
python train_cnn.py -d ~/datasets/smote_c2_n1_k5_train_split/application_classification/train.parquet -m model/application_classification.cnn.model.smote.c2n1k5 -t app
```
## Test the model

Application classification:

```bash
python test_cnn.py -d ~/datasets/test_split/application_classification/test.parquet -m model/application_classification.cnn.model.smote.c2n1k5 -t app -p c2n1k5
```
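The Precision-Recall curves are produced by `ml/metrics.py`. The computation behind a PR curve can be sketched one-vs-rest in plain NumPy (the repository's code may differ): sort samples by descending score and accumulate true and false positives at each threshold.

```python
import numpy as np

def pr_curve(scores, y_true):
    """One-vs-rest precision/recall at every descending score threshold.
    scores: per-sample score for the positive class; y_true: 0/1 labels."""
    order = np.argsort(-scores)
    y = y_true[order]
    tp = np.cumsum(y)        # true positives if we cut after each sample
    fp = np.cumsum(1 - y)    # false positives at the same cut
    precision = tp / (tp + fp)
    recall = tp / y.sum()
    return precision, recall
```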
## Data reports

Plot the class distribution of the application-classification test set:

```bash
python data_reports.py -p /path/to/datasets/test_split/application_classification/test.parquet -t app -o app_test_data_dist.png
```
## Preprocessing

Pre-process raw pcap files:

```bash
python preprocessing.py -s /path/to/pcap_files -t /path/to/datasets/processed_new
```
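At its core, Deep Packet-style preprocessing maps each packet to a fixed-length, normalised byte vector (the paper uses 1500 bytes). A sketch of that single step, assuming the 1500-byte length; the actual `preprocessing.py` additionally handles pcap parsing, header masking, and packet filtering:

```python
import numpy as np

PACKET_LEN = 1500  # fixed input length from the Deep Packet paper (assumption)

def packet_to_vector(payload: bytes) -> np.ndarray:
    """Truncate or zero-pad the packet bytes to PACKET_LEN and
    scale each byte to [0, 1] so it can feed the CNN."""
    buf = np.frombuffer(payload[:PACKET_LEN], dtype=np.uint8)
    vec = np.zeros(PACKET_LEN, dtype=np.float32)
    vec[: len(buf)] = buf / 255.0
    return vec
```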