Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Team 8 #20

Open
wants to merge 35 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
35 commits
Select commit Hold shift + click to select a range
cbcfae7
test
YanniPapandreou Mar 24, 2021
12bfac8
task_3 predictions
YanniPapandreou Mar 24, 2021
9765c28
task_1 predictions
YanniPapandreou Mar 24, 2021
8bf210c
task_1 predictions
YanniPapandreou Mar 24, 2021
cc1134e
task_4 predictions
YanniPapandreou Mar 24, 2021
59b2c0d
task_3 predictions nn
YanniPapandreou Mar 24, 2021
2155c42
task_3 predictions
YanniPapandreou Mar 24, 2021
ae2284e
task_3 predictions
YanniPapandreou Mar 24, 2021
2eb167b
task_3 predictions
YanniPapandreou Mar 24, 2021
ad0e47d
task_3 predictions
YanniPapandreou Mar 24, 2021
0a003d7
task_3 predictions
YanniPapandreou Mar 24, 2021
1ce0091
task_1 predictions
YanniPapandreou Mar 24, 2021
63179a2
task_4 predictions
YanniPapandreou Mar 24, 2021
569b989
task_1 predictions
YanniPapandreou Mar 24, 2021
a8448ad
task_4 predictions
YanniPapandreou Mar 24, 2021
99d1c3e
task_2 predictions
YanniPapandreou Mar 24, 2021
4555ce2
task_2 predictions
YanniPapandreou Mar 24, 2021
f90db81
bonus 1 predictions
YanniPapandreou Mar 24, 2021
485e787
task_3 predictions
YanniPapandreou Mar 24, 2021
4b2bee0
task_3 predictions
YanniPapandreou Mar 24, 2021
0f012a0
bonus_2 predictions
YanniPapandreou Mar 24, 2021
4b8e853
task_4 predictions
YanniPapandreou Mar 24, 2021
a7f466e
bonus_2 predictions
YanniPapandreou Mar 24, 2021
3c43eee
bonus_2 predictions
YanniPapandreou Mar 24, 2021
010ef5c
task_2 predictions
YanniPapandreou Mar 24, 2021
1f5b838
task_4 predictions
YanniPapandreou Mar 24, 2021
a0ec7dc
bonus_3 predictions
YanniPapandreou Mar 24, 2021
3fac72c
task_4 predictions
YanniPapandreou Mar 24, 2021
22f423c
bonus_2 predictions
YanniPapandreou Mar 24, 2021
7b1eede
task_4 predictions
YanniPapandreou Mar 24, 2021
d6b68f2
bonus_3 predictions
YanniPapandreou Mar 24, 2021
4b6689d
bonus_3 predictions
YanniPapandreou Mar 24, 2021
f96b5e3
code
YanniPapandreou Mar 24, 2021
899692a
code
YanniPapandreou Mar 24, 2021
7a4d24a
Add files via upload
nevetse Mar 25, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3,363 changes: 3,363 additions & 0 deletions bonus_1_prediction.csv

Large diffs are not rendered by default.

3,363 changes: 3,363 additions & 0 deletions bonus_1_predictions.csv

Large diffs are not rendered by default.

3,363 changes: 3,363 additions & 0 deletions bonus_2_predictions.csv

Large diffs are not rendered by default.

3,363 changes: 3,363 additions & 0 deletions bonus_3_predictions.csv

Large diffs are not rendered by default.

46 changes: 46 additions & 0 deletions bonus_task_1.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
#! /usr/bin/env python3
import pandas as pd
import numpy as np
from sklearn import *

## Training
train_descriptors = pd.read_csv("../train_and_test_sets/train_descriptors.csv")
train_mord3d = pd.read_csv("../train_and_test_sets/train_mord3d.csv")
train_morgan = pd.read_csv("../train_and_test_sets/train_morgan.csv")
train_rdk = pd.read_csv("../train_and_test_sets/train_rdk.csv")

## Train responses
train_crystals = pd.read_csv("../train_and_test_sets/train_crystals.csv")
train_distances = pd.read_csv("../train_and_test_sets/train_distances.csv")
train_centroid_distances = pd.read_csv("../train_and_test_sets/train_centroid_distances.csv")

train_descriptors_full = train_descriptors.iloc[:, 3:-2].dropna(axis=1, how="any")
train_mord3d_full = train_mord3d.dropna(axis=1, how="any").drop(['identifiers','Unnamed: 0', 'InchiKey',
'smiles', 'name'], axis=1)
train_morgan_full = train_morgan.drop('0',axis=1)
train_rdk = train_rdk.drop('0',axis=1)

## Testing
test_descriptors = pd.read_csv("../train_and_test_sets/test_descriptors.csv")
test_mord3d = pd.read_csv("../train_and_test_sets/test_mord3d.csv")
test_morgan = pd.read_csv("../train_and_test_sets/test_morgan.csv")
test_rdk = pd.read_csv("../train_and_test_sets/test_rdk.csv")

test_descriptors_full = test_descriptors[train_descriptors_full.columns]
test_mord3d_full = test_mord3d[train_mord3d_full.columns]
test_morgan_full = test_morgan.drop('0',axis=1)
test_rdk = test_rdk.drop('0',axis=1)

train_PCA = decomposition.PCA(n_components=.95)
scaler_for_PCA = preprocessing.StandardScaler()
train_descriptors_PCA = train_PCA.fit_transform(scaler_for_PCA.fit_transform(train_descriptors_full))
test_descriptors_PCA = train_PCA.transform(scaler_for_PCA.transform(test_descriptors_full))

target = 'packing_coefficient'

from sklearn import tree
model = tree.DecisionTreeRegressor(criterion='mae')
model.fit(train_descriptors_PCA, train_crystals[target])

predictions = model.predict(test_descriptors_PCA)
pd.DataFrame(predictions).to_csv("task_packing_coefficient_predictions22.csv", header=False, index=False)
Loading