Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The idea of ensemble in this model #11

Open
wanyueli opened this issue Sep 22, 2020 · 1 comment
Open

The idea of ensemble in this model #11

wanyueli opened this issue Sep 22, 2020 · 1 comment

Comments

@wanyueli
Copy link

Hi!In your paper you said that "Each combination of features and network architecture was initially trained with the PDBBind refined set minus the core set for 300 epochs. We trained 20 replicas from scratch, and for each replica, the model that minimized the validation error of the core set was saved. " Does this means we should run this code from the preprocess step to the test step 20 times and then choose five models with the lowest error to get the final results?

@hhassan1
Copy link
Collaborator

Hi @wanyueli , you are mostly correct.

The only difference is that we only preprocessed the dataset once. The "scratch" part was meant to say that we ran the training 20 times, for each training saving the model on the iteration where the lowest validation error was achieved.

The ensambling was done manually, it is not part of the pipeline. For ensambling we saved the 5 best predictions (best according to the RMSE of the validation set) and averaged the prediction of each complex across these 5 predictions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants