Train and Test dataset partition #5

hezt · 2019-11-13T20:38:42Z

Hello,

Could you please illustrate how to partite the train set and the test set from CV files (http://gerv.csail.mit.edu/deepligand_CVdata/) to get the evaluation performance curve depicted in your paper?

I'm trying to reimplement your train and evaluate processes.

Thanks,
Zitong

hezt · 2019-11-13T20:51:19Z

Hello,

Moreover, whether you concatenate the prediction results on each fold, where the model was trained on the other 4 folds, to draw auROC and auPRC curves?

Best,
Zitong

haoyangz · 2019-11-14T17:22:02Z

@hezt For each of the five folds, we trained one model using the other four folds before using it to predict on this fold. The resulting predictions of the five folds were concatenated to calculate auROC and other metrics.

KiAkize · 2021-03-15T03:16:28Z

Hello，

I am also trying to retrain 5cv models to reimplement results.

Could you please illustrate what each column of the downloaded 5CV data means?

In addition, besides renaming MHC names to the format in the MHC_pseudo.dat, what else needs to be done before using preprocess.py to transform training data?

Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train and Test dataset partition #5

Train and Test dataset partition #5

hezt commented Nov 13, 2019

hezt commented Nov 13, 2019

haoyangz commented Nov 14, 2019

KiAkize commented Mar 15, 2021 •

edited

Loading

Train and Test dataset partition #5

Train and Test dataset partition #5

Comments

hezt commented Nov 13, 2019

hezt commented Nov 13, 2019

haoyangz commented Nov 14, 2019

KiAkize commented Mar 15, 2021 • edited Loading

KiAkize commented Mar 15, 2021 •

edited

Loading