You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A number of the MoleculeNet datasets have PAINS compounds and other compounds that detract from their usefulness as benchmarking datasets. Let's use this issue to brainstorm the set of datasets that we want to filter. I think we can use @PatWaltershttps://github.com/PatWalters/rd_filters library to help us filter datasets.
Off the top of my head, I think we can start by applying the PAINS filters to the
Chembl
Chemb25
HIV
PCBA
datasets. We should discuss here to see if this makes sense though.
A number of the MoleculeNet datasets have PAINS compounds and other compounds that detract from their usefulness as benchmarking datasets. Let's use this issue to brainstorm the set of datasets that we want to filter. I think we can use @PatWalters https://github.com/PatWalters/rd_filters library to help us filter datasets.
Off the top of my head, I think we can start by applying the PAINS filters to the
datasets. We should discuss here to see if this makes sense though.
CC @mufeili @PatWalters @lilleswing @peastman
The text was updated successfully, but these errors were encountered: