New MoleculeNet Datasets #2

Open
lilleswing opened this issue Feb 24, 2020 · 4 comments

Comments

@lilleswing
Member

Two years have passed since MoleculeNet was published, and supervised learning for molecules has made many strides in that time. Are all of the datasets from the original paper still relevant to the challenges projects face today? Are there new datasets that should be added to the benchmark?

@rbharath
Member

We should add the Enamine REAL compound libraries as a dataset: https://enamine.net/library-synthesis/real-compounds/real-compound-libraries

@rbharath
Member

We should consider adding the Crystallography Open Database. Following up on the discussion from deepchem/deepchem#425.

@rbharath
Member

We should consider adding the Cambridge Structural Database: https://www.ccdc.cam.ac.uk/solutions/csd-system/components/csd/

Following up on the discussion from deepchem/deepchem#426.

@rbharath
Member

Following up on the discussion from deepchem/deepchem#867.

We should try to add more binding assay data.

rbharath pushed a commit that referenced this issue Feb 7, 2021