Processing of QM7/QM9 Targets #42

FelixKatz77 · 2021-11-16T18:11:28Z

Hi,
I wanted to use the splits of the QM7 and QM8 datasets for benchmarking when I noticed a discrepancy between the targets accessible via the load_qm7()/load_qm8() functions and the original targets of these datasets (http://quantum-machine.org/datasets/). I could not find any information on any processing of the targets. Could you clarify if any normalisation or rescaling was done?

I was also wondering how the benchmark performance was determined in the case of multitask datasets. In these cases, was a single task taken into account or the performance on all tasks? Thanks!

rbharath · 2021-11-16T19:22:34Z

I believe that outputs are normalized (see https://deepchem.readthedocs.io/en/latest/api_reference/moleculenet.html#qm7-datasets, and linked source). The discrepancy between the load functions and the original datasets is a little disconcerting and something we should investigate.

For benchmark performance, I believe it is mean performance across all tasks but I'm going from memory and may be wrong
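
To make "mean performance across all tasks" concrete, here is a minimal sketch of what that would look like for a regression metric such as MAE: score each task column separately, then average the per-task scores. The helper names and the toy numbers are hypothetical, and this is not taken from the benchmark code, so treat it as an illustration of the idea only.

```python
def mean_absolute_error(y_true, y_pred):
    """MAE for a single task (two equal-length lists of floats)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def multitask_mean_mae(y_true, y_pred):
    """Score each task column on its own, then average over tasks.

    y_true and y_pred are lists of rows, one column per task.
    Returns (per-task MAE list, unweighted mean over tasks).
    """
    n_tasks = len(y_true[0])
    per_task = []
    for k in range(n_tasks):
        col_true = [row[k] for row in y_true]
        col_pred = [row[k] for row in y_pred]
        per_task.append(mean_absolute_error(col_true, col_pred))
    return per_task, sum(per_task) / n_tasks


# Toy data: 3 samples, 2 tasks (hypothetical values).
y_true = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
y_pred = [[1.5, 12.0], [2.5, 18.0], [2.0, 33.0]]

per_task, overall = multitask_mean_mae(y_true, y_pred)
print("per-task MAE:", per_task)
print("mean over tasks:", overall)
```

Note that an unweighted mean like this is dominated by tasks with larger target scales unless the targets are normalized first.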

FelixKatz77 · 2021-11-17T08:36:46Z

I think the target processing is relevant to all the regression tasks. I tried to work out the mapping between the targets in the datasets downloaded from https://moleculenet.org/datasets-1 and the targets you can access via the 'y' label after loading the datasets via dc.molnet.load_dataset(), but could not. It would be great if you could comment on this.

FelixKatz77 · 2021-11-17T09:01:09Z

I figured out the normalization using the 'transformers' argument in dc.molnet.load_dataset().
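
For anyone else who hits this, a minimal sketch of the kind of mapping involved, assuming the transformer applies a plain z-score to the targets (subtract the mean, divide by the standard deviation). The function names and numbers below are hypothetical, and whether the library uses the population or sample standard deviation, or which split the statistics come from, is not something this sketch verifies.

```python
from statistics import mean, pstdev


def fit_normalizer(y):
    """Return (mean, std) of the raw targets. Population std is an assumption."""
    return mean(y), pstdev(y)


def normalize(y, mu, sigma):
    """Map raw targets to z-scores."""
    return [(v - mu) / sigma for v in y]


def denormalize(z, mu, sigma):
    """Invert the z-score to recover targets in the original units."""
    return [v * sigma + mu for v in z]


# Hypothetical raw targets in their original units.
raw = [-400.0, -800.0, -1200.0, -1600.0]

mu, sigma = fit_normalizer(raw)
z = normalize(raw, mu, sigma)          # what a normalized 'y' would look like
restored = denormalize(z, mu, sigma)   # back to the original scale

print("normalized:", z)
print("restored:", restored)
```

In DeepChem itself the inverse step should presumably go through the transformers returned by the loader (for example via dc.trans.undo_transforms) rather than hand-rolled arithmetic like this.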

FelixKatz77 · 2021-11-17T09:04:24Z

If there are any more insights on the benchmarking for multitask datasets, I would still be happy to learn about them.
Thanks!

FelixKatz77 closed this as completed Nov 17, 2021

FelixKatz77 reopened this Nov 17, 2021
