Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Roadmap] Release Plan for 0.3 #18

Open
mufeili opened this issue Jun 12, 2020 · 11 comments
Open

[Roadmap] Release Plan for 0.3 #18

mufeili opened this issue Jun 12, 2020 · 11 comments

Comments

@mufeili
Copy link
Contributor

mufeili commented Jun 12, 2020

This post is used to list the development plan for the next release. Feel free to leave comments if you have any requirement.

  1. Support average precision metric
  2. Pre-trained models on benchmarks like MoleculeNet, Alchemy, QM9, etc
  3. Better support for attention visualization
  4. Visualization for learned molecular representations
  5. Adjust learning rate and add gradient clipping for ogbl-ppa.
  6. Add better support for feature selection
@mufeili mufeili pinned this issue Jun 16, 2020
@autodataming
Copy link
Contributor

if xxx.txt.proc file is not correspond to the xxx.txt file, the xxx.txt.proc shou be generated again.

@autodataming
Copy link
Contributor

file 2.rxns

[O:1]=[C:2]([OH:3])[c:4]1[c:5]([Br:6])[cH:7][cH:8][cH:9][c:10]1[NH:11][C:12](=[O:13])[CH3:14]>>[O:1]=[C:2]([OH:3])[c:4]1[c:5]([Br:6])[cH:7][cH:8][cH:9][c:10]1[NH2:11]

run the command,

python find_reaction_center_eval.py --test-path  2.rxns -np 1

it report error:


dgl._ffi.base.DGLError: Expect number of features to match number of nodes (len(u)). Got 27 and 14 instead.

@mufeili
Copy link
Contributor Author

mufeili commented Jun 25, 2020

if xxx.txt.proc file is not correspond to the xxx.txt file, the xxx.txt.proc shou be generated again.

If we want to ensure that, we always need to compute graph edits from scratch. As a result, let's always generate that x.proc file from scratch. I've done that in PR #32 .

@mufeili
Copy link
Contributor Author

mufeili commented Jun 25, 2020

file 2.rxns

[O:1]=[C:2]([OH:3])[c:4]1[c:5]([Br:6])[cH:7][cH:8][cH:9][c:10]1[NH:11][C:12](=[O:13])[CH3:14]>>[O:1]=[C:2]([OH:3])[c:4]1[c:5]([Br:6])[cH:7][cH:8][cH:9][c:10]1[NH2:11]

run the command,

python find_reaction_center_eval.py --test-path  2.rxns -np 1

it report error:


dgl._ffi.base.DGLError: Expect number of features to match number of nodes (len(u)). Got 27 and 14 instead.

I guess you previously held some different reactions in 2.rxns and the script loads constructed DGLGraphs for those different reactions. I'm now changing the default behavior to constructing DGLGraphs from scratch in PR #32.

@autodataming
Copy link
Contributor

  1. DGLGraphs file "test.bin"
  2. rxn file "xxx.txt"
  3. rxn process file "xxx.txt.proc"

it will be better if the base name of DGLGraph file is consistent with the rxn file.

test.bin -> xxx.txt.bin

@mufeili
Copy link
Contributor Author

mufeili commented Jun 28, 2020

  1. DGLGraphs file "test.bin"
  2. rxn file "xxx.txt"
  3. rxn process file "xxx.txt.proc"

it will be better if the base name of DGLGraph file is consistent with the rxn file.

test.bin -> xxx.txt.bin

This shall be addressed in PR #35.

@autodataming
Copy link
Contributor

add debug mode!

In the debug mode, it will report what rxn raise the error.

run the command

python find_reaction_center_eval.py --test-path sin_map_clean.rxns   -np 1

Evaluation on the test set.
Traceback (most recent call last):
  File "find_reaction_center_eval.py", line 79, in <module>
    main(args)
  File "find_reaction_center_eval.py", line 47, in main
    args, args['top_ks_test'], model, test_loader, args['easy'])
  File "/home/NFS/user/zgong/czq/workflow_retro_deepsyn2/step3dgllifesci/dgl-lifesci/examples/reaction_prediction/rexgen_direct/utils.py", line 456, in reaction_center_final_eval
    for batch_id, batch_data in enumerate(data_loader):
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 345, in __next__
    data = self._next_data()
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 385, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 509, in __getitem__
    self.atom_pair_labels[item] = get_pair_label(mol, self.graph_edits[item])
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 181, in get_pair_label
    labels[i, j, pair_to_changes[(j, i)]] = 1.
IndexError: index 62 is out of bounds for dimension 1 with size 62

obtain the head 100 rxns in the file sin_map_clean.rxns, it will not report error!

head -n 100 sin_map_clean.rxns > sin100.rxns
python find_reaction_center_eval.py --test-path sin100.rxns    -np 1

@mufeili
Copy link
Contributor Author

mufeili commented Jun 28, 2020

add debug mode!

In the debug mode, it will report what rxn raise the error.

run the command

python find_reaction_center_eval.py --test-path sin_map_clean.rxns   -np 1
Evaluation on the test set.
Traceback (most recent call last):
  File "find_reaction_center_eval.py", line 79, in <module>
    main(args)
  File "find_reaction_center_eval.py", line 47, in main
    args, args['top_ks_test'], model, test_loader, args['easy'])
  File "/home/NFS/user/zgong/czq/workflow_retro_deepsyn2/step3dgllifesci/dgl-lifesci/examples/reaction_prediction/rexgen_direct/utils.py", line 456, in reaction_center_final_eval
    for batch_id, batch_data in enumerate(data_loader):
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 345, in __next__
    data = self._next_data()
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 385, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 509, in __getitem__
    self.atom_pair_labels[item] = get_pair_label(mol, self.graph_edits[item])
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 181, in get_pair_label
    labels[i, j, pair_to_changes[(j, i)]] = 1.
IndexError: index 62 is out of bounds for dimension 1 with size 62

obtain the head 100 rxns in the file sin_map_clean.rxns, it will not report error!

head -n 100 sin_map_clean.rxns > sin100.rxns
python find_reaction_center_eval.py --test-path sin100.rxns    -np 1

Can you provide a reaction that will yield the error? I want to use that for developing the feature you requested.

@mufeili
Copy link
Contributor Author

mufeili commented Jun 30, 2020

add debug mode!

In the debug mode, it will report what rxn raise the error.

run the command

python find_reaction_center_eval.py --test-path sin_map_clean.rxns   -np 1
Evaluation on the test set.
Traceback (most recent call last):
  File "find_reaction_center_eval.py", line 79, in <module>
    main(args)
  File "find_reaction_center_eval.py", line 47, in main
    args, args['top_ks_test'], model, test_loader, args['easy'])
  File "/home/NFS/user/zgong/czq/workflow_retro_deepsyn2/step3dgllifesci/dgl-lifesci/examples/reaction_prediction/rexgen_direct/utils.py", line 456, in reaction_center_final_eval
    for batch_id, batch_data in enumerate(data_loader):
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 345, in __next__
    data = self._next_data()
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 385, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 509, in __getitem__
    self.atom_pair_labels[item] = get_pair_label(mol, self.graph_edits[item])
  File "/home/zgong/nfs/program/anaconda2/envs/py36dgllifesci/lib/python3.6/site-packages/dgllife/data/uspto.py", line 181, in get_pair_label
    labels[i, j, pair_to_changes[(j, i)]] = 1.
IndexError: index 62 is out of bounds for dimension 1 with size 62

obtain the head 100 rxns in the file sin_map_clean.rxns, it will not report error!

head -n 100 sin_map_clean.rxns > sin100.rxns
python find_reaction_center_eval.py --test-path sin100.rxns    -np 1

This shall be addressed in PR #38 .

@mufeili mufeili changed the title [Roadmap] Release Plan for 0.2.4 [Roadmap] Release Plan for 0.3 Aug 4, 2020
@mufeili
Copy link
Contributor Author

mufeili commented Aug 25, 2020 via email

@summer-cola
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants