Some issues on the size of the dataset #135
Unanswered
Wangyt54549
asked this question in
Q&A
Replies: 1 comment
-
You can use |
-
Hi everyone,
I'm trying to reproduce the simulation of n-C12 pyrolysis (https://dx.doi.org/10.1021/acs.energyfuels.0c03211).
I generated the dump file from a 1 ps MD run using the parameters from the original paper, then ran MDDatasetBuilder with the command "datasetbuilder -d c12.dump -c 3.5 -a C H -n c12". This produced a dataset of 20,493 structures (.xyz/.gjf), which is much larger than the initial dataset of 590 clusters reported in the paper.
Is any other operation needed? Is the k-means clustering step in sklearn not being performed correctly? Or is a dataset of this size a common result?