Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does the repo lack the required source code for training? #3

Open
vkgo opened this issue Oct 8, 2022 · 4 comments
Open

Does the repo lack the required source code for training? #3

vkgo opened this issue Oct 8, 2022 · 4 comments

Comments

@vkgo
Copy link

vkgo commented Oct 8, 2022

It seems I can't find the training source codes in this repository.

@rahuls321
Copy link

Yes it's missing from the repo.

@iamhere1
Copy link

iamhere1 commented Oct 10, 2022

Hi, for some reasons, the complete training code is not convenient to be pushed now. However, we have published most of the hyper parameters in the paper. If you want to train your AES model, here are some tips:

  1. Based on our open source code, run the decoding process to see if the QWK metric is consistent with the paper.
  2. Add the training process (mainly include the process of updating the gradient according to the loss function. The code for data loading, data encoding, and the model have been published in the repo).

@shield124
Copy link

Hi, for some reasons, the complete training code is not convenient to be pushed now. However, we have published most of the hyper parameters in the paper. If you want to train your AES model, here are some tips:

  1. Based on our open source code, run the decoding process to see if the QWK metric is consistent with the paper.
  2. Add the training process (mainly include the process of updating the gradient according to the loss function. The code for data loading, data encoding, and the model have been published in the repo).

作者大大,可以分享下train模型的代码吗?研一入门小白实在难以自己写出来训练的代码,感谢您!

@iamhere1
Copy link

Hi, for some reasons, the complete training code is not convenient to be pushed now. However, we have published most of the hyper parameters in the paper. If you want to train your AES model, here are some tips:

  1. Based on our open source code, run the decoding process to see if the QWK metric is consistent with the paper.
  2. Add the training process (mainly include the process of updating the gradient according to the loss function. The code for data loading, data encoding, and the model have been published in the repo).

作者大大,可以分享下train模型的代码吗?研一入门小白实在难以自己写出来训练的代码,感谢您!

预测代码可以先跑下和理解下,训练部分你先尝试加下损失函数和梯度更新的逻辑,如果有问题,欢迎继续交流。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants