-
Notifications
You must be signed in to change notification settings - Fork 165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hello, thanks for your great works, I'm confused with the dataset. #54
Comments
Hello, for the dataset used in these examples, please see #2 . The expected structure of the input data is described in the Transformer's documentation; you can implement your own dataset as long as it matches this input shape. |
Hi, I have read the doc. For the inputs and outpurs of the model, I understand those as follows: |
In other word, I want to deal with a regression task, it can be described as follows: |
|
Thanks for you reply. In this case, the parameter attention_size can be set <= K ? |
Yes exactly ! |
Hi, I don't see directly where a NaN could come from, I encourage you to debug during the validation loss computation in order to see what tensor or function is malfunctioning. |
Hello sir, i'm confused with the dataset, can share the dataset_57M.npz or other demo dataset.
I just don't know the dataset's structure.
The text was updated successfully, but these errors were encountered: