Truncated BPTT #56
Replies: 4 comments 12 replies
-
Thanks! Let me try to add a code sample in this discussion. |
Beta Was this translation helpful? Give feedback.
-
Here is an example of TBPTT based on classifying FashionMNIST:
|
Beta Was this translation helpful? Give feedback.
-
Here is an example of k1 = T_truncated, k2 = 2 * T_truncated. But it is not the standard TBPTT because some computation graph is not re-used. The re-used computation graph is difficult to implement.
|
Beta Was this translation helpful? Give feedback.
-
I've just heard that PyTorch Lightning has TBPTT: |
Beta Was this translation helpful? Give feedback.
-
Hi there,
SpikingJelly is great, congrats to the developers, and thank you :-)
When dealing with a long DVS sequence with labels provided at multiple time points, truncated backprop through time (TBPTT) can be useful, see for example: http://arxiv.org/abs/2009.13436
I think it would be straightforward to use TBPTT in SpikingJelly, but only in the step-by-step mode, not in the layer-by-layer one, right?
Beta Was this translation helpful? Give feedback.
All reactions