how do you process 50 batches before giving the loss output #9232
Unanswered
neurosynapse
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Hi @Franko9999. Actually, It would be nice to check |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
It would be nice if you could help me to understand how its implemented that you process 50 batches at once. Here is the example of retinanet training, batch size is 4 but the code loads 50 batches in one iteration and then gives the loss output. Is the loss calculated for each batch separately and then you get average at the end? How did you implement this training process as I see it greatly improves the training speed compared to single batch process code variants. Could you explain why? Its a little bit hard to debug the code because of the register usage. Thanks a lot!
Here I print the input shape for the model, as you can see it loads up 50 batches and then gives the loss:
Best regards,
Roberts
Beta Was this translation helpful? Give feedback.
All reactions