[WIP] Fetch new validation data for each calc_loss #279

Sopel97 · 2020-12-22T19:59:10Z

This PR makes calc_loss use a new validation set each time it's called (one can now limit the amount of validation data used per calc_loss call, so this makes sense). Since we're using relatively small validation sets for each calc_loss this should, in principle at least, make validation loss more informative when observed over multiple validation steps.

I'd like to get some feedback from @noobpwnftw on this change before it's merged.

I'd also like to get some feedback on how to handle the case when we couldn't fetch the requested amount of the data for validation. This might happen in extreme cases. Currently I made it continue with a warning as the training can proceed even without validation data, but there might be a better resolution to this problem.

NightlyKing · 2020-12-28T07:05:49Z

As I already said in discord: the loss now really jumps around like crazy. My idea was to only slowly introduce new positions to the validation set to get a more stable loss output which will help with automatic LR drops.

fetch new validation data for each calc_loss

9545b18

Sopel97 changed the title ~~Fetch new validation data for each calc_loss~~ [WIP] Fetch new validation data for each calc_loss Dec 24, 2020

Sopel97 added the question Further information is requested label Apr 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Fetch new validation data for each calc_loss #279

[WIP] Fetch new validation data for each calc_loss #279

Sopel97 commented Dec 22, 2020 •

edited

Loading

NightlyKing commented Dec 28, 2020

[WIP] Fetch new validation data for each calc_loss #279

Are you sure you want to change the base?

[WIP] Fetch new validation data for each calc_loss #279

Conversation

Sopel97 commented Dec 22, 2020 • edited Loading

NightlyKing commented Dec 28, 2020

Sopel97 commented Dec 22, 2020 •

edited

Loading