
rf.BatchNorm keeps updating statistics when used in eval mode in training #1625

Open
mnghiap opened this issue Sep 12, 2024 · 1 comment

mnghiap commented Sep 12, 2024

Currently, rf.BatchNorm decides whether to update the running statistics based on rf.get_run_ctx().train_flag, as in this line:

update_running_stats = self.running_mean is not None and rf.get_run_ctx().train_flag

However, there are cases where the module should run in eval mode during training (e.g. the teacher model in knowledge distillation), and in those cases it should not update the running statistics.

I suggest we add a mechanism to let rf.BatchNorm (and possibly other modules with similar issues, though I don't know of any others yet) run in eval mode even during training.
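To illustrate the problem concretely, here is a minimal, self-contained sketch (not the actual rf.BatchNorm code; the class and parameter names are made up for illustration) of running-stats logic gated on a single global train flag. When the global flag is true, a frozen teacher's statistics still drift:

```python
# Hypothetical minimal sketch of batch-norm running statistics gated on a
# *global* train flag, mirroring the problematic condition from the issue.
import numpy as np

class TinyBatchNorm:
    def __init__(self, dim, momentum=0.1):
        self.running_mean = np.zeros(dim)
        self.momentum = momentum

    def __call__(self, x, train_flag):
        # Same shape of condition as in the issue: update iff the global
        # train flag is set, with no per-module override.
        update_running_stats = self.running_mean is not None and train_flag
        if update_running_stats:
            batch_mean = x.mean(axis=0)
            # Exponential moving average update of the running mean.
            self.running_mean = (
                (1.0 - self.momentum) * self.running_mean
                + self.momentum * batch_mean
            )
        return x - self.running_mean

# Teacher model in knowledge distillation: we want its statistics frozen,
# but the global train flag is True because the *student* is training.
teacher_bn = TinyBatchNorm(2)
before = teacher_bn.running_mean.copy()
teacher_bn(np.ones((4, 2)), train_flag=True)
# The teacher's running mean has drifted even though it should be frozen.
assert not np.allclose(teacher_bn.running_mean, before)
```

With momentum 0.1 and all-ones input, one such call moves the teacher's running mean from 0 to 0.1 per dimension, even though the teacher was never meant to be updated.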


albertz commented Sep 13, 2024

I was planning to add a context scope where we can override the train_flag. Usage would be something like:

with rf.get_run_ctx().set_train_flag_scope(False):
  ...

That should solve this, right?
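For reference, here is a rough sketch of how such a scope could be implemented as a context manager. The names RunCtx and set_train_flag_scope are assumptions taken from the proposed usage above, not the actual RETURNN implementation:

```python
# Hypothetical sketch of a train-flag override scope; RunCtx and
# set_train_flag_scope are assumed names, not the real RETURNN API.
from contextlib import contextmanager

class RunCtx:
    def __init__(self):
        self.train_flag = True  # global default: training

    @contextmanager
    def set_train_flag_scope(self, value):
        prev = self.train_flag
        self.train_flag = value
        try:
            yield
        finally:
            # Restore the previous flag on exit, even on exceptions.
            self.train_flag = prev

ctx = RunCtx()
with ctx.set_train_flag_scope(False):
    # The teacher's forward pass would run here; any module that
    # consults ctx.train_flag sees False and skips stats updates.
    assert ctx.train_flag is False
assert ctx.train_flag is True  # flag restored outside the scope
```

The key property is that the override is scoped and restored automatically, so modules like rf.BatchNorm that consult the run context's flag need no changes of their own.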
