Skip to content

Latest commit

 

History

History

small-scale-batch-size-impact

Small scale model with varying batch sizes

The configuration is exactly the same as small-scale-against-choosefirst-v0. I did 10 runs on varying batch sizes.

Legend:

legend

Loss:

loss

In fact layer weights got stable very fast. Here are some sample pictures on the first a few layers:

layer-weights

$\rho$ metric:

  • vs ChooseFirst:

vs-choosefirst

  • vs Random:

vs-random