
Use Dadaptation for all DL models #327

Merged · jakobnissen merged 1 commit into master from dadapt on Jun 26, 2024

Conversation

jakobnissen (Member) commented:
Deprecate the learning rate.

Closes #267

@jakobnissen added the "Needs benchmark" label (must benchmark before merging) on Jun 25, 2024
Commit message: This optimizer is more efficient and makes the learning rate obsolete, which is nice from a UI standpoint.
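
For context, a swap to D-Adaptation in a PyTorch training setup might look like the sketch below. This is a minimal illustration, not Vamb's actual code: the toy model is a stand-in for the real networks, and the `lr=1.0` setting follows the `dadaptation` package's recommendation that `lr` stay at its default, since D-Adaptation estimates the step size itself.

```python
import torch
from dadaptation import DAdaptAdam  # pip install dadaptation

# Stand-in model; Vamb's actual networks are not shown here.
model = torch.nn.Sequential(
    torch.nn.Linear(64, 32),
    torch.nn.ReLU(),
    torch.nn.Linear(32, 64),
)

# With D-Adaptation, lr is only a multiplier on the internally
# estimated step size, so it can stay at the recommended 1.0 and
# no longer needs to be exposed as a user-facing hyperparameter.
optimizer = DAdaptAdam(model.parameters(), lr=1.0)

# Ordinary training step; the step size is adapted automatically.
x = torch.randn(8, 64)
loss = torch.nn.functional.mse_loss(model(x), x)
loss.backward()
optimizer.step()
optimizer.zero_grad()
```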
jakobnissen (Member, Author) commented:

Benchmarks show a 1% decrease in NCs, which is probably within measurement error. The overall loss also seems to improve slightly.
I'm going to merge this now anyway, because getting rid of the learning rate is a good idea. FYI @sgalkina: we might consider upgrading to Prodigy, which is a newer, shinier optimiser made by the same people.
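
If that upgrade happens, it would likely be a one-line swap. A hedged sketch, assuming the `prodigyopt` package and reusing the model from the D-Adaptation example above:

```python
from prodigyopt import Prodigy  # pip install prodigyopt

# Like D-Adaptation, Prodigy treats lr as a multiplier on its own
# adapted step size; the authors recommend leaving it at 1.0.
optimizer = Prodigy(model.parameters(), lr=1.0)
```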

@jakobnissen removed the "Needs benchmark" label on Jun 26, 2024
@jakobnissen merged commit b75f79d into master on Jun 26, 2024 (5 checks passed)
@jakobnissen deleted the dadapt branch on Jun 26, 2024 at 08:41

Linked issue: Use D-Adaptation for all DL models (#267)
1 participant