
LayerNorm_A calculation precision is unstable, leading to incorrect recognition results. #42

Open

winf opened this issue Aug 2, 2024 · 2 comments

winf commented Aug 2, 2024
#41 #20
#41 #20
When using models other than the .en variants, any model larger than tiny produces significantly incorrect recognition results. By comparing layer-by-layer outputs against the openai repository's runtime results, I traced the problem to the poor numerical stability of the mean and variance calculations in the LayerNorm_A function. Replacing fp16 with fp32 in those calculations fixed the issue. Reference: https://git.bwbot.org/publish/useful-transformers/-/blob/main/lib/layernorm.cc
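The instability is easy to reproduce outside the library. The sketch below (not the repo's actual code) emulates an fp16 accumulator using Python's stdlib `struct` half-precision format and compares it against a higher-precision accumulator when computing a mean, the first step of LayerNorm. Once the running sum grows large, the fp16 spacing between representable values exceeds the increments being added, so the mean drifts badly; accumulating in wider precision avoids this.

```python
import struct

def to_fp16(x: float) -> float:
    """Round a float to the nearest IEEE 754 half-precision value."""
    return struct.unpack('e', struct.pack('e', x))[0]

def mean_fp16_accum(xs):
    # Accumulate in fp16: every partial sum is rounded to half precision,
    # so rounding error compounds as the sum grows.
    s = 0.0
    for x in xs:
        s = to_fp16(s + to_fp16(x))
    return s / len(xs)

def mean_wide_accum(xs):
    # Accumulate in wider precision (Python floats are fp64; fp32 behaves
    # similarly at this scale): rounding error stays negligible.
    s = 0.0
    for x in xs:
        s += to_fp16(x)  # inputs are still fp16, only the accumulator is wide
    return s / len(xs)

values = [0.1] * 1000  # true mean is 0.1

err_fp16 = abs(mean_fp16_accum(values) - 0.1)
err_wide = abs(mean_wide_accum(values) - 0.1)
```

With 1000 elements the fp16 accumulator's error dominates the wide accumulator's error by orders of magnitude, which matches the symptom reported here: larger models have longer hidden dimensions, so their LayerNorm reductions sum more terms and hit this regime sooner than tiny does.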

winf commented Aug 2, 2024

@marty1885 @guynich

marty1885 (Contributor) commented

Thank you! This solves my problems too!

I don't have write access to the repository. Maybe you could make a PR and upstream the changes? I can test on my end.

winf mentioned this issue Aug 7, 2024
2 participants