You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have benchmarked with resnet50, resnet101, the bn_fusion performance improves with CPU (about 7%), but no improvement with cuda.
There is no noticeably difference between torch.cuda.cudnn.benchmark true and false.
My guess is that cudnn can optimize such case really good already.
I need to check this issue in more detail, but I will be back to it when I have more time. I was testing on CPU and I was getting performance gain and reduced memory usage.
I have benchmarked with resnet50, resnet101, the bn_fusion performance improves with CPU (about 7%), but no improvement with cuda.
There is no noticeably difference between
torch.cuda.cudnn.benchmark
true and false.My guess is that cudnn can optimize such case really good already.
my test code: https://github.com/xuyuan/pytorch_bn_fusion/blob/master/test_convert_inference.py
The text was updated successfully, but these errors were encountered: