-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
speed gpu #50
Comments
Thanks for your interest. The benchmark results on our 2080ti device are below:
May you provide more details about your benchmark results? |
Thanks for your reply. The benchmark results on ours 2080ti GPU are below: (if bs=1) MobileOne-s2 160 479 Does this mean that the model is difficult to apply to the problem of single graph transmission single graph inference under the high-speed camera? |
Thanks. We thought that it depends on the device. For example, RepViT-M0.9 runs as fast as MobileOne-S1 on iPhone 12 with bs=1. On the 2080Ti with bs=1, we suggest that you could locate some inference bottleneck. For example, SE layer with bs=1 may cause extra apparent latency on 2080Ti, which is not like on the iPhone. Besides, we suggest that you could improve the performance on 2080Ti with TensorRT. We will also try to improve the performance of RepViT in such case. |
您好,我有几个问题和发现。基于2080ti GPU对Repvit的不同尺寸规模的模型进行速度测试,其并不能展现比mobileOne-s2,s1以及fastvit-t8更高的速度。无论是throughput还是FPS等都比相关的同精度算法模型要慢。(对比上述模型主要是因为均采用结构重参数)
The text was updated successfully, but these errors were encountered: