speed gpu #50

Open
zzyy520 opened this issue Mar 30, 2024 · 3 comments

Comments

@zzyy520

zzyy520 commented Mar 30, 2024

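Hello, I have a few questions and findings. I benchmarked RepViT models of different sizes on a 2080 Ti GPU, and they do not show higher speed than MobileOne-S2, MobileOne-S1, or FastViT-T8. In both throughput and FPS, they are slower than comparable models at the same accuracy. (I compare against these models mainly because they all use structural re-parameterization.)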

@jameslahm
Collaborator

Thanks for your interest. The benchmark results on our 2080ti device are below:

| Model | Input | Throughput (bs=1024) |
|---|---|---|
| RepViT-M0.9 | 224 | 2870 |
| FastViT-T8 | 256 | 2379 (bs=768, because bs=1024 runs out of memory) |
| MobileOne-S1 | 224 | 2745 |

Could you provide more details about your benchmark results?
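For comparing numbers across setups, the measurement details matter: warm-up iterations, batch size, and waiting for the GPU to finish before stopping the clock. The following is a minimal sketch of a throughput loop, not the script used for the table above. `measure_throughput`, its parameters, and the `sync` hook are hypothetical names. With PyTorch on CUDA one would pass `torch.cuda.synchronize` as `sync`, because kernels are launched asynchronously.

```python
import time
from typing import Callable, Optional


def measure_throughput(
    run_batch: Callable[[], object],
    batch_size: int,
    warmup: int = 10,
    iters: int = 50,
    sync: Optional[Callable[[], None]] = None,
) -> float:
    """Return images per second for `run_batch`, which processes one batch.

    Hypothetical helper. `sync` should block until all queued device work
    has finished (for PyTorch on CUDA: torch.cuda.synchronize).
    """
    if batch_size <= 0 or iters <= 0:
        raise ValueError("batch_size and iters must be positive")

    # Warm-up iterations are excluded from the timed region.
    for _ in range(warmup):
        run_batch()
    if sync is not None:
        sync()

    start = time.perf_counter()
    for _ in range(iters):
        run_batch()
    if sync is not None:
        sync()  # wait for pending device work before reading the clock
    elapsed = time.perf_counter() - start

    # Guard against a zero reading for trivially fast workloads.
    return batch_size * iters / max(elapsed, 1e-12)


if __name__ == "__main__":
    # CPU stand-in for a model forward pass (assumption, for illustration).
    def fake_batch():
        return sum(i * i for i in range(10_000))

    print(f"{measure_throughput(fake_batch, batch_size=1):.1f} images/s")
```

Reporting the batch size, input resolution, and whether the model was re-parameterized (fused) before timing makes results like these easier to compare.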

@zzyy520
Author

zzyy520 commented Mar 30, 2024

Thanks for your reply. The benchmark results on our 2080 Ti GPU are below:

| Model | Input | Throughput (bs=512) |
|---|---|---|
| MobileOne-s2 | 160 | 4152 |
| MobileOne-s1 | 160 | 5523 |
| RepViT-M1 | 160 | 5522 |
| RepViT-M2 | 160 | 4708 |

With bs=1:

| Model | Input | Throughput (bs=1) |
|---|---|---|
| MobileOne-s2 | 160 | 479 |
| ....-s1 | 160 | 429 |
| RepViT-M1 | 160 | 200 |
| RepViT-M2 | 160 | 182 |
| FastVit-T8 | 160 | 325 |

Does this mean the model is hard to apply to single-image inference (bs=1), where each frame from a high-speed camera is transmitted and processed one at a time?
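At bs=1, throughput is the reciprocal of per-image latency, so the figures above can be converted to milliseconds per frame and compared with a camera's frame interval (1000/N ms for N frames per second). A small sketch using the numbers from the tables above follows. It assumes the throughput unit is images per second, the helper name `ms_per_batch` is hypothetical, and the conversion ignores data transfer and preprocessing time.

```python
def ms_per_batch(throughput_img_per_s: float, batch_size: int = 1) -> float:
    """Time to process one batch, in milliseconds, given images per second."""
    return 1000.0 * batch_size / throughput_img_per_s


# bs=1 throughput values copied from the table above.
bs1 = {"MobileOne-s2": 479, "RepViT-M1": 200, "RepViT-M2": 182, "FastVit-T8": 325}

for name, tp in bs1.items():
    print(f"{name}: {ms_per_batch(tp):.2f} ms per image")
# MobileOne-s2: 2.09 ms per image
# RepViT-M1: 5.00 ms per image
# RepViT-M2: 5.49 ms per image
# FastVit-T8: 3.08 ms per image

# For comparison, RepViT-M1 at bs=512 (5522 images/s) takes about
# 92.72 ms per batch, i.e. roughly 0.18 ms per image when amortized.
print(f"{ms_per_batch(5522, 512):.2f} ms per batch")
```

Under these numbers, RepViT-M1 at bs=1 would keep up with a camera only up to about 200 frames per second if frames are processed strictly one after another.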

@jameslahm
Collaborator

Thanks. We think it depends on the device. For example, RepViT-M0.9 runs as fast as MobileOne-S1 on an iPhone 12 with bs=1. On the 2080 Ti with bs=1, we suggest profiling the model to locate the inference bottleneck. For example, the SE layers may add noticeable latency at bs=1 on the 2080 Ti, which is not the case on the iPhone. We also suggest trying TensorRT to improve performance on the 2080 Ti. We will also try to improve the performance of RepViT in this case.
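One way to look for such a bottleneck is to time each stage of the forward pass separately at bs=1. The sketch below is framework-agnostic and hypothetical: `StageTimer` is not part of RepViT or PyTorch, and the workloads are CPU stand-ins. On a GPU each stage needs a device synchronization before the clock is read, for example by passing `torch.cuda.synchronize` as `sync`. PyTorch's own `torch.profiler` is likely the more precise tool.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StageTimer:
    """Accumulate wall-clock time per named stage (hypothetical helper)."""

    def __init__(self, sync=None):
        self.sync = sync  # optional callable that waits for device work
        self.totals = defaultdict(float)
        self.counts = defaultdict(int)

    @contextmanager
    def stage(self, name):
        if self.sync is not None:
            self.sync()  # do not charge earlier pending work to this stage
        start = time.perf_counter()
        try:
            yield
        finally:
            if self.sync is not None:
                self.sync()  # include this stage's own pending work
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def report(self):
        """Return (name, mean seconds per call) pairs, slowest first."""
        means = {n: self.totals[n] / self.counts[n] for n in self.totals}
        return sorted(means.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    timer = StageTimer()
    for _ in range(100):
        with timer.stage("conv_stand_in"):
            sum(i * i for i in range(2_000))
        with timer.stage("se_stand_in"):
            sum(range(200))
    for name, mean_s in timer.report():
        print(f"{name}: {mean_s * 1e6:.1f} us per call")
```

Synchronizing around every stage adds overhead of its own, so the per-stage times are best read as relative shares rather than absolute latencies.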
