Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于性能分析的一点疑惑 #60

Open
Zhiy-Zhang opened this issue Apr 25, 2024 · 1 comment
Open

关于性能分析的一点疑惑 #60

Zhiy-Zhang opened this issue Apr 25, 2024 · 1 comment

Comments

@Zhiy-Zhang
Copy link

What are the problems?(screenshots or detailed error messages)

想问下有性能分析的工具嘛?profiler相关,还是只能用nsight profile这种自己去看一些算子性能

What are the types of GPU/CPU you are using?

GPU:A100-80G-SXM4

What's the operating system ppl.llm.serving runs on?

Ubuntu 20.04.4
cuda:12.3
cudnn:8904
trt:9.2.0

What's the compiler and its version?

gcc 11.4
cmake version 3.27.9
Cuda compilation tools, release 12.3, V12.3.107

Which version(commit id or tag) of ppl.llm.serving is used?

commit id:51c3b3d5c5eba25c276a84388f04a2c9e198699f

@Vincent-syr
Copy link
Collaborator

Vincent-syr commented Apr 28, 2024

serving整体的profiling信息根据宏“PPL_LLM_ENABLE_PROFILING”输出,默认是打开的,算子的profiling信息需要nsight去看,建议跑offline_inference。如果想要单step的kernel profling信息,可以参考https://github.com/openppl-public/ppl.nn/blob/master/tools/pplnn_llm.cc#L819
,编译时“-DPPLNN_ENABLE_KERNEL_PROFILING=ON”

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants