关于性能分析的一点疑惑 #60

Zhiy-Zhang · 2024-04-25T06:15:33Z

What are the problems?(screenshots or detailed error messages)

想问下有性能分析的工具嘛？profiler相关，还是只能用nsight profile这种自己去看一些算子性能

What are the types of GPU/CPU you are using?

GPU：A100-80G-SXM4

What's the operating system ppl.llm.serving runs on?

Ubuntu 20.04.4
cuda：12.3
cudnn：8904
trt：9.2.0

What's the compiler and its version?

gcc 11.4
cmake version 3.27.9
Cuda compilation tools, release 12.3, V12.3.107

Which version(commit id or tag) of ppl.llm.serving is used?

commit id：51c3b3d5c5eba25c276a84388f04a2c9e198699f

Vincent-syr · 2024-04-28T06:21:48Z

serving整体的profiling信息根据宏“PPL_LLM_ENABLE_PROFILING”输出，默认是打开的，算子的profiling信息需要nsight去看，建议跑offline_inference。如果想要单step的kernel profling信息，可以参考https://github.com/openppl-public/ppl.nn/blob/master/tools/pplnn_llm.cc#L819
，编译时“-DPPLNN_ENABLE_KERNEL_PROFILING=ON”

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

关于性能分析的一点疑惑 #60

关于性能分析的一点疑惑 #60

Zhiy-Zhang commented Apr 25, 2024

Vincent-syr commented Apr 28, 2024 •

edited

Loading

关于性能分析的一点疑惑 #60

关于性能分析的一点疑惑 #60

Comments

Zhiy-Zhang commented Apr 25, 2024

What are the problems?(screenshots or detailed error messages)

What are the types of GPU/CPU you are using?

What's the operating system ppl.llm.serving runs on?

What's the compiler and its version?

Which version(commit id or tag) of ppl.llm.serving is used?

Vincent-syr commented Apr 28, 2024 • edited Loading

Vincent-syr commented Apr 28, 2024 •

edited

Loading