Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does it support NVIDIA TensorRT #108

Open
ivoxx opened this issue Aug 5, 2021 · 19 comments
Open

Does it support NVIDIA TensorRT #108

ivoxx opened this issue Aug 5, 2021 · 19 comments

Comments

@ivoxx
Copy link

ivoxx commented Aug 5, 2021

Does it support NVIDIA TensorRT?

  1. inside container cannot read tensorrt engine which generate from same deivce and cuda version ,shows

[TensorRT] WARNING: using an engine plan file across different models of devices is not recommended and is likely to affect performance or enven cause errors

  1. inside container some onnx models cannot build tensorrt engine, report segmentation fault

any solution or suggestion?

@xwttzz
Copy link

xwttzz commented Aug 5, 2021

I had the same problem. It is normal to use the whole card, but not once it is fragmented.

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 5, 2021

The gpu-manager version and log?

@xwttzz
Copy link

xwttzz commented Aug 5, 2021

image
日志看起来是正常的。 manager的版本是1.1.4 @mYmNeo 方便加个微信号?

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 6, 2021

Please follow the FAQ, provides the application container log

@ivoxx
Copy link
Author

ivoxx commented Aug 6, 2021

2

1

@xwttzz
Copy link

xwttzz commented Aug 6, 2021

image
this is the container log. @mYmNeo

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 9, 2021

You need following the FAQ, set environment first, then run your application, the expected log should have /tmp/xxx pattern

@xwttzz
Copy link

xwttzz commented Aug 10, 2021

I have followed the FAQ。
image
I can't find any useful logs in this directory。Which log in this directory do you want。
@mYmNeo

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 10, 2021

I have followed the FAQ。
image
I can't find any useful logs in this directory。Which log in this directory do you want。
@mYmNeo

  1. Export LOGGER_LEVEL environment
  2. run your application
  3. vcuda log will print on the screen

@xwttzz
Copy link

xwttzz commented Aug 10, 2021

https://gist.github.com/xwttzz/1f4b3794a2fb19f430ebea828030d145
The above link shows all logs after VCUDA is enabled。
@mYmNeo

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 11, 2021

https://gist.github.com/xwttzz/1f4b3794a2fb19f430ebea828030d145
The above link shows all logs after VCUDA is enabled。
@mYmNeo

The error shows you application missing library ImportError: libnvinfer.so.7: cannot open shared object file: No such file or directory, this is not VCUDA library

@xwttzz
Copy link

xwttzz commented Aug 11, 2021

The confusing part is that when I set vcore to 100 it is ok, only when it is fragmented 。 Besides, it has been confirmed that LibnVinfer has been installed in the container.
@mYmNeo

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 11, 2021

How about re-pull the image thomassong/gpu-manager:1.1.5. We have fixed a recursive problem about vcuda.

@xwttzz
Copy link

xwttzz commented Aug 11, 2021

I updated the latest log in the original address. [https://gist.github.com/xwttzz/1f4b3794a2fb19f430ebea828030d145]
Libnvinfer was a historical log and it has nothing to do with this。
@mYmNeo

@mYmNeo
Copy link
Contributor

mYmNeo commented Aug 12, 2021

https://gist.github.com/xwttzz/1f4b3794a2fb19f430ebea828030d145

I think you should debug the coredump to find where it crash

@austingg
Copy link

austingg commented May 5, 2022

@xwttzz have u solved the problem ?

@so2bin
Copy link

so2bin commented Sep 15, 2022

I have the same problem, is anyone have a solution?

@alex337
Copy link

alex337 commented Dec 1, 2022

I have the same problem, is anyone have a solution?

Hi,have u solved the problem ?

@vicmeng
Copy link

vicmeng commented Oct 9, 2023

tkestack/vcuda-controller#38 (comment)
具体报错

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants