System Info
T4 GPU
TensorRT-LLM: 0.15.0.dev2024111200
Who can help?
@ncomly-nvidia , @juney-nvidia , @byshiue
Information
Tasks
- An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
- My own task or dataset (give details below)
Reproduction
Build models for tensorrt-llm engine on T4 GPU.
Expected behavior
Should work on T4 GPU
actual behavior
I am seeing this warning message:
[12/24/2024-10:21:22] [TRT-LLM] [W] Failed to infer cluster info for Tesla T4, treat it as a L40 node with 15 GB memory. This setting makes no effect if you do not use auto parallel.
How do I get it to recognize T4?
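The fallback the warning describes can be sketched as follows. This is a hypothetical illustration, not TensorRT-LLM's actual lookup table or function names: when the reported device name is missing from the known-cluster table, auto parallel substitutes a default entry (an L40-like node with 15 GB), which is why "Tesla T4" triggers the message.

```python
# Illustrative sketch of the cluster-info fallback described in the warning.
# KNOWN_CLUSTERS and infer_cluster are hypothetical names; the entries are
# examples, not the library's real data.
KNOWN_CLUSTERS = {
    "NVIDIA A100-SXM4-80GB": ("A100", 80),
    "NVIDIA L40": ("L40", 48),
}

def infer_cluster(device_name: str):
    # Unknown names fall back to an L40-like node with 15 GB of memory,
    # mirroring the log line emitted for "Tesla T4".
    return KNOWN_CLUSTERS.get(device_name, ("L40", 15))

print(infer_cluster("Tesla T4"))  # ('L40', 15)
```

As the warning notes, this fallback only matters when auto parallel is enabled; a plain single-GPU build on the T4 should be unaffected by it.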
additional notes
The model works on A100, in both single- and multi-GPU setups.