Issue
When using fairseq and TensorRT-LLM for inference, I encountered an issue.
In the model's vocabulary:
</s>: 0
<pad>: 2
<unk>: 1
When using TensorRT-LLM, decoder_input_ids must be set to 2 (the <pad> token in the vocabulary above) for the model to work correctly.
If I set decoder_input_ids to another token ID (e.g., 0 for </s> or 1 for <unk>), the model does not work properly and does not produce the expected output.
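For reference, one way to double-check which IDs the fairseq side actually uses for the special tokens is to load the dictionary that ships with the checkpoint. This is only a sketch; the "dict.txt" path is a placeholder for the model's real target dictionary file.

# Sketch only: check which IDs fairseq assigns to the special tokens,
# since they may differ from what the raw dictionary file suggests.
from fairseq.data import Dictionary

tgt_dict = Dictionary.load("dict.txt")  # placeholder: dictionary shipped with the checkpoint

# fairseq's Dictionary prepends <s>, <pad>, </s>, <unk> itself, so read the IDs
# from the loaded object rather than from the file.
print("bos:", tgt_dict.bos(), tgt_dict[tgt_dict.bos()])
print("pad:", tgt_dict.pad(), tgt_dict[tgt_dict.pad()])
print("eos:", tgt_dict.eos(), tgt_dict[tgt_dict.eos()])
print("unk:", tgt_dict.unk(), tgt_dict[tgt_dict.unk()])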
Environment
fairseq version: 0.12.2
TensorRT-LLM version: 0.9.0
Example
The output of the fairseq model:
The output of TensorRT-LLM (decoder_start_token_id = 2):
The output of TensorRT-LLM (decoder_start_token_id = 0):
Do the token IDs in the TensorRT-LLM engine need to be consistent with the token IDs in the fairseq model?
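In case it helps narrow this down, here is a rough sketch (not the actual TensorRT-LLM API; "dict.txt" and the batch size are placeholders) of deriving the decoder start ID from the fairseq dictionary instead of hard-coding it, since fairseq's SequenceGenerator seeds the decoder with the dictionary's eos index by default:

# Sketch only: build decoder_input_ids from the fairseq dictionary so the IDs
# fed to the TensorRT-LLM decoder match the IDs the model was trained with.
import torch
from fairseq.data import Dictionary

tgt_dict = Dictionary.load("dict.txt")   # placeholder path
start_id = tgt_dict.eos()                # fairseq seeds decoding with eos by default
batch_size = 1                           # placeholder

decoder_input_ids = torch.full((batch_size, 1), start_id, dtype=torch.int32)
# decoder_input_ids (or start_id) would then be handed to the TensorRT-LLM
# enc-dec runtime in place of a hard-coded decoder_start_token_id.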