Skip to content

problem with output_log_probs #632

Open
@Alireza3242

Description

@Alireza3242

I run triton with tensorrtllm. But when i give long text to llm, triton returns a long array of zeros named output_log_probs in every token. If my text be longer than some number, the request not work truly.

can you add a option to config.pbtxt that prevent sending output_log_probs?

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions