[python] fix last token fetch logic #2423

sindhuvahinis · 2024-10-08T22:53:53Z

Description

When exception occurs in inference, we catch it and set the last token as dummy token. So atleast one token should be set, even if exception occurred.

But to check whether last_token_index is set or not, we check if _last_token_index will be false when last_token_index=0.

This PR reverts the yesterday's PR.

Tested it manually in my ec2 machine

[python] fix last token fetch logic

9cbd7c0

sindhuvahinis requested review from zachgk and a team as code owners October 8, 2024 22:53

siddvenk approved these changes Oct 9, 2024

View reviewed changes

sindhuvahinis merged commit 79d0c3e into deepjavalibrary:master Oct 9, 2024
9 checks passed

sindhuvahinis added a commit to sindhuvahinis/djl-serving that referenced this pull request Oct 9, 2024

[python] fix last token fetch logic (deepjavalibrary#2423)

7e4b02e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python] fix last token fetch logic #2423

[python] fix last token fetch logic #2423

sindhuvahinis commented Oct 8, 2024 •

edited

Loading

[python] fix last token fetch logic #2423

[python] fix last token fetch logic #2423

Conversation

sindhuvahinis commented Oct 8, 2024 • edited Loading

Description

sindhuvahinis commented Oct 8, 2024 •

edited

Loading