Add `add_bos_token` for Llama3 evaluation #2179

Kaihui-intel · 2025-04-22T07:07:53Z

Type of Change

bug fix

Description

model: meta-llama/Llama-3.1-8B-Instruct
add_bos_token default is False.
If the model was trained or fine-tuned with a BOS token, this may lead to incorrect results.

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Kaihui-intel <[email protected]>

Copilot

Pull Request Overview

This PR introduces a fix by adding an "--add_bos_token" argument to support proper evaluation when the model is fine-tuned with a beginning-of-sequence token.

Added an argument for "add_bos_token" in the CLI parser
Modified the function call to include the "add_bos_token" parameter for evaluation

...age-modeling/quantization/transformers/weight_only/text-generation/run_generation_cpu_woq.py

Co-authored-by: Copilot <[email protected]>

xin3he

good catch!

XuehaoSun · 2025-04-23T07:12:36Z

add_bos_token for llama3

5bb594f

Signed-off-by: Kaihui-intel <[email protected]>

Kaihui-intel requested review from xin3he, XuehaoSun and Copilot April 22, 2025 07:07

Copilot AI reviewed Apr 22, 2025

View reviewed changes

...age-modeling/quantization/transformers/weight_only/text-generation/run_generation_cpu_woq.py Outdated Show resolved Hide resolved

Update code

18e421e

Co-authored-by: Copilot <[email protected]>

xin3he approved these changes Apr 22, 2025

View reviewed changes

XuehaoSun approved these changes Apr 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `add_bos_token` for Llama3 evaluation #2179

Add `add_bos_token` for Llama3 evaluation #2179

Kaihui-intel commented Apr 22, 2025

Copilot AI left a comment

xin3he left a comment

XuehaoSun commented Apr 23, 2025

Add add_bos_token for Llama3 evaluation #2179

Are you sure you want to change the base?

Add add_bos_token for Llama3 evaluation #2179

Conversation

Kaihui-intel commented Apr 22, 2025

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

xin3he left a comment

Choose a reason for hiding this comment

XuehaoSun commented Apr 23, 2025

Add `add_bos_token` for Llama3 evaluation #2179

Add `add_bos_token` for Llama3 evaluation #2179