[Bug] Prompt construction of class ToolLLaMA and ToolLLaMALoRA #304

Open
yuyq18 opened this issue Sep 2, 2024 · 0 comments
yuyq18 commented Sep 2, 2024

The prompt construction code in toolbench/inference/LLM/tool_llama_model.py#L97-#L103:

for message in conversation_history:
    role = roles[message['role']]
    content = message['content']
    if role == "System" and functions != []:
        content = process_system_message(content, functions)
    prompt += f"{role}: {content}\n"
prompt += "Assistant:\n"

When the role is Assistant, the content included in the prompt contains only the Thought and omits the Action and Action Input, because the action details are stored in the message's function_call key rather than in content.

Here is the conversation_history construction code in toolbench/inference/LLM/tool_llama_model.py#L116-#L123:

message = {
    "role": "assistant",
    "content": thought,
    "function_call": {
        "name": action,
        "arguments": action_input
    }
}

This bug makes the assistant portion of the inference-time prompt inconsistent with the prompt used during training, potentially degrading evaluation performance.
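
A minimal sketch of one possible fix (not an official patch): append the Action and Action Input stored in function_call to the assistant content before the turn is added to the prompt, so the inference-time prompt carries the same fields as the training data. The "Action:" / "Action Input:" labels and the assumption that roles maps "assistant" to "Assistant" are guesses here and should be checked against the training-time preprocessing.

for message in conversation_history:
    role = roles[message['role']]
    content = message['content']
    if role == "System" and functions != []:
        content = process_system_message(content, functions)
    if role == "Assistant" and "function_call" in message:
        # function_call carries the action details that the current loop drops
        function_call = message["function_call"]
        content = (
            f"{content}\n"
            f"Action: {function_call['name']}\n"
            f"Action Input: {function_call['arguments']}"
        )
    prompt += f"{role}: {content}\n"
prompt += "Assistant:\n"

An equivalent alternative would be to fold the function_call fields into content where conversation_history is built (around #L116-#L123), leaving the prompt loop unchanged.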
