Implement Node Generator Module for vllm Serving API Integration #1062

korjsh · 2024-12-18T01:59:36Z

Purpose of Development

With the existing vllm module, every test reinitializes the model, which adds significant time to each run.
Connecting to an already-running vLLM server over its API, as the OpenAI module does, avoids this reinitialization and makes testing faster.

Testing Instructions

Start the vllm serving server:

    vllm serve Qwen/Qwen2.5-14B-Instruct-AWQ -q awq --port 8012
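
Once the server is up, it can be queried over HTTP without reloading the model. The following is a minimal sketch, assuming the server exposes vLLM's OpenAI-compatible `/v1/completions` route on the port used above. `build_completion_request` and `complete` are hypothetical helper names for illustration and are not part of this PR.

```python
import json
import urllib.request


def build_completion_request(uri, model, prompt, temperature=0.0, max_tokens=400):
    """Build (but do not send) a POST request for the /v1/completions route."""
    payload = {
        "model": model,
        "prompt": prompt,
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        url=uri.rstrip("/") + "/v1/completions",  # assumed OpenAI-compatible route
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(uri, model, prompt, **kwargs):
    """Send the request and return the first generated text (needs a live server)."""
    request = build_completion_request(uri, model, prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        body = json.load(response)
    return body["choices"][0]["text"]


# Example call, only meaningful while the server from the command above is running:
# complete("http://localhost:8012", "Qwen/Qwen2.5-14B-Instruct-AWQ", "Hello,")
```

Keeping request construction separate from sending makes the payload easy to inspect without a running server.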

Here is a sample evaluate_config.yaml file:

    - node_type: generator
      strategy:
        metrics:
          - metric_name: rouge
      modules:
        - module_type: vllm_api
          uri: http://localhost:8012
          llm: Qwen/Qwen2.5-14B-Instruct-AWQ
          temperature: [0, 0.5]
          max_tokens: 400
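
The list-valued `temperature` in this config means the module is tried once per listed value. A rough sketch of that expansion, assuming list-valued parameters are combined by Cartesian product; `expand_module_params` is a hypothetical helper written for illustration, not AutoRAG's actual function.

```python
from itertools import product


def expand_module_params(module):
    """Expand list-valued parameters into one dict per combination."""
    keys = list(module)
    # Wrap scalars in a one-element list so product() treats all keys alike.
    value_lists = [v if isinstance(v, list) else [v] for v in module.values()]
    return [dict(zip(keys, combo)) for combo in product(*value_lists)]


module = {
    "module_type": "vllm_api",
    "uri": "http://localhost:8012",
    "llm": "Qwen/Qwen2.5-14B-Instruct-AWQ",
    "temperature": [0, 0.5],
    "max_tokens": 400,
}

# One candidate per temperature; all other parameters are shared.
for candidate in expand_module_params(module):
    print(candidate["temperature"], candidate["max_tokens"])
```

Under this assumption the sample config yields two generator candidates, both served by the same running model.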

Execute the evaluation process.

vkehfdl1 · 2024-12-19T01:27:40Z

@korjsh Hello! Thanks for the contribution.
Before merging, we need two more things.

The test code. Please add tests under tests/autorag/nodes/generator.
The docs. Please add a new documentation page under docs/source/nodes/generator.

If writing the test code and docs feels like too much, we can cover it.
If you want the PR merged faster, it is always great to do it yourself.

Thanks a lot :)

korjsh and others added 7 commits December 17, 2024 08:06

fix: Poetry shell failed due to incorrect pyproject.toml format

e47ca51

fix: Improved poetry pyproject.toml format

bac819e

fix: change tool.poetry version comment

fd5d0f8

add: vLLM serving API functionality to Nodes Generator

a5689bd

Merge branch 'Marker-Inc-Korea:main' into main

6f42c43

update: translate korean comment to english

c643161

fix: truncate the prompt by token to fit the maximum model length.

0b27557
