
llama3.2 + cross attn test #220

Merged
merged 7 commits into from
Oct 4, 2024

Conversation

maleksan85

Commands:
server

 VLLM_NO_TUNED_GEMM=1 vllm serve /data/models/Llama-3.2-90B-Vision-Instruct --tensor_parallel_size 2 --enforce-eager --limit-mm-per-prompt "image=2" --max-num-seqs 32 --max_model_len 8192

client (from https://huggingface.co/nltpt/VLLM-llama3.2):

root@banff-cyxtera-s82-5:~/workspace/VLLM-llama3.2# python openai_vision_api_client.py
Chat completion output: The image depicts a serene lake scene with a wooden dock extending into the water, surrounded by lush greenery and majestic mountains in the background. The overall atmosphere of the image exudes tranquility and natural beauty, inviting the viewer to step into its peaceful world.
remove me: testing done, exiting...
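For reference, a minimal sketch of what the `openai_vision_api_client.py` request shape looks like against the server started above. This is an illustrative reconstruction, not the actual script from the linked repo: the helper name `build_vision_messages`, the `localhost:8000` endpoint, and the image filename are assumptions; only the model path and the OpenAI-compatible chat format are taken from the PR description.

```python
# Sketch of an OpenAI-compatible vision request payload for the vLLM server
# launched with `vllm serve /data/models/Llama-3.2-90B-Vision-Instruct ...`.
# `build_vision_messages` is a hypothetical helper, not part of vLLM.
import base64


def build_vision_messages(prompt: str, image_bytes: bytes) -> list:
    """Build an OpenAI chat `messages` list with one text part and one
    base64-encoded image part, as accepted by vLLM's OpenAI server."""
    b64 = base64.b64encode(image_bytes).decode("utf-8")
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/jpeg;base64,{b64}"},
                },
            ],
        }
    ]


# Sending the request would look roughly like this (requires the `openai`
# package and the server above running locally; endpoint is an assumption):
#   from openai import OpenAI
#   client = OpenAI(api_key="EMPTY", base_url="http://localhost:8000/v1")
#   resp = client.chat.completions.create(
#       model="/data/models/Llama-3.2-90B-Vision-Instruct",
#       messages=build_vision_messages(
#           "What is in this image?", open("lake.jpg", "rb").read()
#       ),
#   )
#   print(resp.choices[0].message.content)
```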

Collaborator

@shajrawi shajrawi left a comment

Great work!!
A few nits + one question.
Also, can you please make the linter happy? :)

tests/kernels/test_encoder_decoder_attn.py Outdated Show resolved Hide resolved
tests/kernels/utils.py Show resolved Hide resolved
vllm/attention/backends/rocm_flash_attn.py Outdated Show resolved Hide resolved
vllm/attention/backends/rocm_flash_attn.py Outdated Show resolved Hide resolved
vllm/model_executor/layers/linear.py Outdated Show resolved Hide resolved
shajrawi previously approved these changes Oct 4, 2024

Collaborator

@shajrawi shajrawi left a comment

Looks good - assuming performance does not regress due to reshape

@maleksan85 maleksan85 merged commit 2550f14 into main Oct 4, 2024
16 of 17 checks passed
@gshtras gshtras deleted the maleksan_llama32_support branch October 24, 2024 18:55