[ROCm][AMD][Model] llama 3.2 support upstreaming #12421
Conversation
Signed-off-by: Aleksandr Malyshev <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
Signed-off-by: Aleksandr Malyshev <[email protected]>
Thanks for contributing. Added some comments for this PR.
vllm/model_executor/models/mllama.py
        i,
    )
elif self.attn.backend in (_Backend.XFORMERS, _Backend.TORCH_SDPA):
    if current_platform.is_rocm():
Can you merge this code path with if self.attn.backend in (_Backend.FLASH_ATTN, _Backend.FLASH_ATTN_VLLM_V1):?
Normally I would not change the code, since the purpose of this PR is to match the ROCm repo with upstream without introducing more problems. But the code in the two branches looks identical, so there is no reason to keep both. Thank you for catching it!
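For illustration, here is a minimal self-contained sketch of the branch-merging idea discussed above. The _Backend enum and the needs_unified_path helper below are local stand-ins written for this example, not the real vLLM definitions; the point is only that the ROCm-specific (XFORMERS, TORCH_SDPA) case and the (FLASH_ATTN, FLASH_ATTN_VLLM_V1) case can share one dispatch decision when their bodies are identical.

from enum import Enum, auto

# Stand-in for vLLM's _Backend enum; only the members referenced in this review.
class _Backend(Enum):
    FLASH_ATTN = auto()
    FLASH_ATTN_VLLM_V1 = auto()
    XFORMERS = auto()
    TORCH_SDPA = auto()

def needs_unified_path(backend: _Backend, is_rocm: bool) -> bool:
    # If the ROCm-only (XFORMERS, TORCH_SDPA) branch runs the same code as the
    # (FLASH_ATTN, FLASH_ATTN_VLLM_V1) branch, both can collapse into a single
    # condition instead of duplicating the body in two places.
    if backend in (_Backend.FLASH_ATTN, _Backend.FLASH_ATTN_VLLM_V1):
        return True
    return backend in (_Backend.XFORMERS, _Backend.TORCH_SDPA) and is_rocm

# Example: on ROCm with TORCH_SDPA the shared path applies; off ROCm it does not.
print(needs_unified_path(_Backend.TORCH_SDPA, is_rocm=True))   # True
print(needs_unified_path(_Backend.XFORMERS, is_rocm=False))    # False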
Signed-off-by: Aleksandr Malyshev <[email protected]>
LGTM, can you merge from main to fix the CI failures?
Signed-off-by: Aleksandr Malyshev <[email protected]>
@DarkLight1337 do you have any other concerns about this change? It would be nice to land it ASAP to prevent merge conflicts. Thanks
I have started the merge process.
LGTM, thanks for your patience!
PR to propagate multimodal Llama 3.2 support into upstream for the ROCm architecture.