Pull requests: ROCm/vllm

#246: Fix kernel cache miss and add RDNA configs (opened Oct 25, 2024 by hyoon1)
#229: Fused ROPE and reshape cache kernel (opened Oct 11, 2024 by maleksan85)
#192: Update run-amd-test.sh (opened Sep 17, 2024 by Alexei-V-Ivanov-AMD)
#143: multi-gpu fused_moe tuning support (opened Aug 16, 2024 by divakar-amd)
#127: [DO NOT MERGE] Vinayak/moe final hashem (opened Aug 11, 2024 by carlushuang)
#122: Add max-batch-size to benchmark_throughput.py (opened Aug 7, 2024 by dllehr-amd)
#117: Add truncate to all files after json dump (opened Aug 2, 2024 by jpvillam-amd)
#115: [Misc] Use main triton branch (opened Aug 1, 2024 by binarman)
#113: Adding SHM broadcast to ROCm/vllm (opened Jul 31, 2024 by Lzy17)
#104: optimizations for process output step (opened Jul 25, 2024 by sanyalington)
#97: Update QueueLLM (opened Jul 22, 2024 by gyulaz-htec)
#96: Add benchmark_latency_batched.py (opened Jul 22, 2024 by dllehr-amd)
#94: New LLM for MLPerf Server scenario serving (opened Jul 19, 2024 by gyulaz-htec)
#89: Add VLLM_SCHED_PREFILL_KVC_FREEPCT (opened Jul 18, 2024 by sanyalington)
#71: Torchrun api server (opened Jun 27, 2024 by gshtras)
#21: Update on naive_attn module (opened May 28, 2024 by seungrokj)