Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,489 workflow runs
1,489 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Core] Reduce unnecessary compute when logprobs=None
Add label on auto-merge enabled #64: Pull request #6532 auto_merge_enabled by comaniac
July 23, 2024 02:19 11s
July 23, 2024 02:19 11s
[Bugfix] Fix null modules_to_not_convert in FBGEMM Fp8 quantization
Add label on auto-merge enabled #63: Pull request #6665 auto_merge_enabled by robertgshaw2-redhat
July 23, 2024 00:58 10s
July 23, 2024 00:58 10s
[Core] Modulize prepare input and attention metadata builder
Add label on auto-merge enabled #62: Pull request #6596 auto_merge_enabled by comaniac
July 22, 2024 23:16 11s
July 22, 2024 23:16 11s
[Frontend] Kill the server on engine death
Add label on auto-merge enabled #61: Pull request #6594 auto_merge_enabled by Yard1
July 22, 2024 23:09 14s
July 22, 2024 23:09 14s
[Misc] Remove deprecation warning for beam search
Add label on auto-merge enabled #60: Pull request #6659 auto_merge_enabled by WoosukKwon
July 22, 2024 23:04 21s
July 22, 2024 23:04 21s
[Bugfix][Kernel] Use int64_t for indices in fp8 quant kernels
Add label on auto-merge enabled #59: Pull request #6649 auto_merge_enabled by robertgshaw2-redhat
July 22, 2024 14:45 12s
July 22, 2024 14:45 12s
[Bugfix] Fix vocab_size field access in LLaVA models
Add label on auto-merge enabled #58: Pull request #6624 auto_merge_enabled by DarkLight1337
July 22, 2024 03:40 14s
July 22, 2024 03:40 14s
[ CI ] Awq Marlin Integration Tests
Add label on auto-merge enabled #57: Pull request #6627 auto_merge_enabled by robertgshaw2-redhat
July 22, 2024 01:01 10s
July 22, 2024 01:01 10s
[ Kernel ] Enable fp8-marlin for fbgemm-fp8 models
Add label on auto-merge enabled #56: Pull request #6606 auto_merge_enabled by mgoin
July 20, 2024 18:32 11s
July 20, 2024 18:32 11s
[Misc] Consolidate and optimize logic for building padded tensors
Add label on auto-merge enabled #55: Pull request #6541 auto_merge_enabled by DarkLight1337
July 20, 2024 03:37 11s
July 20, 2024 03:37 11s
[ Misc ] fbgemm checkpoints
Add label on auto-merge enabled #54: Pull request #6559 auto_merge_enabled by mgoin
July 20, 2024 01:37 10s
July 20, 2024 01:37 10s
[ Kernel ] FP8 Dynamic Per Token Quant - Add scale_ub
Add label on auto-merge enabled #53: Pull request #6593 auto_merge_enabled by mgoin
July 20, 2024 00:35 12s
July 20, 2024 00:35 12s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #52: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 23:34 14s
July 19, 2024 23:34 14s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #51: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 22:27 10s
July 19, 2024 22:27 10s
[Bugfix] [SpecDecode] AsyncMetricsCollector: update time since last collection
Add label on auto-merge enabled #50: Pull request #6578 auto_merge_enabled by cadedaniel
July 19, 2024 20:53 11s
July 19, 2024 20:53 11s
[ Kernel ] Enable Dynamic Per Token fp8
Add label on auto-merge enabled #49: Pull request #6547 auto_merge_enabled by robertgshaw2-redhat
July 19, 2024 18:34 13s
July 19, 2024 18:34 13s
[Misc] Fix input_scale typing in w8a8_utils.py
Add label on auto-merge enabled #48: Pull request #6579 auto_merge_enabled by mgoin
July 19, 2024 14:31 11s
July 19, 2024 14:31 11s
[Bugfix][Frontend] remove duplicate init logger
Add label on auto-merge enabled #47: Pull request #6581 auto_merge_enabled by DarkLight1337
July 19, 2024 14:19 13s
July 19, 2024 14:19 13s
[BUGFIX] Raise an error for no draft token case when draft_tp>1
Add label on auto-merge enabled #46: Pull request #6369 auto_merge_enabled by cadedaniel
July 19, 2024 08:21 9s
July 19, 2024 08:21 9s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #45: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 05:06 11s
July 19, 2024 05:06 11s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-redhat
July 18, 2024 22:00 13s
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
July 18, 2024 21:49 12s
ProTip! You can narrow down the results and go further in time using created:<2024-07-18 or the other filters available.