AI/ML Platform @Roblox; Working on @vllm-project when I catch a breath
- Roblox
- San Mateo
- in/rogerywang
- @rogerw0108
Pinned
- vllm-project/vllm (Public)
  A high-throughput and memory-efficient inference and serving engine for LLMs
- vllm (Public, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python
708 contributions in the last year
Activity overview
Contribution activity
March 2025
Created 1 commit in 1 repository
Created a pull request in vllm-project/vllm that received 7 comments
[Misc][V1] Avoid using envs.VLLM_USE_V1 in mm processing
The main difference between V0 and V1 multimodal input processing is that in V1 we need mm_hashes for downstream tasks (prefix caching, feature cac…
+38 −8 lines changed • 7 comments
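For context on the PR description above (a sketch of the general idea, not vLLM's actual implementation): an mm_hash is a content hash of a multimodal input, so that identical images or audio clips map to the same cache key and their precomputed features can be reused by a prefix cache. The helper name `compute_mm_hash` below is hypothetical.

```python
import hashlib
import pickle

def compute_mm_hash(mm_item: object) -> str:
    """Hypothetical helper: content-hash a multimodal input so that
    identical inputs produce identical cache keys."""
    payload = pickle.dumps(mm_item)  # serialize the item deterministically
    return hashlib.sha256(payload).hexdigest()

# Two identical inputs yield the same hash, so a cache keyed on the
# hash can skip recomputing their features.
a = compute_mm_hash({"image": b"\x89PNG...", "size": (224, 224)})
b = compute_mm_hash({"image": b"\x89PNG...", "size": (224, 224)})
assert a == b
```

The same idea extends naturally to prefix caching: hashing each multimodal item lets the engine detect repeated prefixes without holding the raw image or audio bytes in the cache key.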
Reviewed 13 pull requests in vllm-project/vllm
-
[Doc] Fix a typo
This contribution was made on Mar 7
-
[BUGFIX] Skip tokenization support for throughput benchmark
This contribution was made on Mar 5
-
[Bugfix][V1] Fix allowed_token_ids for v1 Sampler
This contribution was made on Mar 5
-
[V1] V1 Enablement Oracle
This contribution was made on Mar 5
-
[Misc][V1] Avoid using envs.VLLM_USE_V1 in mm processing
This contribution was made on Mar 5
-
[Model] New model support for Phi-4-multimodal-instruct
This contribution was made on Mar 4
-
[V1][Bugfix] Do not reset prefix caching metrics
This contribution was made on Mar 4
-
[V1][Molmo] Fix get_multimodal_embeddings() in molmo.py
This contribution was made on Mar 4
-
[sleep mode] error out with expandable_segments
This contribution was made on Mar 4
-
[Model] Extend Ultravox to accept audio longer than 30s
This contribution was made on Mar 4
-
[Misc] typo find in deepseek_v2
This contribution was made on Mar 3
-
[Doc] V1 user guide
This contribution was made on Mar 2
-
[Misc] Accurately capture the time of loading weights
This contribution was made on Mar 1
Created an issue in vllm-project/vllm that received 1 comment
[Usage]: Clean up Engine Args & Documentation
Your current environment Currently vLLM has a lot of engine arguments listed here https://docs.vllm.ai/en/latest/serving/engine_args.html. Over tim…
1 task done