-
Notifications
You must be signed in to change notification settings - Fork 605
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] Due to GIL issues, the overlap mode doesn't actually always bring benefits?
#2573
opened Dec 25, 2024 by
CSEEduanyu
2 tasks done
[Feature] (Willing to PR) Proposal: Drop-in fast replacement of New feature or request
feature
high priority
RLHF
Using SGLang for post training
PreTrainedModel.generate
collaboration
enhancement
#2569
opened Dec 24, 2024 by
fzyzcjy
2 tasks done
[Feature] Running multi-node offline inference via SLURM
feature
help wanted
Extra attention is needed
#2561
opened Dec 23, 2024 by
aflah02
2 tasks done
[Feature] Improve the Zero-Overhead Batch Scheduler performance for the small model
#2558
opened Dec 23, 2024 by
libratiger
2 tasks done
[Bug] Error occurs when loading the gemma model in bitsandbytes format.
bug
Something isn't working
#2556
opened Dec 23, 2024 by
upskyy
5 tasks done
upgrade setuptools and wheel if you found "torch module not found" when installing
bug
Something isn't working
#2554
opened Dec 23, 2024 by
MiladInk
[Bug] Link error in SGLang Sampling Docs
documentation
Improvements or additions to documentation
#2551
opened Dec 23, 2024 by
zhaochenyang20
5 tasks done
[Bug] Outlines version error for Grammar Backend
bug
Something isn't working
good first issue
Good for newcomers
#2550
opened Dec 23, 2024 by
zhaochenyang20
5 tasks done
[Feature] Set outlines and xgrammar as addtional dependency
enhancement
New feature or request
grammar-backend
#2549
opened Dec 23, 2024 by
zhaochenyang20
2 tasks done
[Feature] (Willing to PR) Avoid KV cache occupying GPU memory when not used
collaboration
feature
high priority
#2542
opened Dec 22, 2024 by
fzyzcjy
2 tasks done
[Bug] Eagle2 has an unstable sampling rate during multi concurrency。
#2537
opened Dec 21, 2024 by
coolhok
5 tasks done
[Bug] Transformers doesn't recognize LLaVA variant architectures
#2532
opened Dec 20, 2024 by
amosyou
5 tasks done
[Feature] Add Docs For Quantization
good first issue
Good for newcomers
quant
LLM Quantization
#2531
opened Dec 20, 2024 by
binhtranmcs
[Feature] Support for Evicting Specific KV Cache to Save GPU Memory
#2510
opened Dec 18, 2024 by
ChenlongDeng
2 tasks done
[Feature] Integration SGLang into OpenRLHF
collaboration
high priority
#2506
opened Dec 17, 2024 by
zhaochenyang20
2 tasks done
[Feature] Add Tutorial for Constraint Decoding
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#2505
opened Dec 17, 2024 by
zhaochenyang20
2 tasks done
[Feature] Add Math in our CI
enhancement
New feature or request
good first issue
Good for newcomers
#2504
opened Dec 17, 2024 by
zhaochenyang20
2 tasks
[Feature] Benchmarking Performance on General Devices
collaboration
enhancement
New feature or request
#2488
opened Dec 16, 2024 by
zhaochenyang20
2 tasks done
[Feature] request smoothquant (int8, W8A8) quantization on 40G A100
#2474
opened Dec 13, 2024 by
Hao-YunDeng
2 tasks done
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.