generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Adding support for different losses which are now supported by Liger
#3815
opened Jul 31, 2025 by
Manan17
Loading…
1 of 5 tasks
Performance optimization: Replace list comprehensions with tensor operations in BCO and KTO trainers
#3813
opened Jul 30, 2025 by
chi2liu
Loading…
5 tasks
Add soft overlong punishment reward function and update documentation
#3804
opened Jul 30, 2025 by
qgallouedec
Loading…
5 tasks
Add vLLM server mode support to OnlineDPOTrainer
#3783
opened Jul 27, 2025 by
vaelev
Loading…
6 tasks done
change doc for
num_iterations
and steps_per_generation
to hopefully make them more clear and differentiate between them more clearly
#3761
opened Jul 23, 2025 by
avishaiElmakies
Loading…
2 of 5 tasks
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758
opened Jul 23, 2025 by
almeidava93
Loading…
2 of 5 tasks
Add basic support for FSDP/Lora when using TRL/VLLM
#3735
opened Jul 14, 2025 by
ojh31
Loading…
5 tasks
Add warn0 utility and replace warnings.warn with rank-aware warnings in trainer
#3734
opened Jul 14, 2025 by
yafshar
Loading…
1 of 5 tasks
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
Loading…
5 tasks
Ensure Chat Template Safe Prompt Truncation
#3646
opened Jun 25, 2025 by
pramodith
Loading…
4 of 5 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.