Pull requests: huggingface/trl
[GRPO] Fix potential hang in get_high_entropy_mask
#4041 opened Sep 9, 2025 by akakakakakaa
Remove average_tokens_across_devices default replacement
#4039 opened Sep 8, 2025 by qgallouedec
Fix label shifting logic in SFTTrainer for compatibility with CP
#4038 opened Sep 8, 2025 by qgallouedec
5 tasks
Add autodoc for BestOfNSampler and improve docstrings
#4034 opened Sep 8, 2025 by albertvillanova
Made ref_model as None in PPO trainer for refined args
#4024 opened Sep 7, 2025 by complete-dope
Fix #3982: Fix DPO Trainer support for Gemma 3 vision models
#4022 opened Sep 6, 2025 by akshay-babbar
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects
2 of 5 tasks
Fix: ignore precompute_ref_log_probs when use_liger_loss=True
#4008 opened Sep 4, 2025 by ginkyenglee
5 tasks
⚖️ Align SFT and DPO for model creation and deprecate DPOConfig.padding_value in favour of pad_token_id
#4006 opened Sep 4, 2025 by qgallouedec
5 tasks
Fix: Make sft script work when chat template is None
#3995 opened Sep 2, 2025 by rabinadk1
1 of 5 tasks
Enable saving and loading precomputed reference log probabilities in …
#3986 opened Sep 1, 2025 by ginkyenglee
3 tasks