-
Notifications
You must be signed in to change notification settings - Fork 228
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: suppport stateless group and decouple vLLM in train backend
#1842
opened Jan 29, 2026 by
shuyixiong
•
Draft
4 tasks
refactor: unify entrypoint for different envs
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1841
opened Jan 29, 2026 by
yuki-97
Loading…
feat: Mask sequences with high logprob error
CI:L1
Run doctests, unit tests, and functional tests
#1838
opened Jan 29, 2026 by
yfw
Loading…
4 tasks
fix: allow multi epoch training for async grpo
#1836
opened Jan 28, 2026 by
parthchadha
Loading…
4 tasks
feat: Allow loading of more general data types
CI:L1
Run doctests, unit tests, and functional tests
community-request
#1834
opened Jan 28, 2026 by
nathan-az
Loading…
Mcore dp coordinator implementation initial
#1833
opened Jan 27, 2026 by
shanmugamr1992
Loading…
4 tasks
feat: add lora config for dpo dtensor backend
CI:L1
Run doctests, unit tests, and functional tests
#1826
opened Jan 26, 2026 by
RayenTian
Loading…
4 tasks
ci: Allow repo to self publish docs
CI
Relating to CI
#1821
opened Jan 23, 2026 by
chtruong814
Loading…
4 tasks
perf: Update cudnn to 9.14
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1820
opened Jan 23, 2026 by
guyueh1
Loading…
4 tasks
fix: fix statistic of probs_ratio_clamped_min/max
CI:L1
Run doctests, unit tests, and functional tests
#1818
opened Jan 23, 2026 by
yuki-97
Loading…
fix: Unify custom model logits extraction across all inference methods
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1815
opened Jan 23, 2026 by
zpqiu
Loading…
4 tasks
feat: Implement ProRLv2 recipe
CI:L1
Run doctests, unit tests, and functional tests
#1809
opened Jan 22, 2026 by
hijkzzz
Loading…
chore: cuda13 support
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1803
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Timer for the data sharding and job submission
CI:L1
Run doctests, unit tests, and functional tests
#1802
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Support lora in dtensor grpo workflow by merging weight
CI:L1
Run doctests, unit tests, and functional tests
#1797
opened Jan 20, 2026 by
RayenTian
Loading…
feat: add speculative decoding during post-training
#1785
opened Jan 15, 2026 by
isomap
Loading…
2 of 4 tasks
feat: NeMo Gym GRPO on-policy fix params; Per-agent group-level rewards
CI:L1
Run doctests, unit tests, and functional tests
#1779
opened Jan 15, 2026 by
bxyu-nvidia
Loading…
4 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.