Pull requests: NovaSky-AI/SkyRL
[megatron] fused linear cross-entropy for 23x memory savings at 262k context
#1841
opened Jun 26, 2026 by
casper-hansen
[megatron] add opt-in async distributed checkpoint save
#1838
opened Jun 26, 2026 by
dinhxuanvu
Contributor
3 tasks
feat: Arctic RL training backend integration
#1837
opened Jun 26, 2026 by
sfc-gh-kganesan
3 of 4 tasks
[generator] Add per global_step cache salt for each trajectory to invalidate super stale KV
#1836
opened Jun 26, 2026 by
erictang000
Collaborator
[feat] Add Spec Decoding and MTP training support
#1832
opened Jun 25, 2026 by
zanderjiang
Contributor
[megatron] add opt-in distributed HF checkpoint export
#1831
opened Jun 25, 2026 by
dinhxuanvu
Contributor
Make packed sequence alignment FP8-safe
#1828
opened Jun 24, 2026 by
jinghanyao1-hub
Collaborator
[fix] Memory-map dataset per SFT tokenize worker to avoid cache stampede
#1826
opened Jun 23, 2026 by
dinhxuanvu
Contributor
fix(generators): compute batched rollout metrics from truncated responses
#1821
opened Jun 21, 2026 by
EazyReal
fix(tracking): finalize MLflow runs on SIGTERM instead of leaving them RUNNING
#1819
opened Jun 20, 2026 by
EazyReal
[megatron] Enable Nemotron-3-Ultra-550B GRPO RL + fix multi-rank (EP>16/PP>2) weight sync
#1816
opened Jun 19, 2026 by
erictang000
Collaborator
[algorithm] add rollout KL loss + fix padding-microbatch field drops
#1811
opened Jun 19, 2026 by
erictang000
Collaborator
[train] Async batch collation (double-buffering) for the SFT trainer
#1809
opened Jun 18, 2026 by
dyurk-lila
2 tasks done
[train] Vectorize controller-side training-batch collation (SFT + RL)
#1808
opened Jun 18, 2026 by
dyurk-lila
3 tasks done
[train] Skip building unused per-token loss_fn_outputs when the caller does not consume them
#1807
opened Jun 18, 2026 by
dyurk-lila
[megatron] Stream ChunkedDistributedLogprob.backward into a preallocated buffer (lower peak memory)
#1806
opened Jun 18, 2026 by
dyurk-lila
[megatron] Accept dtype-string optimizer_config_kwargs (coerce exp_avg_dtype etc. to torch.dtype)
#1805
opened Jun 18, 2026 by
dyurk-lila
rename adv_estimator param to advantage_estimator in compute_advantages_and_returns
#1793
opened Jun 16, 2026 by
KTanmay1
1 task
[train] Save HF processor on checkpoint export for VLMs
#1785
opened Jun 14, 2026 by
dinhxuanvu
Contributor
1 of 2 tasks
[fix] Honor served_model_name and surface HTTP errors in RemoteInferenceEngine
#1783
opened Jun 13, 2026 by
discobot
Contributor
[fix] Use masked mean in advantage batch normalization
#1782
opened Jun 12, 2026 by
discobot
Contributor