Pull requests: AI-Hypercomputer/maxtext
[WIP-Experimental]Onboard qwen1.5-moe-a2.7b model in MaxText
#4070
opened Jun 4, 2026 by
YixuanWang-99
Collaborator
4 tasks
Implement generic multimodal SFT training pipeline for TFDS.
#4069
opened Jun 4, 2026 by
copybara-service
Bot
feat(lora): Add Gemma3 weight mappings, vLLM adapter serving, and end…
#4068
opened Jun 4, 2026 by
RexBearIU
Collaborator
4 tasks done
refactor: streamline LoRA parameter restoration for JAX/NNX models
#4067
opened Jun 4, 2026 by
RexBearIU
Collaborator
4 tasks done
Fix: Gemma 3 & 4 Base Model and Flax Linen Decoding
#4066
opened Jun 4, 2026 by
RexBearIU
Collaborator
4 tasks done
Migrate distillation MaxTextCheckpointManager to reflect Tunix updated Checkpoint Manager.
#4064
opened Jun 4, 2026 by
copybara-service
Bot
Add elastic training (Pathways + xpk) guide under docs/
#4063
opened Jun 4, 2026 by
inardini
Fix cuDNN SDPA autoregressive inference when KV cache batch differs from decode batch
#4058
opened Jun 3, 2026 by
sfvaroglu
Contributor
4 tasks done
Remove hardcoded vision hyperparams from Qwen mm preprocessor
#4057
opened Jun 3, 2026 by
hengtaoguo
Collaborator
4 tasks done
Fix sparse distillation loss and speed up teacher top-k logit saving
pull ready
#4056
opened Jun 3, 2026 by
ajkv-google
Collaborator
4 tasks done
[pallas:sc] Remove use_tc_tiling_on_sc=True, because this is now a default
#4055
opened Jun 3, 2026 by
copybara-service
Bot
Update vllm/tpu-inference commit and fix vllm installation
#4054
opened Jun 3, 2026 by
SurbhiJainUSC
Collaborator
Draft
4 tasks done
Qwen3 Coder 480B inference sharding changes for MaxText.
#4052
opened Jun 3, 2026 by
copybara-service
Bot
Support of gdn kernel from tpu-inference
gemini-review
#4051
opened Jun 3, 2026 by
khatwanimohit
Collaborator
4 tasks done
Fix: Refactor LoRA checkpoint restoration and simplify NNX weight extraction
#4050
opened Jun 3, 2026 by
RexBearIU
Collaborator
4 tasks done
[RL] Honor tokenizer chat templates for base models that lack one
#4049
opened Jun 3, 2026 by
dasoto
Collaborator
4 tasks done
[WIP-exp1] microsoft/Phi-4-mini-instruct
#4047
opened Jun 3, 2026 by
hengtaoguo
Collaborator
Draft
4 tasks
fix: raise RuntimeError when checkpoint step >= config.steps
bug
gemini-review
pull ready
#4046
opened Jun 2, 2026 by
Dr-Left
Collaborator
4 tasks done
Add intermediate eval hook: fire evaluate() every eval_interval outer steps
#4044
opened Jun 2, 2026 by
py4
Collaborator
4 tasks done
[Qwen3.5] Add moe weight sync script for 35b model
gemini-review
pull ready
#4041
opened Jun 2, 2026 by
Rohan-Bierneni
Collaborator
4 tasks done
Support Qwix quantization on NNX
#4040
opened Jun 2, 2026 by
hsuan-lun-chiang
Collaborator
4 tasks done