[contrib] Add MiMo-V2.5-Pro (Xiaomi, 384 experts MoE, FP8 on Trn2) #150

Open
whn09 wants to merge 24 commits into aws-neuron:main from whn09:contrib/MiMo-V2.5-Pro

[contrib] MiMo-V2.5-Pro: bump default seq_len 256 -> 512; document vL…

af27106

Workflow runs completed with no jobs