Pull requests: SemiAnalysisAI/InferenceX
Pull requests list
[code not in mergable state yet] Add MI325X DeepSeek-R1 FP8 disaggregated inference with Broadcom Thor 2 IBGDA
#985 opened Mar 31, 2026 by JordanNanos • Draft • 2 of 8 tasks
[AMD/ROCM] ATOM support for new models: Kimi-K2.5 FP4, GLM-5 FP8, and MiniMax-M2.5
AMD
#963 opened Mar 27, 2026 by seungrokj
[WIP] B200 Minimax FP8 vllm upgrade
NVIDIA
sweep-enabled
#947 opened Mar 26, 2026 by kedarpotdar-nv
fix: multi-turn benchmark hangs after all clients finish
#908 opened Mar 13, 2026 by lishicheng1996-nv • 3 of 4 tasks
[NV - WIP] Qwen3.5 B200 SGLang FP4 configs
NVIDIA
sweep-enabled
#820 opened Feb 27, 2026 by kedarpotdar-nv
Performance Improvements for MI300X with GEMM and FP8 Enhancements
#811 opened Feb 26, 2026 by chunfangamd