High-performance CUDA implementation of the Muon optimizer for LLM training. Features Newton-Schulz polar decomposition, cuBLAS acceleration, and a transpose optimization yielding 8x FLOP savings on transformer FFN layers. Benchmarked on an NVIDIA A100 with Llama 3.1 8B-shaped FFN weight matrices (4096×11008).
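The core of Muon's update is orthogonalizing the gradient via an iterative Newton-Schulz approximation of the polar decomposition. The sketch below is a hypothetical NumPy illustration, not this repo's CUDA kernel: the quintic coefficients (3.4445, -4.7750, 2.0315) follow the widely circulated Muon reference implementation, and the transpose trick shown is the standard way to keep the Gram matrix at the smaller dimension (4096×4096 rather than 11008×11008 for an FFN weight), which is the source of the FLOP savings the description mentions.

```python
import numpy as np

def newton_schulz_orthogonalize(G, steps=5, eps=1e-7):
    # Quintic Newton-Schulz iteration: approximates the orthogonal factor
    # of the polar decomposition of G. Coefficients are taken from the
    # public Muon reference implementation (assumption about this repo).
    a, b, c = 3.4445, -4.7750, 2.0315
    # Transpose so rows <= cols: X @ X.T is then the *smaller* Gram matrix,
    # which is where the FLOP savings on wide FFN weights come from.
    transposed = G.shape[0] > G.shape[1]
    X = G.T if transposed else G
    # Frobenius norm upper-bounds the spectral norm, so this puts all
    # singular values of X into [0, 1], where the iteration converges.
    X = X / (np.linalg.norm(X) + eps)
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * (A @ A)) @ X
    return X.T if transposed else X
```

After a few steps the singular values cluster near 1, so the result acts as an (approximately) orthogonal replacement for the raw gradient.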
Few-Shot Adaptation for Vision-Language Models. Implements Base-to-Novel generalization on CLIP using LoRA, LP++, and the Muon optimizer to enhance performance on the Oxford Flowers-102 dataset.
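LoRA adapts a frozen weight matrix by training only a low-rank correction. The class below is a minimal NumPy sketch under stated assumptions (the rank `r`, scaling `alpha`, and zero-initialized `B` follow common LoRA practice; it is not this repo's CLIP integration): the frozen weight `W` is augmented with `(alpha / r) * B @ A`, so only `A` and `B` carry trainable parameters during few-shot adaptation.

```python
import numpy as np

class LoRALinear:
    # Hypothetical minimal LoRA layer: y = x @ (W + scale * B @ A).T,
    # where W stays frozen and only the low-rank factors A, B are trained.
    def __init__(self, W, r=4, alpha=8, seed=0):
        rng = np.random.default_rng(seed)
        self.W = W                                          # frozen (out, in)
        self.A = rng.standard_normal((r, W.shape[1])) * 0.01
        self.B = np.zeros((W.shape[0], r))                  # zero init: update starts at 0
        self.scale = alpha / r

    def __call__(self, x):
        return x @ (self.W + self.scale * self.B @ self.A).T
```

Because `B` starts at zero, the adapted layer initially reproduces the pretrained CLIP projection exactly, which keeps early few-shot training stable.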