A benchmarking suite for comparing optimizers (Adam, SGD, Muon, Scion, etc.) on LLM pretraining with Llama and MoE architectures. It supports configurable training parameters, WandB logging, multi-GPU and CPU setups, and includes guidelines for extending the suite to new models or datasets.
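A common way to make such a suite extensible to new optimizers is a string-keyed registry plus a config object. The following is a minimal, dependency-free sketch of that pattern; all names here (`TrainConfig`, `register_optimizer`, the config fields) are illustrative assumptions, not this project's actual API, and a real implementation would return e.g. `torch.optim` instances from the factories.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Optional

# Hypothetical config sketch: the suite's real flag names and schema are
# not specified in the description, so these fields are assumptions.
@dataclass
class TrainConfig:
    model: str = "llama"               # "llama" or "moe"
    optimizer: str = "adam"            # e.g. "adam", "sgd", "muon", "scion"
    lr: float = 3e-4
    batch_size: int = 32
    wandb_project: Optional[str] = None  # WandB logging enabled when set

# Registry pattern: adding an optimizer means registering a factory
# under a string key, so the training loop never hard-codes choices.
OPTIMIZERS: Dict[str, Callable[[Any, TrainConfig], Any]] = {}

def register_optimizer(name: str):
    def wrap(factory):
        OPTIMIZERS[name] = factory
        return factory
    return wrap

@register_optimizer("sgd")
def make_sgd(params, cfg: TrainConfig):
    # Placeholder factory; a real suite would construct
    # torch.optim.SGD(params, lr=cfg.lr) here.
    return ("sgd", cfg.lr)

def build_optimizer(params, cfg: TrainConfig):
    if cfg.optimizer not in OPTIMIZERS:
        raise KeyError(
            f"unknown optimizer {cfg.optimizer!r}; known: {sorted(OPTIMIZERS)}"
        )
    return OPTIMIZERS[cfg.optimizer](params, cfg)
```

With this shape, extending the benchmark to a new optimizer (say, Muon) is a single `@register_optimizer("muon")` factory, and sweeps over optimizers reduce to iterating over config values.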
This page was last edited on 2026-03-03.