Reference implementation of the ClippedScion and UnconstrainedClippedScion optimizers, which extend Scion with gradient clipping. It supports multiple norms and hyperparameter configurations for nanoGPT, CNN, and DeiT training, and includes example setups and citation information.
This page was last edited on 2026-03-03.