Knowledge distillation pipeline for training tiny monolingual encoders from XLM-R or HPLT teacher models. Supports downstream evaluation for POS tagging, lemmatization, dependency parsing, and NER. Includes checkpoint selection scripts, distillation pipelines, and evaluation tooling.
This page was last edited on 2026-03-03.
This page was last edited on 2026-03-03.