Inactive

CENTER FOR
DIGITAL TRUST

CHAMELEON

Data-mixing framework using Kernel Ridge Leverage Scores (KRLS) for domain weighting in LLM pretraining and finetuning. Computes domain embeddings and weights to optimize universal generalization and transferability. Integrates with existing training pipelines via simple scripting.
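
To make the description above concrete, here is a minimal sketch of how ridge leverage scores can be computed from a set of domain embeddings and turned into sampling weights. It is an illustration under stated assumptions, not the CHAMELEON code. The score uses the standard definition, the i-th diagonal entry of K (K + lam I)^-1. The RBF kernel, the toy embeddings, the value of lam, and the softmax normalization in scores_to_weights are all assumptions chosen for the example, and every function name is hypothetical.

```python
import numpy as np


def rbf_kernel(X, gamma=1.0):
    """Pairwise RBF kernel between rows of X (kernel choice is an assumption)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def ridge_leverage_scores(K, lam):
    """Diagonal of K (K + lam I)^-1, the standard ridge leverage score."""
    n = K.shape[0]
    # K and (K + lam I)^-1 commute, so solving (K + lam I) Z = K gives the same diagonal.
    Z = np.linalg.solve(K + lam * np.eye(n), K)
    return np.diag(Z).copy()


def scores_to_weights(scores, temperature=1.0):
    """Hypothetical normalization: softmax over scores so the weights sum to 1."""
    z = scores / temperature
    z = z - z.max()  # subtract the max for numerical stability
    w = np.exp(z)
    return w / w.sum()


# Toy embeddings for three domains (made-up values): the first two are
# nearly identical, the third points in a different direction.
embeddings = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])

K = rbf_kernel(embeddings, gamma=1.0)
scores = ridge_leverage_scores(K, lam=0.1)
weights = scores_to_weights(scores)
```

In this toy setup the third domain receives the largest score, because its embedding is the least redundant with the others. Whether high-scoring domains should be upweighted or downweighted depends on the training goal, so the direction of the final normalization is left as a design choice here.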

Large Language Model

Maturity

Support

C4DT

Lab

Technical

Source code: Lab Github
Last commit: 2025-07-23

Laboratory for Information and Inference Systems

Prof. Volkan Cevher

At LIONS, we are concerned with optimized information extraction from signals or data volumes. We therefore develop mathematical theory and computational methods for information recovery from highly incomplete data.

This page was last edited on 2026-03-03.
