MTBBench

MTBBench

Benchmark for evaluating multimodal LLM reasoning in complex oncology clinical decision-making scenarios

MTBBench is a benchmark for evaluating multimodal LLM reasoning in oncology, covering two core challenges: multimodal integration (pathology, genomics, radiology) and longitudinal reasoning across patient timelines. It includes agentic tasks requiring interaction with external foundation-model-based tools such as TRIDENT for pathology and DrugBank for pharmacology.

BenchmarkLarge Language ModelMachine LearningMedical
Key facts
Maturity
Support
C4DT
Inactive
Lab
Active
  • Technical

Artificial Intelligence in Molecular Medicine

Artificial Intelligence in Molecular Medicine
Charlotte Bunne

Prof. Charlotte Bunne

Our research aims to advance personalized medicine by utilizing machine learning and large-scale biomedical data.

This page was last edited on 2026-03-19.