MTBBench is a benchmark for evaluating multimodal LLM reasoning in oncology. It covers two core challenges: multimodal integration (pathology, genomics, radiology) and longitudinal reasoning across patient timelines. It also includes agentic tasks that require interacting with external tools, such as the foundation-model-based TRIDENT for pathology and the DrugBank database for pharmacology.
This page was last edited on 2026-03-19.