MedQ-Deg

A Multidimensional Benchmark for Evaluating MLLMs Across Medical Image Quality Degradations

40 MLLMs Evaluated
24,894 QA Pairs
19 Degradation Types
30 Fine-grained Skills
7 Modalities

Benchmark Framework

Two orthogonal hierarchies structure the evaluation: a capability hierarchy decomposing clinical competence into 30 fine-grained skills, and a degradation hierarchy covering 19 degradation types across 7 modalities.


Fig. 2: Overview of the MedQ-Deg benchmark framework. Left: Medical MLLM Capability Hierarchy. Middle: Benchmark Construction pipeline. Right: Medical Image Degradation Hierarchy.
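Because the two hierarchies are orthogonal, each evaluated QA pair can be tagged on both axes and results can be read as a skill-by-degradation grid. The following is a minimal sketch of that idea, not the benchmark's actual code: the `EvalRecord` fields, the `accuracy_grid` helper, and the skill and degradation names are all hypothetical placeholders chosen for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalRecord:
    """One answered QA pair tagged on both hierarchies (hypothetical schema)."""
    skill: str        # leaf of the capability hierarchy (placeholder name)
    degradation: str  # leaf of the degradation hierarchy (placeholder name)
    modality: str     # imaging modality of the source image
    correct: bool     # whether the model answered correctly


def accuracy_grid(records):
    """Return {(skill, degradation): accuracy} computed over the records."""
    totals = defaultdict(lambda: [0, 0])  # key -> [num_correct, num_seen]
    for r in records:
        cell = totals[(r.skill, r.degradation)]
        cell[0] += int(r.correct)
        cell[1] += 1
    return {key: hits / seen for key, (hits, seen) in totals.items()}


# Illustrative records only; these are not benchmark results.
records = [
    EvalRecord("lesion_detection", "gaussian_noise", "CT", True),
    EvalRecord("lesion_detection", "gaussian_noise", "CT", False),
    EvalRecord("lesion_detection", "motion_blur", "MRI", True),
    EvalRecord("organ_recognition", "gaussian_noise", "X-ray", True),
]
grid = accuracy_grid(records)
```

Keying the grid on the pair `(skill, degradation)` keeps the two axes independent, so either one can be aggregated away (for example, averaging over all degradations for a single skill) without changing the record format.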

Degradation Categories

The benchmark covers 19 degradation types across 5 major categories, each calibrated at 3 severity levels by expert radiologists.
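To make the idea of severity-calibrated degradation concrete, here is a small sketch that applies one degradation at three levels. It assumes additive Gaussian noise as the degradation and uses made-up standard deviations in `SEVERITY_SIGMA`; the benchmark's real degradation types and radiologist-calibrated parameters are not given here, so every name and value below is an illustrative assumption.

```python
import random

# Hypothetical noise standard deviations per severity level (0-255 intensity scale).
# These are NOT the benchmark's calibrated values.
SEVERITY_SIGMA = {1: 5.0, 2: 15.0, 3: 30.0}


def add_gaussian_noise(image, severity, seed=0):
    """Return a noisy copy of `image` (rows of ints in [0, 255]).

    Raises KeyError if `severity` is not one of the defined levels.
    """
    sigma = SEVERITY_SIGMA[severity]
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    return [
        [min(255, max(0, round(pixel + rng.gauss(0.0, sigma)))) for pixel in row]
        for row in image
    ]


def mean_abs_error(a, b):
    """Mean absolute per-pixel difference between two equally sized images."""
    diffs = [abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b)]
    return sum(diffs) / len(diffs)


# A flat mid-gray 8x8 test image stands in for a medical image.
clean = [[128] * 8 for _ in range(8)]
degraded = {level: add_gaussian_noise(clean, level) for level in SEVERITY_SIGMA}
errors = {level: mean_abs_error(clean, img) for level, img in degraded.items()}
```

Measuring the distortion at each level, as `errors` does here, is one simple way to check that higher severity levels really do move the image further from the original.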
