Video-MME-Logical Tests Temporal-Logical Reasoning in MLLMs
June 25, 2026
Video-MME-Logical is a controlled benchmark designed to isolate video temporal-logical reasoning from simple object recognition. It evaluates models across five specific categories: state tracking, sequential counting, temporal ordering, dynamic spatiality, and structural composition.
HOW THIS AFFECTS YOU
●
researcherYou can specifically measure how well your multimodal models reason about evolving visual states over time.