[arXiv]score: 0.41

VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority

May 14, 2026

VideoSEAL addresses evidence misalignment in long video understanding agents, where MLLMs produce correct answers unsupported by retrieved evidence; introduces temporal groundedness and semantic grounding diagnostics to characterize this failure mode in agentic long video QA systems.

cs.CVcs.AI

SOURCE

https://arxiv.org/abs/2605.12571

← back to feed