[arXiv]score: 0.21

Multimodal Dementia Detection via Whisper and LLM Feature Extraction

July 1, 2026

A framework achieves F1-scores of up to 90.14% on ADReSS/ADReSSo datasets by fusing Whisper acoustic embeddings with LLM-extracted linguistic features. The method uses temporal networks with attention pooling and a gated fusion network to integrate acoustic and semantic biomarkers.

HOW THIS AFFECTS YOU

●

researcherYou can leverage LLM-augmented linguistic features to improve multimodal biomarker detection.

●

healthThis approach offers a non-invasive pathway for early dementia screening via speech.

read original ↗arxiv.org

← back to feed