Multimodal Dementia Detection via Whisper and LLM Feature Extraction
July 1, 2026
A framework achieves F1-scores of up to 90.14% on ADReSS/ADReSSo datasets by fusing Whisper acoustic embeddings with LLM-extracted linguistic features. The method uses temporal networks with attention pooling and a gated fusion network to integrate acoustic and semantic biomarkers.
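As a rough illustration of the two components named above, the NumPy sketch below pools a sequence of frame-level acoustic embeddings with attention weights, then blends the pooled vector with a linguistic feature vector through a sigmoid gate. It is a minimal sketch under stated assumptions, not the framework's actual architecture: the summary gives no layer details, so the dimensions, the random stand-in inputs, the single scoring vector, and the names `attention_pool` and `gated_fusion` are all hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def attention_pool(frames, w):
    """Collapse (T, D) frame embeddings into one (D,) vector.

    Each frame gets a scalar score frames @ w; a softmax over time turns
    the scores into weights, and the output is the weighted frame average.
    """
    scores = frames @ w                # (T,)
    alpha = softmax(scores, axis=0)    # (T,), sums to 1
    return alpha @ frames, alpha       # (D,), (T,)

def gated_fusion(acoustic, linguistic, W_g, b_g):
    """Blend two (D,) vectors with a learned per-dimension gate.

    g = sigmoid(W_g [acoustic; linguistic] + b_g)
    fused = g * acoustic + (1 - g) * linguistic
    """
    g = sigmoid(W_g @ np.concatenate([acoustic, linguistic]) + b_g)
    return g * acoustic + (1.0 - g) * linguistic, g

# Hypothetical sizes and random stand-ins (assumptions, not from the source).
rng = np.random.default_rng(0)
T, D = 50, 16
acoustic_frames = rng.normal(size=(T, D))  # stand-in for Whisper frame embeddings
linguistic_vec = rng.normal(size=D)        # stand-in for LLM-extracted features
w_att = rng.normal(size=D)                 # attention scoring vector (untrained)
W_g = rng.normal(size=(D, 2 * D)) * 0.1    # gate weights (untrained)
b_g = np.zeros(D)

acoustic_vec, alpha = attention_pool(acoustic_frames, w_att)
fused, gate = gated_fusion(acoustic_vec, linguistic_vec, W_g, b_g)
print(fused.shape, float(alpha.sum()))
```

Because the gate lies strictly between 0 and 1, each fused dimension is a convex combination of its acoustic and linguistic counterparts, so a trained gate can lean on whichever modality is more informative for that dimension. In a real system the weights would be learned end to end and a classifier head would follow the fused vector.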
HOW THIS AFFECTS YOU
● Researcher: You can leverage LLM-augmented linguistic features to improve multimodal biomarker detection.
● Health: This approach offers a non-invasive pathway for early dementia screening via speech.