[arXiv]score: 0.39

Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders

May 15, 2026

Applies TopK Sparse Autoencoders to three EEG foundation models (SleepFM, REVE, LaBraM) to extract interpretable feature dictionaries, benchmarking monosemanticity and entanglement against clinical taxonomy (abnormality, age, sex, medication).

cs.LGcs.HCcs.NE

SOURCE

https://arxiv.org/abs/2605.13930

← back to feed