[arXiv]score: 0.39
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
May 15, 2026
Applies TopK Sparse Autoencoders to three EEG foundation models (SleepFM, REVE, LaBraM) to extract interpretable feature dictionaries, benchmarking monosemanticity and entanglement against clinical taxonomy (abnormality, age, sex, medication).
cs.LGcs.HCcs.NE