●researcherBenchmark exposes a critical evaluation gap — models trained and tested on smartphone audio do not transfer to wearable sensor modalities, requiring domain-specific fine-tuning or new pretraining data.
●healthAny clinical deployment of cough-based diagnostics on wearables should not assume smartphone-validated model performance holds — none of the five tested FMs meet minimum clinical sensitivity thresholds.