[arXiv]

HRM SSM Adapter Beats LoRA on Long-Context Tasks with Mistral-7B

June 26, 2026

The Hankel Reduced-order Model (HRM) adapter is a parameter-efficient fine-tuning (PEFT) module built on a state-space model (SSM). It is initialized via Balanced Truncation and uses an FFT-based parallel scan to match LoRA's compute cost at all context lengths. On Mistral-7B with 8.4M trainable parameters, it reports a 34.8% relative accuracy gain on QuALITY and a 71.6% relative ROUGE-1 gain on QMSum over LoRA variants.
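To make the "FFT-based parallel scan" idea concrete, here is a minimal NumPy sketch. It is not the paper's implementation: it assumes a single-channel, diagonal, discrete-time SSM, and the names `ssm_recurrent` and `ssm_fft` are hypothetical. It only illustrates the general fact that a linear SSM's sequential recurrence produces the same output as a causal convolution, which can be evaluated in parallel with an FFT.

```python
import numpy as np

def ssm_recurrent(a, b, c, u):
    """Sequential scan: x_t = a * x_{t-1} + b * u_t, y_t = c . x_t.

    a, b, c are 1-D arrays of the same length (diagonal state matrix,
    input vector, output vector); u is a 1-D input sequence.
    """
    x = np.zeros_like(a)
    y = np.empty(len(u))
    for t, u_t in enumerate(u):
        x = a * x + b * u_t
        y[t] = np.dot(c, x)
    return y

def ssm_fft(a, b, c, u):
    """Same output via causal convolution with kernel k_t = sum(c * a**t * b)."""
    n = len(u)
    # Powers of the diagonal state matrix, shape (n, state_dim).
    powers = a[None, :] ** np.arange(n)[:, None]
    kernel = powers @ (c * b)  # shape (n,)
    # Zero-pad to 2n so the circular FFT convolution equals the linear one.
    size = 2 * n
    spectrum = np.fft.rfft(kernel, size) * np.fft.rfft(u, size)
    return np.fft.irfft(spectrum, size)[:n]
```

Calling both functions on the same stable system (entries of `a` with magnitude below 1) and the same input should give outputs that agree to floating-point tolerance. The recurrent form takes one step per token, while the FFT form costs O(n log n) for a length-n sequence and has no sequential dependency across time steps.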

HOW THIS AFFECTS YOU

● Builder: If your fine-tuning workloads involve long-context summarization or QA, HRM offers a drop-in LoRA alternative with large reported benchmark gains at the same trainable-parameter count.

● Researcher: The results are strong empirical evidence that MLP blocks are better SSM adapter injection sites than attention projection layers for sequential state-accumulation tasks. This is worth replicating on other base models.
