●builderReduces expensive large-model calls in speculative decoding pipelines without requiring a separately trained draft model.
●researcherIntra-model routing for tiered verification is a novel framing that challenges the binary accept/reject assumption in existing SD literature.