●builderThe 3.3x self-speculative decoding speedup requires no separate draft model, making it a potentially low-overhead inference optimization worth tracking as code drops.
●researcherThe latent belief-state compression objective offers a new training signal that could improve reasoning and planning benchmarks beyond standard next-token pretraining.