●builderIf you're using long Markdown system prompts to encode agent behaviors, a trained soft prefix could reduce inference-time token cost while improving task accuracy on compatible tasks.
●researcherSoft prefix tuning as a behavioral compression mechanism for frozen models shows strong gains on math and QA — the latent skill representation framing is worth exploring for multi-skill agent systems.