[arXiv]score: 0.13
Cross-Model Activation Injection Between Pythia Models Fails to Transfer Reasoning
June 3, 2026
A linear translation layer achieves 0.97 cosine similarity mapping hidden states from Pythia-160M to Pythia-410M, but injecting translated activations at inference time does not improve multi-hop reasoning and replacement-style injection actively degrades performance. The result suggests activation-space communication between heterogeneous models is not a viable inference-time reasoning channel.
cs.AI
HOW THIS AFFECTS YOU
●
researcherA clean negative result on cross-model activation transfer — saves effort for teams exploring hidden-state communication as an alternative to natural-language chain-of-thought passing between agents.