[arXiv]score: 0.13

Cross-Model Activation Injection Between Pythia Models Fails to Transfer Reasoning

June 3, 2026

A linear translation layer achieves 0.97 cosine similarity mapping hidden states from Pythia-160M to Pythia-410M, but injecting translated activations at inference time does not improve multi-hop reasoning and replacement-style injection actively degrades performance. The result suggests activation-space communication between heterogeneous models is not a viable inference-time reasoning channel.

cs.AI

HOW THIS AFFECTS YOU

●

researcherA clean negative result on cross-model activation transfer — saves effort for teams exploring hidden-state communication as an alternative to natural-language chain-of-thought passing between agents.

SOURCE

https://arxiv.org/abs/2606.03280

← back to feed