● builder: Potential path to smaller embedding layers in production LMs, though no benchmark numbers are available yet to assess accuracy tradeoffs.
● researcher: New architecture for parameter-efficient causal LMs, worth evaluating against standard embedding baselines on vocab-heavy domains.