HACKOBAR_item
[r/LocalLLaMA]score: 0.20

NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing

May 9, 2026
NVIDIA released Star Elastic, a single checkpoint containing 30B, 23B, and 12B reasoning models accessible via zero-shot layer slicing. The nested architecture enables KV cache sharing across model sizes, allowing dynamic compute scaling without multiple deployments. Practitioners running local inference can trade quality for speed on-the-fly, a significant efficiency gain over maintaining separate model weights.
new model