[X]score: 0.47

RWKV-7 G1g is here: the world's best pure RNN LLM, and a competitive LLM in general. Try for bsz16 7B inference. G1h in June p.s. const 15000+tp…

May 23, 2026

RWKV-7 G1g (7B) is released, a pure RNN architecture claiming state-of-the-art among RNN-based LLMs and competitive with transformer models generally. Achieves 15,000+ tokens/second decoding on a single RTX 5090 at batch size 16, via the Albatross inference engine on GitHub. Relevant for edge/long-context deployments where constant memory scaling versus sequence length matters over transformer alternatives.

SOURCE

https://x.com/BlinkDL_AI/status/2058177941088149901#m

← back to feed