[r/MachineLearning] score: 0.13
[New Optimizer] 🌹 Rose: low VRAM, easy to use, great results, Apache 2.0 [P]
April 24, 2026
**Rose: A Stateless PyTorch Optimizer for Low-VRAM Training**
Rose is a newly released, Apache 2.0-licensed PyTorch optimizer that is stateless: it keeps no persistent gradient statistics between steps. Its memory footprint is therefore comparable to SGD without momentum and lower than even 8-bit AdamW. This matters for fine-tuning large models on memory-constrained hardware, where optimizer state typically consumes a significant portion of VRAM. However, the author notes that training loss can run higher than with Adam even while validation metrics remain competitive, so the optimizer warrants careful evaluation before adoption in production pipelines. Independent benchmark reproduction is still needed, given the single-developer origin and limited external validation so far.
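To put the memory claim in rough numbers, here is a back-of-the-envelope sketch of persistent optimizer state per trainable parameter. It does not use or measure Rose; the per-parameter byte counts are the usual textbook figures (two fp32 moment buffers for AdamW, two 8-bit buffers for 8-bit AdamW with quantization overhead ignored, none for momentum-free SGD or any stateless optimizer). The function name `optimizer_state_gib` and the 7B-parameter model size are assumptions chosen for illustration.

```python
# Approximate persistent optimizer state per trainable parameter, in bytes.
# Textbook estimates only; not measurements of Rose or any specific library.
STATE_BYTES_PER_PARAM = {
    "adamw_fp32": 2 * 4,    # two moment buffers stored in fp32
    "adamw_8bit": 2 * 1,    # two moment buffers in 8 bits (scaling overhead ignored)
    "sgd_no_momentum": 0,   # no buffers kept between steps
    "stateless": 0,         # by definition, nothing persists between steps
}


def optimizer_state_gib(num_params: int, optimizer: str) -> float:
    """Return the estimated optimizer-state size in GiB (hypothetical helper)."""
    return num_params * STATE_BYTES_PER_PARAM[optimizer] / 2**30


if __name__ == "__main__":
    num_params = 7_000_000_000  # hypothetical model size, for illustration only
    for name in STATE_BYTES_PER_PARAM:
        print(f"{name:>16}: {optimizer_state_gib(num_params, name):6.1f} GiB")
```

These figures cover optimizer state only. Weights, gradients, and activations still occupy VRAM regardless of which optimizer is used, so the total saving in practice is smaller than the state column alone suggests.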