●builderYou can use this open recipe and data pipeline to train or fine-tune small terminal agents without needing frontier-scale compute.
●researcherWorth watching because it establishes a concrete open baseline for RL-trained terminal agents with a reproducible data generation methodology.