[r/LocalLLaMA]score: 0.24
Step 3.7 Flash: 196B MoE, 11B Active, Runs on 128GB RAM
May 28, 2026
StepFun's Step 3.7 Flash is a 196B total / 11B active MoE model with an integrated 1.8B ViT, scoring 56.26% on SWE-Bench Pro and 47.2% on HLE with tools — competitive with Gemini 2.5 Flash and DeepSeek V4 Flash. Available via OpenRouter and NVIDIA NIM for API access or self-hosted on 128GB RAM.
new model
HOW THIS AFFECTS YOU
●
builderYou can access it today via OpenRouter or NVIDIA NIM for agentic and coding workflows at flash-tier cost with benchmark performance matching top competitors.
●
researcherThe 11B active parameter efficiency achieving 56%+ SWE-Bench Pro is worth examining for MoE scaling and sparse activation research.