[r/LocalLLaMA] score: 0.24
Built a fully offline suitcase robot around a Jetson Orin NX SUPER 16GB. Gemma 4 E4B, ~200ms cached TTFT, 30+ sensors, no WiFi/BT/cellular. He has opinions.
May 15, 2026
A fully offline suitcase robot named Sparky runs Gemma 4 E4B at Q4_K_M quantization via llama.cpp on a Jetson Orin NX SUPER 16GB, reaching roughly 200 ms cached TTFT and 14-15 tok/s at 12K context. The build eliminates a separate BLIP subprocess by using Gemma 4's native vision and OCR, and feeds readings from 30+ sensors to the model as natural-language prompt context. For edge robotics practitioners, the prompt cache stability techniques and the SenseVoiceSmall (speech recognition) plus Piper (TTS) pipeline make this a reproducible, fully offline multimodal stack.
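The summary does not say how the sensor text is laid out or which cache stability techniques the build uses, so the following is only a rough sketch of one common approach, not the author's method: keep the static system prompt and conversation history at the front of the prompt, and put the fast-changing sensor line at the end, so that a prefix-matching prompt cache can reuse as much as possible between turns. The sensor names (`temperature_c`, `battery_pct`), the prompt wording, and all function names here are hypothetical.

```python
import os.path

# Hypothetical system prompt; the real one is not given in the post.
SYSTEM_PROMPT = "You are Sparky, an offline suitcase robot. Answer briefly."


def format_sensors(readings):
    """Render sensor readings as one natural-language line.

    Keys are sorted so the wording order is deterministic from turn to turn.
    """
    parts = [f"{name} is {value}" for name, value in sorted(readings.items())]
    return "Current sensor readings: " + "; ".join(parts) + "."


def build_prompt(history, readings, user_message):
    """Assemble a prompt with the stable text first and volatile text last."""
    # Stable part: identical across turns until the history grows.
    stable = SYSTEM_PROMPT + "\n" + "".join(f"{turn}\n" for turn in history)
    # Volatile part: sensor values change often, so they go at the end.
    volatile = format_sensors(readings) + "\nUser: " + user_message + "\nSparky:"
    return stable + volatile


def shared_prefix_len(a, b):
    """Number of leading characters two prompts have in common."""
    return len(os.path.commonprefix([a, b]))


if __name__ == "__main__":
    history = ["User: hi", "Sparky: hello"]
    p1 = build_prompt(history, {"temperature_c": 21.5, "battery_pct": 80}, "How warm is it?")
    p2 = build_prompt(history, {"temperature_c": 22.0, "battery_pct": 79}, "How warm is it?")
    print(p1)
    print("shared prefix characters:", shared_prefix_len(p1, p2))
```

With this layout, two prompts that differ only in sensor values still share the whole system prompt and history as a common prefix. Placing the sensor line before the history instead would change the prompt near the start on every reading and leave little for a prefix cache to reuse.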