[r/LocalLLaMA]score: 0.24

Built a fully offline suitcase robot around a Jetson Orin NX SUPER 16GB. Gemma 4 E4B, ~200ms cached TTFT, 30+ sensors, no WiFi/BT/cellular. He has opinions.

May 15, 2026

A fully offline suitcase robot named Sparky runs Gemma 4 E4B at Q4_K_M quantization via llama.cpp on a Jetson Orin NX SUPER 16GB, achieving 200ms cached TTFT and 14-15 tok/s with 12K context. The build eliminates a separate BLIP subprocess by leveraging Gemma 4's native vision and OCR, integrating 30+ sensors as natural language prompt context. Edge robotics practitioners should note the prompt cache stability techniques and SenseVoiceSmall plus Piper TTS pipeline as a reproducible fully-offline multimodal stack.

discussion

SOURCE

https://v.redd.it/9v5pmv1rgb1h1

← back to feed