Moondream Engineering Optimizes GPU Throughput to Reduce Inference Latency | HACKOBAR_