[X]score: 0.38

parakeet.cpp Runs NVIDIA Parakeet ASR 2x Faster Than ONNX on Apple Silicon

May 31, 2026

parakeet.cpp is a GGML-based inference pipeline for NVIDIA's Parakeet speech recognition models, delivering 2x throughput over ONNX-based pipelines on Apple Silicon. It runs locally and leverages GGML's Metal backend for on-device ASR.

HOW THIS AFFECTS YOU

●

builderYou can swap ONNX-based Parakeet inference for this GGML pipeline to cut ASR latency in half on Apple Silicon, useful for local or edge speech applications.

SOURCE

https://x.com/badlogicgames/status/2061201400059531729#m

← back to feed