Link parkin’: parakeet-mlx
Parakeet MLX
An implementation of the Parakeet models - Nvidia’s ASR (Automatic Speech Recognition) models - for Apple Silicon using MLX.
Regarding the models, here’s info from a June NVIDIA blog post:
NVIDIA Parakeet TDT 0.6B v2 is a 600-million-parameter automatic speech recognition (ASR) model designed for high-quality English transcription. It is currently ranked #1 on the Hugging Face ASR leaderboard, alongside four other top-ranking NVIDIA Parakeet models. NVIDIA NeMo Canary models have also made their mark on the Hugging Face ASR leaderboard.
This post explores how these and other cutting-edge NVIDIA speech AI models are setting new benchmarks for accuracy, speed, and versatility in automatic speech recognition (ASR). We will review model highlights, leaderboard performance, and practical deployment options so you can leverage these state-of-the-art models for real-world applications.
Another possible transcription backend for retrocast.