Compact Speech Recognition

Posted on: Sat 17 December 2022

High-performance inference of OpenAI’s Whisper automatic speech recognition (ASR) model:

…

Having such a lightweight implementation of the model allows to easily integrate it in different platforms and applications. As an example, here is a video of running the model on an iPhone 13 device - fully offline, on-device:

Really compact C++ version of a production speech-to-text model. If I can get it to build, I’ll try it against some podcasts to see how things come out. If halfway decent it could become a piece of a comprehensive personal knowledge extraction memex.