I work at a public access tv station and am looking into the services that provide captions, but man are they expensive. I’m wondering if there’s a way i can engineer a box that can do the same. I’m looking at mozilla’s speech to text implementation, which is an interesting thing i never heard of before. Anyone have any experience with such a thing?
besides mycroft,
https://github.com/mozilla/DeepSpeech\ http://kaldi-asr.org/\ https://github.com/cmusphinx/pocketsphinx