whisper speech to text support #394
Replies: 4 comments
-
I will have a look. |
Beta Was this translation helpful? Give feedback.
-
ibus-typing-booster already supports speech-to-text via googles service for quite a long time. But that service is neither open source nor free, only one hour per month is free and the setup is very difficult. I used it myself for a while trying to keep below the free limit but as it was hard to tell how much free time is left I stopped using it and have not even tested it for a while, so I am not even sure whether it still works. I want to add other services for speech to text, so that the user can choose which one to use and make a good free one the default. Because of this discussion: #324 (comment) I discovered VOSK: https://github.com/alphacep/vosk-api I tested it a bit on the command line. It seemed to work quite well. I tried only English though, I did not try how well the Turkish model works. For English I tried: https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip (Lightweight wideband model for Android and RPi 40M) and https://alphacephei.com/vosk/models/vosk-model-en-us-0.22.zip (ccurate generic US English model, 1.8G) The accurate model is huge, 1.8 GB, but it worked very well. I was not so impressed by the 40M lightweight model. For Turkish there seems to be only: https://alphacephei.com/vosk/models/vosk-model-small-tr-0.3.zip (Lightweight wideband model for Android and RPi, 35M) I wonder how good that is. I definitely want to add more speech to text engines and make the best free one the default. Which one is better? VOSK or whisper.ccp? In the long run I want to give the user a choice between different engines, but which one should I try to implement first? |
Beta Was this translation helpful? Give feedback.
-
@mike-fabian Thank you very much for your support. I tried a few different The other
If you install whispercpp is a I prefered |
Beta Was this translation helpful? Give feedback.
-
Another thing I noticed now, whispercpp added mic support. I just tested it after building it with
It perfectly transcribes my broken accent. |
Beta Was this translation helpful? Give feedback.
-
I am running Ubuntu 22.04 Mate. I set up whispercpp and I use it with:
It performs very well in my language. I want to integrate it into
ibus-typing-booster
.Michael Sheldon posted a
deepspeech
module for youribus-typing-booster
here but I don't know how to integrate it toibus-typing-booster
in my case.Beta Was this translation helpful? Give feedback.
All reactions