[Looking for software] voice recognition/dictation application that is FOSS
submitted a month ago by Count Regal Inkwell@pawb.social
Title.
Don't really want to give my soul to a tech megacorps, but I do a lot of writing for work and hobbies, and sometimes carpal tunnel gets the best of me.
I don't know what platform you're using but for desktop linux I use Nerd Dictation and for android I use FUTO Keyboard, both are completely offline and FOSS.
i believe FUTO keyboard isn't FOSS
You are right-ish
They have a source-available license, which allows all modification and redistribution except modification and redistribution with commercial purposes.
How you feel about that is up to you. Personally I feel like allowing commercial usage of the code is actually a major blindspot with the normal FOSS licenses, as it leaves the doors open for TiVoization.
Ah sorry. Linux Desktop, yes.
EDIT: I have now tested Nerd Dictation for a bit.
I find it funny how the English model doesn't get a single word wrong, even through my 12 inch thick brazilian accent
.... But the portuguese-brazilian model could barely understand anything I was saying.
More an observation than anything. It amused me.
Ah yes, the anglocentrism of software development. I gave up on Google translate years ago because it clearly only catered to English speakers. The other way around was such ham fisted mistranslations with English syntax, it was basically useless.
https://github.com/openai/whisper
Uhhh is that not pretty much the definition of a "tech megacorp"?
Teeeeeeeeeechnically what they linked is for running the AI model inside your own machine, and so not much is given to OpenAI when you do so other than I guess attention.
.... But yes, they are absolutely a tech megacorp. It then depends on how you define "giving your soul" when it pertains to said tech megacorp.
I've been looking at Vosk, but haven't found a good client for their API yet. Anyone else?
What about Speech Note? I have used it for transcription of Zoom calls and it does a pretty good job!
GitHub link
Gave it a shot too (it even has an option to use the same language model as nerd-dictation cited by another poster)
It works. Once again, works very well in english and just gives up in portuguese.
I guess I'll just have to accept that computers are anglo-only tools. [cackles]