[Looking for software] voice recognition/dictation application that is FOSS

submitted a month ago by Count Regal Inkwell@pawb.social

Title.

Don't really want to give my soul to a tech megacorps, but I do a lot of writing for work and hobbies, and sometimes carpal tunnel gets the best of me.

22

Log in to comment

11 Comments

I don't know what platform you're using but for desktop linux I use Nerd Dictation and for android I use FUTO Keyboard, both are completely offline and FOSS.

i believe FUTO keyboard isn't FOSS

You are right-ish

They have a source-available license, which allows all modification and redistribution except modification and redistribution with commercial purposes.

How you feel about that is up to you. Personally I feel like allowing commercial usage of the code is actually a major blindspot with the normal FOSS licenses, as it leaves the doors open for TiVoization.

a month ago , edited a month ago

Ah sorry. Linux Desktop, yes.

EDIT: I have now tested Nerd Dictation for a bit.

I find it funny how the English model doesn't get a single word wrong, even through my 12 inch thick brazilian accent

.... But the portuguese-brazilian model could barely understand anything I was saying.

More an observation than anything. It amused me.

Ah yes, the anglocentrism of software development. I gave up on Google translate years ago because it clearly only catered to English speakers. The other way around was such ham fisted mistranslations with English syntax, it was basically useless.

a month ago

Uhhh is that not pretty much the definition of a "tech megacorp"?

Teeeeeeeeeechnically what they linked is for running the AI model inside your own machine, and so not much is given to OpenAI when you do so other than I guess attention.

.... But yes, they are absolutely a tech megacorp. It then depends on how you define "giving your soul" when it pertains to said tech megacorp.

I've been looking at Vosk, but haven't found a good client for their API yet. Anyone else?

What about Speech Note? I have used it for transcription of Zoom calls and it does a pretty good job!
GitHub link

Gave it a shot too (it even has an option to use the same language model as nerd-dictation cited by another poster)

It works. Once again, works very well in english and just gives up in portuguese.

I guess I'll just have to accept that computers are anglo-only tools. [cackles]