I can’t find an okish TTS to use. I tried espeak; I hate the way it sounds, though I could customize the sound.

@ajr@lemmy.ml
10 months ago

I think Mozilla TTS is the best. But I think this question should have been asked in Open Source.

Arthur Besse
10 months ago

My understanding is that Mozilla is continuing to build the CommonVoice dataset for training speech models, but they are no longer developing TTS or STT software themselves.

https://github.com/coqui-ai/TTS is the new home of what was Mozilla’s TTS project. Coqui is a new company where some of the former Mozilla speech team ended up. Coqui is continuing to develop both the TTS and STT code and models.

There are a number of other much older free software TTS options, but Coqui’s (formerly Mozilla’s) is by far the best one I’ve heard.

Jama
10 months ago

Is there a way to install these on android?

Arthur Besse
10 months ago

I went looking and found this, which implies that the TTS isn’t working on Android yet, and this, which indicates the STT library does work on Android but they have only a very simple and limited demo app so far.

I also found this Voice-Cloning repo which says it has an android app that uses Tacotron2 (one of the models coqui uses, which comes from Google) to do voice cloning… which sounds promising, but I don’t see an apk or build instructions.

Jama
10 months ago

Thanks for the answer, sadly it’s exactly what I expected: there is still no way to have a decent working TTS-STT system on android (without Google, of course)

Dessalines
10 months ago

Wow, Coqui sounds really good, I’m gonna have to find a command line thing of that.

@monobot@lemmy.ml
10 months ago

Here is a page with samples, it sounds pretty good: https://erogol.github.io/ddc-samples/

@Better_Rough_2554@lemmy.ml
9 months ago

You can only use short sentences with coqui-TTS. They should mention that somewhere in the repo, but they don’t. I thought it would be as simple as passing in a text file and getting an audio file back, like tts < input.txt > output.wav. But it only works if the text file is divided into small enough sentences, which makes it impractical for most cases.
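
One possible workaround, as a rough sketch rather than anything from the Coqui docs: split the text into short chunks first and synthesize each chunk separately. The helper names below (`split_sentences`, `chunk_sentences`, `build_commands`) and the 200-character limit are made up for illustration, and the `tts --text ... --out_path ...` invocation is my assumption about the command line tool, so check `tts --help` on your install. The script only builds the commands; it does not run them.

```python
import re


def split_sentences(text):
    """Split text after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def chunk_sentences(sentences, max_chars=200):
    """Group consecutive sentences into chunks of at most max_chars.

    A single sentence longer than max_chars is kept whole, not cut.
    max_chars=200 is an arbitrary guess, not a documented limit.
    """
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_commands(chunks, prefix="part"):
    """Build one command per chunk (assumed CLI flags, not executed here)."""
    return [
        ["tts", "--text", chunk, "--out_path", f"{prefix}_{i:04d}.wav"]
        for i, chunk in enumerate(chunks)
    ]


if __name__ == "__main__":
    sample = "Hello there. How are you? Fine!"
    for cmd in build_commands(chunk_sentences(split_sentences(sample), max_chars=20)):
        print(cmd)
```

Each command could then be run with `subprocess.run(cmd, check=True)` and the resulting WAV files joined afterwards. This does not fix the underlying limitation, it only works around it.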

Arthur Besse
9 months ago

I didn’t notice that when I tried it before, but now I see what you mean… that is really irritating :(

Also, just now I tried to have it just speak the word “hello” (no punctuation) and got something like “hello oh oh oh oh” with a bit of tonal variation in the strange sounds at the end. So, yeah, I guess they’ve got a ways to go still. Other short phrases I’m trying have good results, but somehow “hello” produces these odd sounds.

Amicese
creator
10 months ago

I agree.
