Good quality with 7-8 second generation times, using about 4.5Gigs of VRAM #610
Pandaily591
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've been tweaking tortoise for a little while and got something i'm happy with.
You can find the code here:
https://github.com/Pandaily591/OnlySpeakTTS
Essentially,
There are some other things, and some issues that can be resolved.
Generating a voice from clips has some randomization involved, so you may get a bad voice.
You should keep generating voices and testing them, until you find a generation that performs well on different sentences.
Save this voice's tensors to files and load them in the future, instead of generating new ones each time
There is an example video if you're curious.
Beta Was this translation helpful? Give feedback.
All reactions