API that uses Tortoise and RVC to speed up text-to-speech generation.

mathieutrudeau/Fast-TTS

Fast-TTS

Fast-TTS is a module that offers a simple API for generating speech from text.

Requirements

  • Python 3.9
  • A CUDA-enabled GPU, with PyTorch installed.
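
The requirements above can be checked before running anything else. The following is a minimal sketch, not part of Fast-TTS: `check_requirements` is a hypothetical helper, and it assumes that "Python 3.9" means exactly the 3.9.x series. It relies only on `sys.version_info` and PyTorch's `torch.cuda.is_available()`.

```python
import sys


def check_requirements():
    """Return a list of unmet requirements (an empty list means all good).

    Hypothetical helper for illustration; Fast-TTS does not ship it.
    """
    problems = []

    # Assumption: the README's "Python 3.9" means the 3.9.x series exactly.
    if sys.version_info[:2] != (3, 9):
        found = "{}.{}".format(*sys.version_info[:2])
        problems.append(f"Python 3.9 expected, found {found}")

    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        # True only if PyTorch was built with CUDA and can see a GPU.
        if not torch.cuda.is_available():
            problems.append("PyTorch cannot see a CUDA-enabled GPU")

    return problems


if __name__ == "__main__":
    for problem in check_requirements():
        print("Requirement not met:", problem)
```

An empty result means the interpreter version matches and PyTorch reports a usable CUDA device.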

How to add Voices

The following steps show how to add voices.

  • Create a folder named after the speaker, for example '/MyVoice', where 'MyVoice' is the name of the speaker that you wish to add.
  • Find 1-3 audio samples of the speaker. Each sample should be about 1 minute long and in .wav format. The clearer the voice, the better the generated audio will be.
  • Add those audio samples to the speaker folder that you just created.
  • Add the .index and .pth files for the speaker model. Name them 'MyVoice.index' and 'MyVoice.pth'.
  • You can either get the .index and .pth files from trained RVC models, or you can train your own model.
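
The folder layout described in the steps above can be verified with a short script. This is a sketch under stated assumptions rather than anything Fast-TTS provides: `check_voice_folder` is a hypothetical helper, and the path to the speaker folder is passed in by the caller because the README does not say where voice folders live.

```python
from pathlib import Path


def check_voice_folder(voice_dir):
    """Return a list of problems with a speaker folder (empty if none).

    Hypothetical helper for illustration; Fast-TTS does not ship it.
    """
    voice_dir = Path(voice_dir)
    if not voice_dir.is_dir():
        return [f"{voice_dir} is not a folder"]

    # The folder name doubles as the speaker name, e.g. 'MyVoice'.
    name = voice_dir.name
    problems = []

    # The steps ask for 1-3 .wav samples of the speaker.
    wavs = [p for p in voice_dir.iterdir() if p.suffix.lower() == ".wav"]
    if not 1 <= len(wavs) <= 3:
        problems.append(f"expected 1-3 .wav samples, found {len(wavs)}")

    # The RVC model files must be named after the speaker.
    for ext in (".index", ".pth"):
        if not (voice_dir / f"{name}{ext}").is_file():
            problems.append(f"missing {name}{ext}")

    return problems


if __name__ == "__main__":
    # 'MyVoice' is the placeholder speaker name used in the steps above.
    for problem in check_voice_folder("MyVoice"):
        print(problem)
```

The script checks only file names and counts. It does not verify the length or quality of the samples, nor that the .index and .pth files are valid RVC models.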