Voice Synthesis Telegram Bot

This bot provides the ability to synthesize voice samples using tts-tortoise API from local host. It has multilanguage support and voice-to-voice conversion via whisper model.

There should be a running instance of the bot, if you would like to check it out at https://t.me/tts_tbot

Here's the preview:

Notable features

all of the features of tts-tortoise, including voice cloning, emotion steering, multisampling
user-settings database
custom voice adding
voice-to-voice conversion via mic recording

Prerequisites

NVIDIA GPU (between 4 and 10Gb of VRAM required for inference, depending on tortoise settings)
python 3.10.6
pip or anaconda env with python 3.10.6
ffmpeg

Tested on Linux only, written with cross-platform in mind, should work on Windows. Although tortoise-tts environment may fail to resolve.

install

Clone the repo with submodules

git clone --recurse-submodules $(URL)

For example, let's create new conda environment and install bot there

conda create -n voice_tbot python=3.10.6
conda activate voice_tbot

cd to repo directory and execute to install dependencies:

python -m pip install -r ./requirements.txt

Usage

Create configuration file named "config" inside bot_data directory and set user parameters. Use this example as reference.

Run module package voice_bot from the required environment, for example - from repo directory:

python -m voice_bot

Notes on text promts (how to get desired results)

Use punctuation (ellipses, exclamation points, CAPS, semicolons, commas) to add emphasis and shape the speech. You can also try to add different emotions to sentences by prepending parts of texts with "[describe emotion]" notations (For more info please visit the original model source page).

TODO

add synthensis from audio files

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
bot_data		bot_data
faster-whisper @ f144e4c		faster-whisper @ f144e4c
rsc		rsc
tortoise-tts-fast @ 46b30a1		tortoise-tts-fast @ 46b30a1
voice_bot		voice_bot
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Synthesis Telegram Bot

Notable features

Prerequisites

install

Usage

Notes on text promts (how to get desired results)

TODO

License

About

Releases

Packages

Languages

License

Helther/voice-pick-tbot

Folders and files

Latest commit

History

Repository files navigation

Voice Synthesis Telegram Bot

Notable features

Prerequisites

install

Usage

Notes on text promts (how to get desired results)

TODO

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages