zuoxiang95 / Speech-Image_Tool Public

Notifications You must be signed in to change notification settings
Fork 2
Star 3

This repository are used to collect some programs that use in TTS.

Apache-2.0 license

3 stars 2 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Image		Image
Speech		Speech
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements		requirements

Repository files navigation

Speech-Image_Tool

This repositories are used to collect some programs that preprocess audio and image.

Image

The Image dir includes 3 functions to make data augmentation.

Image rotation
Random color
Random Gaussian

Speech

The Speech dir includes 4 python scripts.

character_to_pinyin.py: used to translate Chinese character to pinyin;
trim_silence.py: used to trim the silence in begin and end of audio;
mp3_translate_wav.py: used to translate .mp3 to .wav;
generate_audio.py: used to generate audio from Baidu's api;

About

This repository are used to collect some programs that use in TTS.

Apache-2.0 license

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%