🔔 Pydiogment

Pydiogment aims to simplify audio augmentation. It generates multiple audio files based on a starting mono audio file. The library can generates files with higher speed, slower, and different tones etc.

📥 Installation

Dependencies

Pydiogment requires:

Python (>= 3.5)
NumPy (>= 1.17.2)
SciPy (>= 1.3.1)
FFmpeg

On Linux

On Linux you can use the following commands to get the libraries:

Numpy: pip install numpy
Scipy: pip install scipy
FFmpeg: sudo apt install ffmpeg

On Windows

On Windows you can use the following installation binaries:

Numpy: https://www.lfd.uci.edu/~gohlke/pythonlibs/#numpy or if you have Python already installed you can use install it using pip3 install numpy
Scipy: https://www.lfd.uci.edu/~gohlke/pythonlibs/#scipy
FFmpeg: https://ffmpeg.org/download.html#build-windows

On MacOS

On MacOs, use homebrew to install the packages:

Numpy: brew install numpy --with-python3
Scipy: You need to first install a compilation tool like Gfortran using homebrew brew install gfortran when it's done, install Scipy pip install scipy for more information and guidelines you can check this link: https://github.com/scipy/scipy/blob/master/INSTALL.rst.txt#mac-os-x
FFmpeg: brew install ffmpeg

Installation

If you already have a working installation of NumPy and SciPy , you can simply install Pydiogment using pip:

pip install pydiogment

To update an existing version of Pydiogment, use:

pip install -U pydiogment

💡 How to use

Amplitude related augmentation

Apply a fade in and fade out effect

from pydiogment.auga import fade_in_and_out

test_file = "path/test.wav"
fade_in_and_out(test_file)

Apply gain to file

from pydiogment.auga import apply_gain

test_file = "path/test.wav"
apply_gain(test_file, -100)
apply_gain(test_file, -50)

Add Random Gaussian Noise based on SNR to file

from pydiogment.auga import add_noise

test_file = "path/test.wav"
add_noise(test_file, 10)

Frequency related augmentation

Change file tone

from pydiogment.augf import change_tone

test_file = "path/test.wav"
change_tone(test_file, 0.9)
change_tone(test_file, 1.1)

Time related augmentation

Slow-down/ speed-up file

from pydiogment.augt import slowdown, speed

test_file = "path/test.wav"
slowdown(test_file, 0.8)
speed(test_file, 1.2)

Apply random cropping to the file

from pydiogment.augt import random_cropping

test_file = "path/test.wav"
random_cropping(test_file, 1)

Change shift data on the time axis in a certain direction

from pydiogment.augt import shift_time

test_file = "path/test.wav"
shift_time(test_file, 1, "right")
shift_time(test_file, 1, "left")

Audio files format

This library currently supports mono WAV files only.

📑 Documentation

A thorough documentation of the library is available under pydiogment.readthedocs.io.

👷 Contributing and bugs report

Contributions are welcome and encouraged. To learn more about how to contribute to Pydiogment please refer to the Contributing guidelines

To report bugs, request a feature or just ask for help you can refer to the issues section. Before reporting a bug please make sure it is not addressed by an older issue and make sure to add your operating system type, its version number and the versions of the dependencies used.

🎉 Acknowledgment and credits

The test file used in the pytests is OSR_us_000_0060_8k.wav from the Open Speech Repository.

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
docs		docs
paper		paper
pydiogment		pydiogment
tests		tests
.codecov.yml		.codecov.yml
.coveragerc		.coveragerc
.coveralls.yml		.coveralls.yml
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
.travis.yml		.travis.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
appveyor.yml		appveyor.yml
language.json		language.json
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔔 Pydiogment

📥 Installation

Dependencies

On Linux

On Windows

On MacOS

Installation

💡 How to use

Amplitude related augmentation

Apply a fade in and fade out effect

Apply gain to file

Add Random Gaussian Noise based on SNR to file

Frequency related augmentation

Change file tone

Time related augmentation

Slow-down/ speed-up file

Apply random cropping to the file

Change shift data on the time axis in a certain direction

Audio files format

📑 Documentation

👷 Contributing and bugs report

🎉 Acknowledgment and credits

About

Releases 4

Packages

Contributors 3

Languages

License

SuperKogito/pydiogment

Folders and files

Latest commit

History

Repository files navigation

🔔 Pydiogment

📥 Installation

Dependencies

On Linux

On Windows

On MacOS

Installation

💡 How to use

Amplitude related augmentation

Apply a fade in and fade out effect

Apply gain to file

Add Random Gaussian Noise based on SNR to file

Frequency related augmentation

Change file tone

Time related augmentation

Slow-down/ speed-up file

Apply random cropping to the file

Change shift data on the time axis in a certain direction

Audio files format

📑 Documentation

👷 Contributing and bugs report

🎉 Acknowledgment and credits

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 3

Languages

Packages