Audio
A library for audio and music analysis, feature extraction.
Audio Plugin for Audio to MIDI transcription using deep learning.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
🔊 Text-Prompted Generative Audio Model
Singing Voice Conversion via diffusion model
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.
リアルタイムボイスチェンジャー Realtime Voice Changer
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
speech self-supervised representations
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A flexible universal ASIO driver that uses the PortAudio sound I/O library. Supports WASAPI (shared and exclusive), KS, DirectSound and MME.
Stream audio over UDP with low latency (can be used for remote speakers)
A multi-purpose MCU+FPGA development platform with high speed data streaming support
Think DSP: Digital Signal Processing in Python, by Allen B. Downey.
A modern C++ MIDI 1 / MIDI 2 real-time & file I/O library. Supports Windows, macOS, Linux and WebMIDI.