bongovaad (বঙ্গবাদ)

bongovaad is a Python package that provides functionality for transcribing audio from YouTube videos. It utilizes the powerful ASR (Automatic Speech Recognition) model provided by the Whisper library from Hugging Face.

Features

We have already LoRA-tuned the whisper-large-v2 model on the 'bn' subset of Mozilla Common Voice 13, obtaining a Word-Error-Rate(WER) of 57, compared to the WER of 103.4 obtained by the original OpenAI paper(Page 23). More information available here.
Handles audio segmentation for longer videos using AudioSegment.

ToDo

Automatic SRT file creation

Installation

Before using bongovaad, you need to install ffmpeg:

sudo apt install ffmpeg -y

To install bongovaad, you can use pip: pip install bongovaad

Usage

bongovaad provides a command-line interface (CLI) that allows you to transcribe audio from YouTube videos. Here's how to use it:

bongovaad --url <youtube_url>

Replace <youtube_url> with the actual YouTube URL of the video you want to transcribe. The output will be written to text files containing the transcriptions of the audio segments.

Example

bongovaad --url https://www.youtube.com/watch?v=ABC12345

This command transcribes the audio from the YouTube video with the specified URL (https://www.youtube.com/watch?v=ABC12345).

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Contributing

Contributions are welcome! Please refer to the contributing guidelines for more information. If you encounter any issues or have suggestions for improvements, please create a new issue on the GitHub repository.

Acknowledgements

PEFT

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bongovaad.py		bongovaad.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bongovaad (বঙ্গবাদ)

Features

ToDo

Installation

Usage

Example

License

Contributing

Acknowledgements

About

Releases

Packages

Languages

License

ucalyptus/bongovaad

Folders and files

Latest commit

History

Repository files navigation

bongovaad (বঙ্গবাদ)

Features

ToDo

Installation

Usage

Example

License

Contributing

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages