CSC 475 Final Project

Shae Brown & Jarred Hawkins

See our written report for more technical info.

A live demo based off of the deploy branch is available here.

Installation

Install all the dependencies in the project using your package manager of choice. For pip users this will be pip install -r requirements.txt

Then run the app using:

flask run

Using the program

Once the app is running, open the flask webpage (defaults to 127.0.0.1:5000).

This will prompt you to upload an audio file. Currently .wav and .mp3 files are supported.

Once you upload a file it will redirect to the annotation editing page. This page allows you to:

create new drum events by double clicking the canvas
edit current event times by dragging and dropping the circle
changing/deleting classes by right clicking the events

Retraining the model

Right now there is no way to do this using the front end. You may retrain the model by running the train.py file. This file retrains a model using the data in static/test_data.

The format of the training data is as follows:

File structure:

static/test_data
├── song1_name
│   ├── Bass drum.txt
│   ├── song1.wav
│   ├── Chinese ride cymbal.txt
│   ├── Hi-hat closed.txt
│   ├── Hi-hat open.txt
│   ├── Low tom.txt
│   └── Snare drum.txt
└── song2_name
    ├── Bass drum.txt
    ├── song2.mp3
    ├── Hi-hat closed.txt
    ├── Hi-hat open.txt
    └── Snare drum.txt

Annotation file structure example:

Note: Class names in this file are optional. You may use a list of times separated by newlines as well.

0.773469387   bd
2.537142857   bd
4.323129251   bd
6.106122448   bd
7.866303854   bd
9.659297052   bd
11.426757369   bd
13.219863945   bd

Once train.py is run, you can change the model file in the header of flask_app.py

Exporting data

Using the export feature on the main window, you can output the file in the desired formats. Using python style string formatting you may also change the format of these output lines and file names.

Once the annotated data is exported, you may also move it into the the training folder and retrain the model using your annotated data.

Included data sets

Our initial model was trained with several data sets. These data sets are as follows:

Due to licensing issues, we can not redistribute the DREANSS data set along with this repository. We hope that these links will provide a starting point to anyone wishing to build their own models.

The MDB DRUMS data set is included in the static/test_data/ folder. An accompanying license file is also present in licenses/

Name		Name	Last commit message	Last commit date
Latest commit History 108 Commits
licenses		licenses
static		static
templates		templates
trained_models		trained_models
.gitignore		.gitignore
Generating-annotated-training-data-for-drum-detection-in-polyphonic-audio.pdf		Generating-annotated-training-data-for-drum-detection-in-polyphonic-audio.pdf
Procfile		Procfile
README.md		README.md
debugger.py		debugger.py
drum_annotation.py		drum_annotation.py
feature_extraction.py		feature_extraction.py
flask_app.py		flask_app.py
onset_detection.py		onset_detection.py
report_final.pdf		report_final.pdf
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CSC 475 Final Project

Installation

Using the program

Retraining the model

Exporting data

Included data sets

About

Releases

Packages

Contributors 2

Languages

ShaeBrown/csc475

Folders and files

Latest commit

History

Repository files navigation

CSC 475 Final Project

Installation

Using the program

Retraining the model

Exporting data

Included data sets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages