How to evaluate SDR for a stem when it is not present in the ground truth track #84

jacoblam3112 · 2024-10-17T07:12:35Z

Hi, thank you for building this awesome repo.

I have trained a custom 5-stem model (bass, drum, guitar, vocal, and other), with the 5th stem being 'other'. Some of the evaluation tracks have no 'other' stem because they only contain 4 stems to begin with. For those tracks, my model still predicts an 'other' stem, although with a very small amplitude. When I use an all-zero reference signal and calculate SDR, I get an extremely low SDR value (-80 dB, for example) because the reference signal energy is 0. This ruins the model's average SDR score. How do we generally handle such cases when quantitatively evaluating model performance?
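To make the failure mode concrete, here is a minimal sketch of an energy-ratio SDR with a small stabilizing constant. The helper name `sdr_db`, the `eps` value, and the synthetic sine-wave signals are assumptions for illustration only, not the repo's evaluation code.

```python
import numpy as np

def sdr_db(reference, estimate, eps=1e-7):
    """Energy-ratio SDR in dB (hypothetical helper; eps keeps the ratio finite)."""
    num = np.sum(reference ** 2) + eps
    den = np.sum((reference - estimate) ** 2) + eps
    return 10.0 * np.log10(num / den)

sr = 44100
t = np.arange(sr) / sr

# A stem that is present: a 220 Hz tone, estimated with a small error.
present_ref = 0.5 * np.sin(2 * np.pi * 220 * t)
present_est = present_ref + 0.01 * np.sin(2 * np.pi * 330 * t)

# A stem that is absent: silent reference, faint leakage in the estimate.
silent_ref = np.zeros(sr)
silent_est = 0.001 * np.sin(2 * np.pi * 220 * t)

sdr_present = sdr_db(present_ref, present_est)  # comfortably positive
sdr_silent = sdr_db(silent_ref, silent_est)     # strongly negative
mean_sdr = np.mean([sdr_present, sdr_silent])   # dragged down by the silent stem

print(sdr_present, sdr_silent, mean_sdr)
```

With a silent reference the numerator collapses to `eps`, so the score depends almost entirely on the arbitrary constant and on how faint the leakage is, which is why a single such stem can dominate the average.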

iver56 · 2024-10-17T08:00:03Z

I asked myself that question last year too, and couldn't find a good answer. I asked some respected researchers in the source separation community about it. They also didn't have a satisfying answer, so I decided to develop my own metric that does not have this problem. I called it logWMSE, and you can find more information about it here:

https://github.com/nomonosound/log-wmse-audio-quality/

Some people have also reported success using this metric as a training objective (i.e. as a loss function). You can find code for that here:
https://github.com/crlandsc/torch-log-wmse

ZFTurbo · 2024-10-17T08:10:07Z

Yes, I've also heard about logWMSE for such cases. It's already implemented in this repo:

Music-Source-Separation-Training/train.py, line 124 in 7e2cc6e:

    parser.add_argument("--metrics", nargs='+', type=str, default=["sdr"], choices=['sdr', 'l1_freq', 'si_sdr', 'log_wmse', 'aura_stft', 'aura_mrstft'], help='List of metrics to use.')

You can use it like this:

--metrics log_wmse sdr
--metric_for_scheduler log_wmse

jacoblam3112 · 2024-10-17T13:46:55Z

Thank you for the great answers.

jarredou · 2024-10-17T19:26:46Z

The newly added 'l1_freq' metric also behaves well on silent content (other STFT-based metrics probably do too, but I haven't tested them).
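As a rough illustration of why a spectral L1 distance stays well behaved on silent references, here is a minimal NumPy sketch. It is not the repo's `l1_freq` implementation; the function names, window, FFT size, and hop length are all assumptions chosen for the example.

```python
import numpy as np

def stft_mag(x, n_fft=1024, hop=256):
    """Magnitude STFT with a Hann window (plain NumPy, no padding)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))

def l1_spectral_distance(reference, estimate):
    """Mean absolute difference of magnitude spectrograms (lower is better)."""
    return float(np.mean(np.abs(stft_mag(reference) - stft_mag(estimate))))

sr = 44100
t = np.arange(sr) / sr
silent_ref = np.zeros(sr)
faint_est = 0.001 * np.sin(2 * np.pi * 220 * t)
loud_est = 0.1 * np.sin(2 * np.pi * 220 * t)

d_faint = l1_spectral_distance(silent_ref, faint_est)
d_loud = l1_spectral_distance(silent_ref, loud_est)
print(d_faint, d_loud)
```

Because the distance is a difference rather than a ratio, there is no division by the reference energy: with a silent reference it scales linearly with the amount of leakage in the estimate and is exactly zero when the estimate is silent too.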
