WIP: BSSEval v4 #272
Conversation
Thanks for all your hard work. Provided the regression tests still pass, I am open to giving this some sort of fast-track, but I personally won't be able to do any kind of review until after Feb 9. It looks like this is still "WIP", though: I see commented-out code, early return statements, print statements, etc. in the tests. In principle I would be most happy for there to be a single reference implementation of bss_eval in Python, instead of two separate versions (one in mir_eval, one outside).
I'm not sure what you mean by this or why it's helpful.
I agree but for most other implementations in mir_eval there were existing reference implementations outside of mir_eval. So this would be something shiny and new here ;-)
We would like to use bsseval v4 in the SiSEC evaluation as part of the Python musdb parser. We are running out of time, so we basically have to use the WIP version for the evaluation campaign. I would therefore either copy the separation code into the parser or make a separate bsseval package. Having participants install our fork directly from GitHub does not seem like a great option. Do you think there are better options?
I support you doing whatever is most convenient, but I do think it would be awkward if we ended up with two PyPI packages: bss_eval_v4, and mir_eval which contains bss_eval_v4. pip can install directly from a GitHub repo/fork; maybe that's a decent option?
I personally don't mind installing mir_eval just to run separation metrics, as the full package is lightweight in terms of size and dependencies, and it's nice to not fragment the community too much.
@craffel okay, @aliutkus and I decided for now to copy bsseval v4 into our musdb18 evaluation package to reduce confusion for users (installing a mir_eval fork from GitHub is still a bit fragile for users without virtual environments). Before starting the actual code review, it would be great if we could agree on the API changes and function naming, so that we can easily switch to the new version of mir_eval in our evaluation tool as soon as this PR is merged and a new release is issued:
Seems reasonable to me. As I said, as long as things are backwards compatible and there is community consensus that this way is "correct", I am OK.
Yes. In some cases I would be interested to hear what the reaction of the larger SiSEC community is to this.
Okay, I would then suggest waiting for the LVA/ICA conference in the summer. We will then have good feedback from the community. Also, we are in contact with Emmanuel Vincent, so there is a chance that bss_eval v4 will 'officially' replace the older version and be added to the bsseval website. Until then, v4 will live in https://github.com/sigsep/sigsep-mus-eval/blob/master/museval/metrics.py. I would suggest closing the PR for now; I will reopen it when we are ready.
SG!
Okay, here is some news regarding the source separation evaluation model:
@aliutkus and I made a spontaneous rewrite of the bsseval method to fix some shortcomings in the principles of bsseval (see #270 and #271).
The BSSEval metrics, as implemented in the MATLAB toolboxes and in their re-implementation here in mir_eval, are widely used in the audio separation literature. One particularity of BSSEval is that it computes the metrics after optimally matching the estimates to the true sources through linear distortion filters. This arguably makes the criteria robust to some linear mismatches. Apart from the optional computation of all possible permutations of the sources, this matching accounts for most of the computational cost of BSSEval, especially since it is done for each evaluation window when the metrics are computed on a framewise basis.
In this year's SiSEC, we decided to drop the assumption that the distortion filters can vary over time, and instead consider them fixed over the whole length of the track. First, this significantly reduces the computational cost of evaluation, because matching needs to be done only once for the whole signal. Second, this introduces much more dynamics into the evaluation, because time-varying matching filters turn out to over-estimate performance. Third, this makes matching more robust, because the true sources are not silent throughout the whole recording, while they often are within short windows.
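To make the matching step concrete, here is a minimal, single-source sketch of least-squares distortion-filter estimation done once for the whole signal, as in v4. This is not the actual mir_eval/museval code: the function name, filter length, and test signal are my own choices, and the real implementation projects onto all sources jointly.

```python
import numpy as np

def ls_distortion_filter(ref, est, flen=10):
    """Fit an FIR filter h (length flen) mapping ref -> est by least squares.
    Simplified single-source sketch of the BSSEval projection step."""
    n = len(ref)
    # Matrix of delayed copies of the reference (one column per filter tap)
    A = np.zeros((n, flen))
    for d in range(flen):
        A[d:, d] = ref[:n - d]
    h, *_ = np.linalg.lstsq(A, est, rcond=None)
    return h, A @ h  # filter taps and the projected (matched) target signal

rng = np.random.default_rng(0)
src = rng.standard_normal(44100)
# Estimate = filtered reference plus a little noise
est = np.convolve(src, [0.9, 0.05], mode="full")[:44100]
est += 0.01 * rng.standard_normal(44100)

# v4-style: estimate one distortion filter for the entire track
h, s_target = ls_distortion_filter(src, est)
e_res = est - s_target
sdr = 10 * np.log10(np.sum(s_target**2) / np.sum(e_res**2))
```

With framewise filters (v3-style), the `lstsq` solve above would be repeated for every evaluation window, which is where most of the runtime goes.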
Technically, we are offering a new evaluation function `bss_eval()` which covers all previous functionality (images/sources, framewise/global) and additionally allows the new `v4` mode. We maintained compatibility with the old functions by wrapping `bss_eval()`. The benefit is mainly that `v4` is significantly faster: depending on the frame size, it can be up to 50%.
Long story short: we touched most of the code but are 100% backwards compatible (regression tests are passing), and now we are considering options for getting this code out as soon as possible. The reason is that we want to use it in the evaluation kit that we give out to the SiSEC participants as soon as possible (as early as next week).
Currently, we don't know if it would be a good idea to move the new bsseval version directly into mir_eval. Here are my concerns:
So maybe the best option is to have bsseval v4 in a separate Python package first and then merge it into mir_eval later this year. @craffel, are you okay with releasing a stripped-down version of mir_eval (all modules removed except separation) under a different package name (probably `bsseval`) on PyPI?