
Running umxhq on a large test track (Georgia Wonder - Siren) blows up memory >64GB #113

Open · sevagh opened this issue Feb 10, 2022 · 11 comments


sevagh commented Feb 10, 2022

Running the umxhq separator with the default Wiener separation (niter=1) really blows up my memory usage when I run umx on the CPU. Is it really supposed to do that?

I could swear this used to run fine before, and I never had more than 64 GB of RAM. It sounds like a conspiracy, but I wonder whether an ffmpeg version upgrade could be silently causing more memory to be used?


sevagh commented Feb 10, 2022

I just saw the other suggestion to do inference in 30-second chunks, so I'll do it that way (rough sketch below).
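For reference, a rough sketch of what that chunked inference could look like, assuming the `openunmix.umxhq` Python API that returns a `Separator`; the chunk length and the naive concatenation are illustrative, not the project's recommended recipe:

```python
# Rough sketch: bound peak memory by separating 30 s of audio at a time.
import torch
import torchaudio
from openunmix import umxhq  # assumes the openunmix package layout

separator = umxhq(device="cpu", niter=1)
audio, rate = torchaudio.load("mixture.wav")  # (channels, samples)
audio = audio.unsqueeze(0)                    # (1, channels, samples)

chunk = rate * 30  # 30-second chunks
with torch.no_grad():
    parts = [
        separator(audio[..., s:s + chunk])
        for s in range(0, audio.shape[-1], chunk)
    ]
estimates = torch.cat(parts, dim=-1)  # (1, targets, channels, samples)
```

Naive concatenation can leave audible seams at chunk boundaries; an overlap-add scheme would smooth those at the cost of some extra compute.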

sevagh closed this as completed Feb 10, 2022
aliutkus (Member) commented:

Hmm, could you check whether the batch_size parameter inside the expectation_maximization method of filtering.py is being used?

If not, it means the system is trying to process the whole track at once, which may be the source of the problem.
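For context, the batching in question looks roughly like this (paraphrased from `filtering.py`; not the full function):

```python
# Frames are processed in slices of at most batch_size, so the per-frame
# covariance tensors never have to cover the whole track at once.
import torch

nb_frames, batch_size = 18518, 200  # illustrative values
for pos in range(0, nb_frames, batch_size):
    t = torch.arange(pos, min(nb_frames, pos + batch_size))
    # ... v[t], x[t], y[t] are then sliced with this index tensor ...
```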


sevagh commented Feb 10, 2022

Yes, it is being used (the default of 200).

aliutkus (Member) commented:

OK, and when you have 0 iterations, it works fine?


sevagh commented Feb 10, 2022

Yes, umxhq(device="cpu", niter=0) works well. The total memory usage stays at 29 GB, while with niter=1 it grows to >64 GB and gets killed. I guess this is a duplicate of #7, which is my bad.

I'm just surprised, because it's the first time I've had an issue running a full evaluation.

aliutkus (Member) commented:

OK. Oh, I guess I should fix the memory usage.

aliutkus (Member) commented:

What's the length of the track?


sevagh commented Feb 10, 2022

If you'd like, I can take a look with memory_profiler and see whether I can find any savings to contribute to this project.

The song looks like it's 7:10:

```
(nsgt-torch) sevagh:nsgt $ mpv /run/media/sevagh/windows-games/MDX-datasets/MUSDB18-HQ/test/Georgia\ Wonder\ -\ Siren/mixture.wav
 (+) Audio --aid=1 (pcm_s16le 2ch 44100Hz)
AO: [pulse] 44100Hz stereo 2ch s16
A: 00:00:00 / 00:07:10 (0%) Cache: 429s/81MB
```
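A back-of-the-envelope estimate of why a 7:10 track is heavy, assuming 44.1 kHz, n_fft=4096, hop=1024, float32, and the (frames, bins, channels, 2, sources) layout used in filtering.py:

```python
duration_s = 7 * 60 + 10               # 430 s
frames = duration_s * 44100 // 1024    # ~18.5k STFT frames
bins = 4096 // 2 + 1                   # 2049 frequency bins
elems = frames * bins * 2 * 2 * 4      # channels * (re, im) * 4 sources
print(f"{elems * 4 / 2**30:.1f} GiB")  # ~2.3 GiB per full-track tensor
```

Each EM iteration materializes several tensors of that shape, so a handful of temporaries is enough to multiply that figure many times over.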

sevagh reopened this Feb 10, 2022
aliutkus (Member) commented:

Well, OK, we could do that together! Thanks.

(I'm not super available these days, but I'm curious about it.) Normally this batch_size parameter should save quite a lot of RAM, so could you profile as a start to see which tensors are exploding?

Are you tracking gradients, by the way?
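A minimal sketch of what running without gradient tracking looks like, in case that's the culprit (`separator` and `audio` as in the earlier sketch):

```python
# Minimal sketch: run inference without building an autograd graph.
import torch

audio = audio.detach()            # drop any existing graph on the input
with torch.no_grad():             # no intermediate buffers are retained
    estimates = separator(audio)
```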


sevagh commented Feb 10, 2022

I just tried disabling grad on my audio tensor; it didn't save much.

Some heavy lines from my memory_profiler run:

```
278 21639.691 MiB 1933.609 MiB          30           v = torch.mean(torch.abs(y[..., 0, :]) ** 2 + torch.abs(y[..., 1, :]) ** 2, dim=-2)

307 21639.691 MiB    0.000 MiB          54               Cxx = regularization
308 21639.691 MiB    0.000 MiB         270               for j in range(nb_sources):
309 21639.691 MiB 3472.941 MiB         216                   Cxx = Cxx + (v[t, ..., j, None, None, None] * R[j][None, ...].clone())

332 48965.359 MiB 3347.324 MiB         516                   gain = gain * v[t, ..., None, None, None, j]
333
334                                                         # apply it to the mixture
335 48965.359 MiB -2756.098 MiB        1548                   for i in range(nb_channels):
336 48965.359 MiB 8034.758 MiB        1032                       y[t, ..., j] = _mul_add(gain[..., i, :], x[t, ..., i, None, :], y[t, ..., j])
```
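One candidate saving in the `Cxx` accumulation, hedged: the `.clone()` looks redundant (the product already allocates a fresh tensor), and rebinding `Cxx = Cxx + ...` allocates another full-size tensor per source. A sketch that materializes `Cxx` once at its broadcast shape and accumulates in place, assuming `regularization` broadcasts over the frame axis as in `filtering.py` (untested):

```python
# Materialize Cxx at its final broadcast shape once, then accumulate
# in place; avoids one full-size allocation (and one clone) per source.
Cxx = regularization.expand(len(t), -1, -1, -1, -1).clone()
for j in range(nb_sources):
    Cxx += v[t, ..., j, None, None, None] * R[j][None, ...]
```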


sevagh commented Feb 10, 2022

I thought I could be smart and only apply Wiener filtering up to max_bin = bandwidth_to_bin(16000). It saves ~5-10 GB of memory but loses a bit of SDR.
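A sketch of that cutoff, with the bin conversion spelled out; the formula is the standard bandwidth-to-FFT-bin mapping, and the `bandwidth_to_bin` helper name is from my experiment, not the library:

```python
import math

def bandwidth_to_bin(bandwidth, n_fft=4096, rate=44100.0):
    # number of STFT bins covering frequencies up to `bandwidth` Hz
    return int(math.ceil(bandwidth * n_fft / rate)) + 1

max_bin = bandwidth_to_bin(16000)  # ~1488 of 2049 bins at these defaults
# Wiener/EM then runs only on the spectrogram bins below max_bin, with the
# bins above it taken from the mixture-phase estimates instead.
```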
