
[Question] Can we reconstruct the original audio from the latent embeddings? #3

Open
sleepingcat4 opened this issue Sep 10, 2024 · 7 comments


@sleepingcat4

I wanted to use your project to create a dataset, but if the original audio can be reconstructed, then I can't. That's why I need to know before getting started. [I want to make latents for 20-30 TB of data]

@marcoppasini
Collaborator

Yes, latents can be decoded back to waveforms (this is the main goal of Music2Latent), so you may want to explore other models for your use case.
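For reference, the round trip is roughly `waveform -> encode -> latents -> decode -> waveform`, and you can quantify how faithful the reconstruction is with a signal-to-noise ratio. A minimal sketch below; the `EncoderDecoder` calls in the comment follow the README-style usage and are assumptions here, and the "reconstruction" is a synthetic stand-in so the snippet runs on its own:

```python
import numpy as np

# In practice the round trip would use music2latent, roughly:
#   from music2latent import EncoderDecoder
#   encdec = EncoderDecoder()
#   latents = encdec.encode(wv)      # waveform -> latents
#   wv_rec = encdec.decode(latents)  # latents -> waveform

def reconstruction_snr(wv, wv_rec):
    """Signal-to-noise ratio (dB) between original and decoded audio."""
    noise = wv - wv_rec
    return 10.0 * np.log10(np.sum(wv ** 2) / np.sum(noise ** 2))

# Synthetic stand-in: a 440 Hz sine plus a small "reconstruction" error.
sr = 44100
t = np.arange(sr) / sr
wv = np.sin(2 * np.pi * 440.0 * t)
wv_rec = wv + 0.01 * np.random.default_rng(0).standard_normal(sr)

print(f"{reconstruction_snr(wv, wv_rec):.1f} dB")
```

A high SNR on the decoded audio means the source material is effectively recoverable from the latents, which is the concern raised above.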

@sleepingcat4
Author

sleepingcat4 commented Sep 10, 2024

@marcoppasini are you planning to open-source the original training code? If so, then I may use your model for training and completely hide the source of my audio.

@marcoppasini
Collaborator

Hey @sleepingcat4 I have just released the training code under the 'training' branch, feel free to try it out!

@sleepingcat4
Author

@marcoppasini btw, did you consider classification with your model? I tried, but didn't get good results compared to Wav2vec.

@marcoppasini
Collaborator

@sleepingcat4 I tried some music-related downstream tasks in the paper (https://arxiv.org/abs/2408.06500), but I did not explore much more unfortunately.
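A common recipe for such downstream tasks is to pool each track's latent sequence over time into a fixed-size vector and fit a lightweight classifier on top. A minimal sketch below; the per-track latents would come from Music2Latent's encoder in practice (that call and the `(channels, frames)` shape are assumptions), so random stand-ins are used here to keep it self-contained:

```python
import numpy as np

# In practice, per-track latents would come from music2latent, e.g.
#   latents = encdec.encode(wv)  # assumed shape: (channels, frames)
# Here we use random stand-ins with that shape.
rng = np.random.default_rng(0)

def pool_latent(latent):
    """Mean + std pooling over time -> fixed-size feature vector."""
    return np.concatenate([latent.mean(axis=1), latent.std(axis=1)])

# Two synthetic "classes" with slightly different latent statistics.
class_a = [rng.normal(0.0, 1.0, (64, 200)) for _ in range(20)]
class_b = [rng.normal(0.5, 1.0, (64, 200)) for _ in range(20)]

X = np.stack([pool_latent(l) for l in class_a + class_b])
y = np.array([0] * 20 + [1] * 20)

# Nearest-centroid classifier: enough to sanity-check separability.
centroids = np.stack([X[y == c].mean(axis=0) for c in (0, 1)])
pred = np.argmin(((X[:, None, :] - centroids[None]) ** 2).sum(-1), axis=1)
accuracy = (pred == y).mean()
print(accuracy)
```

Swapping the nearest-centroid step for a logistic regression or small MLP is the usual next step once the pooled features show any separation.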

@sleepingcat4
Author

@marcoppasini do you have any experience with the feature extractor and how it can be used for downstream tasks like classification or analysis in general?

Because if you have some examples to show, I have enough data to run some good further experiments.

@sleepingcat4
Author

@marcoppasini btw I wanted to ask if you are available for a collaboration. I have been planning to release a dataset in the next few days that is supposed to be an improvement on LAION-DISCO 12M in many regards; both mine and LAION-DISCO 12M are based on music.

The lab I am working with now, LAION AI, is looking for someone who can help us and lend a hand in training and creating a foundational music generative model and a few other generative models in both music and human speech, including several dataset releases.

I love this Music2latent project as it removes the fundamental burden of modelling music through naive terms (tempo, time differences, components, etc.), so having you with us would definitely be interesting.

We have the compute and resources, but finding someone with a niche in music is a bit hard, so we would definitely appreciate any help.
