Problem with generated audio from pre-trained checkpoints #11

Open
swamiviv opened this issue Jan 24, 2021 · 3 comments

@swamiviv

I used the pretrained checkpoints (64md_8k) for the sc09 dataset and generated samples as recommended. I used the following to load the output and listen to it:

import scipy.io
import IPython.display as ipd

fname = 'commands_listen.mat'
mat = scipy.io.loadmat(fname)  # dict mapping variable names to NumPy arrays
sr = 22050  # sample rate in Hz
ipd.Audio(mat['reconstructed'][0, :], rate=sr)  # play the first row as audio
  1. I find that most samples are unintelligible, although I can make out some sounds here and there. Is that normal?
  2. Out of curiosity, are the examples you present on the website cherry-picked?
  3. Am I doing something wrong in generating the samples?
@andimarafioti
Member

  1. I don't think that is normal.
  2. They are not cherry-picked. It's an array of sounds, and you might notice that some sound better than others.
  3. Most likely you are doing something wrong. Can you give more details?

@andimarafioti
Member

Are you using the speech checkpoint? There are a few checkpoints for piano too.

@andimarafioti
Member

Could it be that you are indexing the matrix along the wrong axis? Try mat['reconstructed'][:, 0] instead of mat['reconstructed'][0, :].
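
One way to check the indexing question is to look at the array's shape before playing anything. The sketch below is only an illustration: `pick_waveform` is a hypothetical helper, not part of this repository, and it assumes the time axis is the longer of the two dimensions. A random array stands in for mat['reconstructed'], whose actual layout is not stated in this thread.

```python
import numpy as np


def pick_waveform(signals, index=0):
    """Return one 1-D clip from a 2-D array, treating the longer axis as time.

    Hypothetical helper: it assumes each clip has more samples than there
    are clips in the array.
    """
    signals = np.asarray(signals)
    if signals.ndim != 2:
        raise ValueError(f"expected a 2-D array, got shape {signals.shape}")
    n_rows, n_cols = signals.shape
    if n_rows >= n_cols:
        # Time runs down the rows, so each column is one clip.
        return signals[:, index]
    # Time runs along the columns, so each row is one clip.
    return signals[index, :]


# Synthetic stand-in for mat['reconstructed']: 16384 samples by 4 clips.
demo = np.random.default_rng(0).standard_normal((16384, 4))
clip = pick_waveform(demo)
print(demo.shape, clip.shape)
```

If the printed shape of the real array shows the long axis first, then [0, :] picks a single time step across all clips rather than one clip, which would explain noise-like playback.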
