Investigate using error to signal with pre-emphasis filter and custom dataloader #11
Comments
@GuitarML I had a look at the paper you sent me and realized that there might be a problem with the way the model is trained in this repo: it looks like we're taking a window of size n for input, but providing an output for timestep n+1. I think if we switch to that formulation (I'll start trying tonight and let you know how it goes), then we should hopefully see some better results.
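To make the two alignments concrete, here is a minimal plain-Python sketch. `make_windows` and `target_offset` are hypothetical names for illustration and are not taken from the repo or the paper. With `target_offset=1` the target is the sample one step after the window, which is how the comment above describes the current training setup; with `target_offset=0` the target lines up with the last sample of the window.

```python
def make_windows(x, y, n, target_offset=0):
    """Pair each length-n input window with a single target sample.

    target_offset=0: target is aligned with the last sample of the window.
    target_offset=1: target is the sample one step after the window.
    (Hypothetical helper, for illustration only.)
    """
    pairs = []
    last_start = len(x) - n - target_offset
    for start in range(last_start + 1):
        window = x[start:start + n]
        target = y[start + n - 1 + target_offset]
        pairs.append((window, target))
    return pairs


x = [0, 1, 2, 3, 4]        # toy input signal
y = [10, 11, 12, 13, 14]   # toy output signal
aligned = make_windows(x, y, n=3, target_offset=0)
shifted = make_windows(x, y, n=3, target_offset=1)
```

Note that the shifted variant yields one fewer training pair, since the final window has no following sample to predict.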
Still working on this! Found a pretty big contributor to the problem. As was mentioned in the issue you linked to, there is a problem with how the pre-emphasis filter is being calculated. Right now, you are concatenating the vector and the coefficient * the filter. Ex:
What we really want to do is subtract the coefficient * the previous value from the array. For example:
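As a rough sketch of the subtraction described above (this is not the code from the repo or the eventual PR), a first-order pre-emphasis filter computes `y[t] = x[t] - coeff * x[t-1]`. The function name, the default `coeff=0.95`, and the choice to pass the first sample through unchanged are all assumptions made for illustration.

```python
def pre_emphasis(x, coeff=0.95):
    """First-order pre-emphasis: y[t] = x[t] - coeff * x[t-1].

    The first sample has no predecessor, so it is passed through
    unchanged here (an assumed convention). The output has the same
    length as the input, which concatenation would not give.
    """
    if not x:
        return []
    return [x[0]] + [x[t] - coeff * x[t - 1] for t in range(1, len(x))]


filtered = pre_emphasis([1.0, 2.0, 3.0], coeff=0.5)
```

The same element-wise subtraction can be written with shifted tensor slices in TensorFlow; plain lists are used here only to keep the sketch dependency-free.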
Practically, here's how we'd implement this in tensorflow:
I'm just starting to test this out, and it's looking good so far! Trying to listen better for audio quality now. I'm going to try and have a PR open for this either tonight or tomorrow. Happy Valentine's day!
Can't wait to try it out, good work!
A new method of loading the data by batch was developed and applied to the Colab notebook. The new method fixed the out-of-memory issues by loading the data one batch (batch_size) at a time, instead of all at once or in large chunks. However, it also caused the original error-to-signal loss calculation to blow up, so the loss function was changed to MSE. The original error-to-signal loss produces more accurate models, especially on highly distorted/complex guitar signals.
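For reference, a minimal sketch of the two losses, assuming the common definition of error-to-signal ratio (summed squared error divided by the summed squared target). The function names and the `eps` guard are illustrative, not the notebook's code. Unlike MSE, this ratio is normalized by the energy of the target in each batch, so a batch with a very quiet target can produce a large loss from a tiny absolute error. Whether that is what made the loss blow up here is not established in this thread.

```python
def mse(y_true, y_pred):
    """Mean squared error over one batch of samples."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def error_to_signal(y_true, y_pred, eps=1e-10):
    """Error-to-signal ratio: sum((y - y_hat)^2) / sum(y^2).

    eps is an assumed small constant that keeps the denominator
    away from zero when the target is silent.
    """
    err = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    energy = sum(t ** 2 for t in y_true)
    return err / (energy + eps)


# Same relative error, very different signal levels.
loud_esr = error_to_signal([1.0, -1.0], [0.5, -0.5])
quiet_esr = error_to_signal([1e-4, -1e-4], [0.0, 0.0])
quiet_mse = mse([1e-4, -1e-4], [0.0, 0.0])
```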
Investigate using the original error to signal loss function with the custom Sequence dataloader class from the Colab notebook.
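A batch-at-a-time loader of the kind described can be sketched as below. This is a dependency-free stand-in, not the class from the Colab notebook: `BatchLoader` is a hypothetical name, and it only mirrors the `__len__` / `__getitem__` interface that a Keras `Sequence` subclass implements.

```python
import math


class BatchLoader:
    """Serve (x, y) one batch at a time instead of all at once.

    Hypothetical stand-in that mimics the __len__/__getitem__
    interface of a Keras Sequence without importing TensorFlow.
    """

    def __init__(self, x, y, batch_size):
        self.x = x
        self.y = y
        self.batch_size = batch_size

    def __len__(self):
        # Number of batches per epoch; the last batch may be short.
        return math.ceil(len(self.x) / self.batch_size)

    def __getitem__(self, idx):
        if not 0 <= idx < len(self):
            raise IndexError(idx)
        lo = idx * self.batch_size
        hi = lo + self.batch_size
        return self.x[lo:hi], self.y[lo:hi]


loader = BatchLoader(list(range(5)), list(range(10, 15)), batch_size=2)
first_batch = loader[0]
last_batch = loader[len(loader) - 1]
```

Because the final batch can be shorter than `batch_size`, a loss that normalizes per batch sees a different number of samples there, which is worth keeping in mind when comparing loss functions.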
Update: It looks like the pre-emphasis filter is calculated incorrectly, as noted here:
GuitarML/PedalNetRT#16 (comment)
Apply the fix of using [t-1] and see if this fixes the instability issues.