Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need for Stage 1 Model in Stage 3 training #17

Open
sahilvora10 opened this issue Nov 29, 2023 · 1 comment
Open

Need for Stage 1 Model in Stage 3 training #17

sahilvora10 opened this issue Nov 29, 2023 · 1 comment

Comments

@sahilvora10
Copy link

Hi,

Thank you for this repository. It is great work. I have a few questions and would like to clarify some doubts.

I was successfully able to run all three stages, but I have a question: Do we need to pass the Stage 1 model to Stage 3 during training? I noticed that when I train the Stage 3 diffusion stage by loading the network of Stage 1 results through the config file, the results are better.

I pass the Stage 1 trained model under this:
"noise_model": { "resume_state":

However, if the Stage 1 Noise model is not passed through Stage 3 training, then the results drop and the network is not able to learn even after all the iterations.

Can you explain why that might be the case and why the weights and state dictionary of Stage 1 are needed for Stage 3?

Thank you.

@tiangexiang
Copy link
Collaborator

Hi, thanks for your interest in our work, and sorry for the late reply.
Yes, loading Stage1 trained model for Stage3 is a necessary step. For each iteration at stage3, we require a 'clean' image to be generated by the noise model (Stage1) (see paper fig.2). Without the pretrained Stage1 model, such a 'clean' image is not possible and Stage3 will result in degenerated solutions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants