GPU memory continue to grow for MAML #62

Open
suoql opened this issue Feb 4, 2021 · 1 comment

suoql commented Feb 4, 2021

Hi,

Thank you for sharing the code. When running MAML with the conv4 backbone, GPU memory usage keeps growing as the epochs progress, eventually causing a CUDA out-of-memory error. The problem appears to be caused by `grad = torch.autograd.grad(set_loss, fast_parameters, create_graph=True)`. When I set `create_graph=False` (which approximates first-order MAML), memory usage stays normal. This suggests that the graphs created with `create_graph=True` are not released after each epoch.
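For context, here is a minimal sketch of the inner-loop update being discussed. The toy model, data, and `inner_lr` are illustrative stand-ins; only the `torch.autograd.grad` call mirrors the code in this repository:

```python
import torch
import torch.nn as nn

# Stand-in model and data (illustrative, not the repository's conv4 backbone).
model = nn.Linear(10, 5)
inner_lr = 0.01
x, y = torch.randn(8, 10), torch.randint(0, 5, (8,))

fast_parameters = list(model.parameters())
set_loss = nn.functional.cross_entropy(model(x), y)

# create_graph=True keeps the graph of this inner-loop gradient so the outer
# (meta) loss can backpropagate through the update; if those graphs are never
# freed, memory grows across iterations.
grad = torch.autograd.grad(set_loss, fast_parameters, create_graph=True)

# First-order approximation: create_graph=False drops the extra graph and
# keeps memory flat, at the cost of ignoring second-order terms.
# grad = torch.autograd.grad(set_loss, fast_parameters, create_graph=False)

# Inner-loop SGD step on the fast weights.
fast_parameters = [p - inner_lr * g for p, g in zip(fast_parameters, grad)]
```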

Did you encounter this problem when training MAML? Could you offer any suggestions for solving it?

Thanks!

suoql commented Feb 4, 2021

The problem has been resolved by downgrading the torch version. Please disregard this issue. :)
