At the moment the location tensor l_t is never detached from the computational graph, despite being both produced and 'consumed' by trainable modules. As far as I understand the code, this lets gradients 'backpropagate through time' in a way the authors of RAM did not intend: gradients originating in the action_network that reach the fc2 layer inside the glimpse network travel back to the previous timestep's location_network, altering its weights, and only stop once they reach the detached RNN memory vector h_t. As far as I understand, the authors intended the location_network to be trained only via reinforcement learning.
This could be a bug, or it could be an accidental improvement to the network; either way, please let me know if my understanding is correct, as I am still learning PyTorch and my project is heavily reliant on your code :)
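For reference, here is a minimal sketch of the behaviour being described. The two Linear layers are hypothetical, simplified stand-ins for the repo's glimpse and location networks (the real modules are larger); the point is only that detaching l_t blocks the supervised gradient from reaching location_network, leaving it to the REINFORCE term:

```python
import torch

# Hypothetical, simplified stand-ins for the modules discussed above; the
# real glimpse/location networks in the repo are larger than this.
glimpse_fc2 = torch.nn.Linear(2, 8)         # plays the role of fc2 in the glimpse network
location_network = torch.nn.Linear(8, 2)    # emits the next location l_t

h_prev = torch.randn(1, 8)                  # stands in for the (detached) hidden state h_t
l_t = torch.tanh(location_network(h_prev))  # l_t produced by a trainable module

# Current behaviour: l_t stays attached, so a loss computed downstream of the
# glimpse network sends gradients back into location_network.
glimpse_fc2(l_t).sum().backward()
print(location_network.weight.grad is not None)  # True: supervised gradient reached it

# Suggested behaviour: detach l_t so the supervised gradient stops here and
# location_network is trained only by the REINFORCE term.
location_network.weight.grad = None
glimpse_fc2(l_t.detach()).sum().backward()
print(location_network.weight.grad)  # None: no supervised gradient reached it
```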
Note that, aside from stopping at h_t, the gradient originating from action_network also continues recursively through g_t in core_network, modifying the l_t of every previous timestep. Meanwhile, I wonder why location_network and baseline_network have to be detached from h_t. Does anywhere in the paper suggest that core_network should be trained via the classification loss only? @Pozimek @yxiao54
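One quick way to check claims like this empirically (a generic PyTorch diagnostic, not code from this repo; the function name is made up) is to backprop a single loss term on its own and inspect which parameters actually received gradients:

```python
import torch

def params_touched_by_loss(model: torch.nn.Module, loss: torch.Tensor) -> list[str]:
    """Return names of parameters that receive a nonzero gradient from `loss` alone."""
    model.zero_grad()
    # retain_graph so the caller can still run the full combined backward afterwards
    loss.backward(retain_graph=True)
    return [
        name for name, p in model.named_parameters()
        if p.grad is not None and p.grad.abs().sum().item() > 0
    ]
```

Calling this with only the classification loss (before adding the REINFORCE and baseline terms) should reveal whether the location_network parameters are being updated by the supervised signal, with and without the detach.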