how to assign reward to selector? #1

yeliu918 · 2019-10-12T15:07:34Z

I read the paper and have a question on how to assign a reward to the extractor? The reasoner gets the reward 1 if it reaches the correct target entity, and the intermediate reward is 0. You mention that the extractor receives the reward from the reasoner in the step-wise. But how can reasoner give the extractor reward in each of the steps, since the reasoner can only get the reward in the end-step?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to assign reward to selector? #1

how to assign reward to selector? #1

yeliu918 commented Oct 12, 2019

how to assign reward to selector? #1

how to assign reward to selector? #1

Comments

yeliu918 commented Oct 12, 2019