Commit
Added support for minibatch in PPO process_fn (#1168)
Closes #1164

In `PPOPolicy`, the method `process_fn()` now computes `logp_old` in minibatches instead of over the whole batch at once.

---------

Co-authored-by: Michael Panchenko <[email protected]>
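To illustrate the idea behind this change, here is a minimal sketch of evaluating old log-probabilities chunk by chunk, so that peak memory scales with the minibatch size rather than with the full batch. It is not the code from this commit: `gaussian_logp`, `logp_in_minibatches`, and the `batch_size` parameter are hypothetical names chosen for the example, and NumPy stands in for the actual policy network.

```python
import numpy as np


def gaussian_logp(obs, act):
    """Hypothetical stand-in policy: unit-variance Gaussian with mean equal to obs."""
    return -0.5 * np.sum((act - obs) ** 2 + np.log(2 * np.pi), axis=-1)


def logp_in_minibatches(logp_fn, obs, act, batch_size):
    """Evaluate logp_fn on consecutive slices and concatenate the results."""
    chunks = []
    for start in range(0, len(obs), batch_size):
        end = start + batch_size  # slicing clips the last, possibly shorter, chunk
        chunks.append(logp_fn(obs[start:end], act[start:end]))
    return np.concatenate(chunks, axis=0)


# Example data: 10 transitions with 3-dimensional observations and actions.
rng = np.random.default_rng(0)
obs = rng.normal(size=(10, 3))
act = rng.normal(size=(10, 3))

# Minibatched evaluation (hypothetical batch_size of 4) and the all-at-once reference.
logp_old = logp_in_minibatches(gaussian_logp, obs, act, batch_size=4)
logp_full = gaussian_logp(obs, act)
```

Because each sample's log-probability is computed independently, the minibatched result should match the all-at-once result; only the memory footprint differs.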