[Question] What actions are expected for a previously terminated sub-env in a vector (next env.step()
should return reset obs so the actions shouldn't matter)?
#1077
Labels
question
Further information is requested
Question
Quick question:
In a vector env with say 2 sub-envs (0 and 1).
When I do
env.step([action0, action1])
and now sub-env 0 is terminated (returns the final observation), what action would the env expect next for sub-env 0? I would assume that it wouldn't matter, b/c the nextenv.step([action0, action1])
should return the reset obs for sub-env 0 anyways, correct? So can I pass in any random action?The text was updated successfully, but these errors were encountered: