Decrease the infeasible reward.
A feasible placement can legitimately receive a reward smaller than -1, so the infeasible reward is lowered from -1.0 to -4.0.

PiperOrigin-RevId: 663800480
Change-Id: I7f5064dfee8dd228ffb6b643e56f2ce240d4f574
esonghori authored and copybara-github committed Aug 16, 2024
1 parent 53b036d commit d90d595
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion circuit_training/environment/environment.py
@@ -133,7 +133,7 @@ def cost_info_function(
 class CircuitEnv(object):
   """Defines the CircuitEnv class."""

-  INFEASIBLE_REWARD = -1.0
+  INFEASIBLE_REWARD = -4.0

   def __init__(
       self,
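
To illustrate the motivation stated in the commit message, here is a minimal sketch of how a feasible reward can fall below -1.0 and so rank below the old infeasible value. The `proxy_reward` function, its weights, and the sample cost values are hypothetical and chosen only for illustration; they are not taken from `environment.py`. Only the two constant values come from the diff.

```python
# Values from the diff: the old and new infeasible rewards.
OLD_INFEASIBLE_REWARD = -1.0
NEW_INFEASIBLE_REWARD = -4.0


def proxy_reward(wirelength, density, congestion,
                 wirelength_weight=1.0, density_weight=1.0,
                 congestion_weight=0.5):
  """Hypothetical reward: the negative of a weighted cost sum.

  The weights and the functional form are assumptions for this sketch,
  not the repository's actual cost function.
  """
  cost = (wirelength_weight * wirelength
          + density_weight * density
          + congestion_weight * congestion)
  return -cost


# A feasible placement with made-up cost components.
feasible = proxy_reward(wirelength=0.4, density=0.5, congestion=0.6)

# With the old constant, this feasible placement scores worse than an
# infeasible one, so the infeasible reward no longer acts as a lower bound.
worse_than_old = feasible < OLD_INFEASIBLE_REWARD

# With the new constant, the feasible placement still scores higher.
better_than_new = feasible > NEW_INFEASIBLE_REWARD

print(feasible, worse_than_old, better_than_new)
```

Under these assumptions, the new value only helps as long as feasible rewards stay above -4.0; the commit does not state a bound on them.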