Add new optimizer state row_counter for Adam [Frontend] #3558
Summary:
A new optional optimizer state, row_counter, is added to Adam to perform bias correction per embedding row. row_counter serves as the iteration counter for a row: it is incremented each time the row (an index) occurs, and it is used to do bias correction.
Without rowwise bias correction (existing Adam), bias correction uses the global iteration count.
With rowwise bias correction enabled, bias correction uses the row's row_counter instead.
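The difference between the two modes can be illustrated with a minimal pure-Python sketch. It is not the FBGEMM kernel: the helper names (adam_row_update, train), the hyperparameter defaults, and the dict-of-rows batch format are all assumptions made for illustration. The only idea taken from the summary is that the step count used for bias correction is either the global iteration or the per-row row_counter.

```python
import math


def adam_row_update(weight, grad, m, v, step,
                    lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one textbook Adam update to a single embedding row, in place.

    `step` is whichever counter is used for bias correction
    (global iteration or the row's own counter).
    """
    bias_corr1 = 1.0 - beta1 ** step
    bias_corr2 = 1.0 - beta2 ** step
    for i, g in enumerate(grad):
        m[i] = beta1 * m[i] + (1.0 - beta1) * g
        v[i] = beta2 * v[i] + (1.0 - beta2) * g * g
        m_hat = m[i] / bias_corr1
        v_hat = v[i] / bias_corr2
        weight[i] -= lr * m_hat / (math.sqrt(v_hat) + eps)


def train(batches, num_rows, dim, rowwise):
    """Run sparse Adam over `batches`, a list of {row_index: gradient} dicts.

    Hypothetical driver: `rowwise` selects the counter used for bias correction.
    """
    weights = [[0.0] * dim for _ in range(num_rows)]
    m = [[0.0] * dim for _ in range(num_rows)]
    v = [[0.0] * dim for _ in range(num_rows)]
    row_counter = [0] * num_rows  # per-row count of iterations with an occurrence
    for global_iter, batch in enumerate(batches, start=1):
        for row, grad in batch.items():
            row_counter[row] += 1
            step = row_counter[row] if rowwise else global_iter
            adam_row_update(weights[row], grad, m[row], v[row], step)
    return weights, row_counter


# Row 0 occurs in every iteration; row 1 first occurs at iteration 3.
batches = [{0: [1.0]}, {0: [1.0]}, {0: [1.0], 1: [1.0]}]
w_global, _ = train(batches, num_rows=2, dim=1, rowwise=False)
w_rowwise, counters = train(batches, num_rows=2, dim=1, rowwise=True)
```

In this toy run, row 0 is updated identically in both modes because its counter matches the global iteration. Row 1 differs: with the global counter its first update is corrected as if it were step 3, so the step is smaller than with the row's own counter, where it is corrected as step 1.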
This request is from IG to allow all the models to be scaled on sparse features with expected 1.5% NE on Stories.
The functionality is not enabled by default. Frontend: D64848802
To enable the bias correction, use_rowwise_bias_correction needs to be set to True through extra_optimizer_config.
Performance
* Baseline: before changes
** default: default setting; use_bias_correction = False
*** enabled: use_bias_correction = True
Reviewed By: sryap
Differential Revision: D64848802