Nesterov #3232
base: main
This pull request was exported from Phabricator. Differential Revision: D63875074
❌ Deploy Preview for pytorch-fbgemm-docs failed.
5efe096 to 7625a29
Summary:
X-link: facebookresearch/FBGEMM#330
Use step_mode to cover a few special cases:
step_mode=0: embedding scaling
step_mode=1: Nesterov accelerated gradient
step_mode=2: pure EMA (compatible with previous diff)
Reviewed By: q10
Differential Revision: D63875074
7625a29 to 79389ff
79389ff to 78a11d8
78a11d8 to cd181f7
cd181f7 to d3ede55
d3ede55 to 4847322
4847322 to 0e10546
Summary:
Use step_mode to cover a few special cases:
step_mode=0: embedding scaling
step_mode=1: Nesterov accelerated gradient
step_mode=2: pure EMA (compatible with previous diff)
Differential Revision: D63875074
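
For readers unfamiliar with the three modes, the following is a minimal Python sketch of how a step_mode switch of this kind could dispatch. It is not the FBGEMM kernel from this diff. The names `StepMode` and `apply_step`, the parameters `lr`, `momentum`, and `scale`, and the exact update rules are all assumptions made for illustration: mode 0 is assumed to multiply the weight by a scale factor, mode 1 uses the textbook Nesterov momentum update, and mode 2 is assumed to keep an exponential moving average of the weight.

```python
from enum import IntEnum


class StepMode(IntEnum):
    # Values mirror the summary above; the enum itself is hypothetical.
    EMBEDDING_SCALING = 0
    NESTEROV = 1
    PURE_EMA = 2


def apply_step(weight, grad, state, step_mode,
               lr=0.1, momentum=0.9, scale=1.0):
    """Return (new_weight, new_state) for one scalar parameter.

    Illustrative only: the real update rules live in the FBGEMM diff.
    """
    mode = StepMode(step_mode)  # raises ValueError for unknown modes

    if mode is StepMode.EMBEDDING_SCALING:
        # Assumption: scale the weight and leave the state untouched.
        return weight * scale, state

    if mode is StepMode.NESTEROV:
        # Textbook Nesterov momentum:
        #   v <- momentum * v + g
        #   w <- w - lr * (g + momentum * v)
        new_state = momentum * state + grad
        new_weight = weight - lr * (grad + momentum * new_state)
        return new_weight, new_state

    # PURE_EMA. Assumption: the state tracks an EMA of the weight,
    # and the weight itself is not modified.
    new_state = momentum * state + (1.0 - momentum) * weight
    return weight, new_state


if __name__ == "__main__":
    w, v = 1.0, 0.0
    for mode in StepMode:
        print(mode.name, apply_step(w, 0.5, v, mode, scale=0.5))
```

Using an integer-valued enum keeps the call site compatible with a plain integer step_mode argument while rejecting values outside 0 to 2.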