
Apply missing lr_mult and wd_mult to the lr and weight_decay of megatron param groups. #1960


Triggered via pull request on February 13, 2025, 16:54
Status: Success
Total duration: 40s

code-linting.yml

on: pull_request
Matrix: linting
Nemo_Linting_Test
0s