nemo-automodel checkpoint-io refactor #12070
base: main
Signed-off-by: Alexandros Koumparoulis <[email protected]>
Signed-off-by: akoumpa <[email protected]>
# if trainer.state.fn == TrainerFn.FITTING:
# # Load optimizer
# trainer.strategy.load_optimizer_state_dict(adapter_state)
# # Load lr scheduler
# if (lr_schedulers := adapter_state.get('lr_schedulers', None)) is not None:
# for config, lrs_state in zip(trainer.lr_scheduler_configs, lr_schedulers):
# config.scheduler.load_state_dict(lrs_state)
Check notice
Code scanning / CodeQL
Commented-out code Note
Copilot Autofix AI about 18 hours ago
To fix the problem, we should remove the commented-out code entirely. This will help in maintaining a clean and readable codebase. The specific lines to be removed are 264 to 270 in the file nemo/lightning/pytorch/callbacks/peft.py.
@@ -263,9 +263,2 @@
 # trainer.lightning_module.configure_optimizers()
-# if trainer.state.fn == TrainerFn.FITTING:
-# # Load optimizer
-# trainer.strategy.load_optimizer_state_dict(adapter_state)
-# # Load lr scheduler
-# if (lr_schedulers := adapter_state.get('lr_schedulers', None)) is not None:
-# for config, lrs_state in zip(trainer.lr_scheduler_configs, lr_schedulers):
-# config.scheduler.load_state_dict(lrs_state)

Changes:
MegatronStrategy do not define a consumed_samples attribute, and instead the step attribute is used.
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
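A minimal sketch of the fallback described under Changes, not code from this PR: it assumes the resume position is read from `consumed_samples` when the strategy object defines it and from the step otherwise. The helper name `resume_position`, the `global_step` parameter, and the `SimpleNamespace` stand-ins are all hypothetical and exist only for illustration.

```python
from types import SimpleNamespace


def resume_position(strategy, global_step):
    """Return (field_name, value) describing where training would resume.

    Hypothetical helper, not part of NeMo: it prefers `consumed_samples`
    when the strategy defines it and falls back to the step otherwise.
    """
    consumed = getattr(strategy, "consumed_samples", None)
    if consumed is not None:
        return "consumed_samples", int(consumed)
    return "step", int(global_step)


# Stand-ins for real strategy objects (assumptions for illustration only).
with_samples = SimpleNamespace(consumed_samples=4096)
without_samples = SimpleNamespace()

print(resume_position(with_samples, global_step=128))
print(resume_position(without_samples, global_step=128))
```

Using `getattr` with a default keeps the check tolerant of strategies that never set the attribute, which seems to be the situation the Changes note describes.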
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information