-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
resume卡住 #3671
Comments
Same Question. |
After checking source code of mmengine, I found that they just called if self._iter > 0:
print_log(
f'Advance dataloader {self._iter} steps to skip data '
'that has already been trained',
logger='current',
level=logging.WARNING)
for _ in range(self._iter):
next(self.dataloader_iterator) In other words, "-- resume" will load data like regular training, but discard all of this data before reaching the specified iteration. Therefore, the time required for resuming will not be much faster than starting a new training session. |
I discovered that using a lower version of
|
I think this is the cause of the problem. Here's the PR. open-mmlab/mmengine#1471 |
@chtzs Thanks! |
I don't understand how to solve it,can you tell me?Extremely thanks! |
Just comment out these lines. |
@Saillxl Solution could be found in this issue:open-mmlab/mmengine#1520 |
Hi, will degrade the mmengine version affect the model's training performance? |
resume模型时卡住也不报错,重新开始训练是可以的。将num_workers设置为1也没用
The text was updated successfully, but these errors were encountered: