Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to load a fine-tuned model for predictions #183

Open
marcopeix opened this issue Feb 6, 2025 · 4 comments
Open

How to load a fine-tuned model for predictions #183

marcopeix opened this issue Feb 6, 2025 · 4 comments
Labels
bug Something isn't working

Comments

@marcopeix
Copy link

Once fine-tuning is done, I know the model is saved according to the path specified here, in the YAML file.

However, I don't see any checkpoints there. I only see .log files and a Hydra yaml.

Did I miss something? I used the default YAML config with no modifications.

In my case, the model should be saved in outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune.

@marcopeix marcopeix added the bug Something isn't working label Feb 6, 2025
@chenghaoliu89
Copy link
Contributor

Hi @marcopeix , it is supposed to be stored in outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune/.../checkpoints folder. If you could not find it, please check if there is any error report from the fine-tuning process

@marcopeix
Copy link
Author

HI @chenghaoliu89 , there are no errors during fine-tuning and the process runs fine. When I run !ls -la outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune/, I get this output:

total 20
drwxr-xr-x 4 root root 4096 Feb  7 20:52 .
drwxr-xr-x 3 root root 4096 Feb  7 20:52 ..
drwxr-xr-x 2 root root 4096 Feb  7 20:52 .hydra
drwxr-xr-x 3 root root 4096 Feb  7 20:52 logs
-rw-r--r-- 1 root root  232 Feb  7 20:52 train.log

What am I missing?

Thanks for your help!

@chenghaoliu89
Copy link
Contributor

Hi @marcopeix could you try the fine-tuning example case in readme.md first and see if you can get the model checkpoint file.

@liyujia011025
Copy link

您好,您能否先尝试一下 readme.md 中的微调示例案例,看看是否可以获取模型检查点文件。

Hello expert:
May I ask if you have made any minor adjustments using the process and code in the official README file? It's strange that after fine-tuning with my own data according to this operation, the effect is even worse than not fine-tuning at all. Did you make any specific changes to hyperparameter settings or other content during the fine-tuning process? Also, may I ask if you only fine tuned certain layers or all layers during the fine-tuning process?
Looking forward to your reply, thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants