How to load a fine-tuned model for predictions #183

marcopeix · 2025-02-06T16:56:44Z

Once fine-tuning is done, I know the model is saved according to the path specified here, in the YAML file.

However, I don't see any checkpoints there. I only see .log files and a Hydra yaml.

Did I miss something? I used the default YAML config with no modifications.

In my case, the model should be saved in outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune.

The text was updated successfully, but these errors were encountered:

chenghaoliu89 · 2025-02-07T14:09:46Z

Hi @marcopeix , it is supposed to be stored in outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune/.../checkpoints folder. If you could not find it, please check if there is any error report from the fine-tuning process

marcopeix · 2025-02-07T21:41:26Z

HI @chenghaoliu89 , there are no errors during fine-tuning and the process runs fine. When I run !ls -la outputs/finetune/moirai_1.0_R_small/store_finetune/store_sales_finetune/, I get this output:

total 20
drwxr-xr-x 4 root root 4096 Feb  7 20:52 .
drwxr-xr-x 3 root root 4096 Feb  7 20:52 ..
drwxr-xr-x 2 root root 4096 Feb  7 20:52 .hydra
drwxr-xr-x 3 root root 4096 Feb  7 20:52 logs
-rw-r--r-- 1 root root  232 Feb  7 20:52 train.log

What am I missing?

Thanks for your help!

chenghaoliu89 · 2025-02-08T08:50:33Z

Hi @marcopeix could you try the fine-tuning example case in readme.md first and see if you can get the model checkpoint file.

liyujia011025 · 2025-02-16T13:08:28Z

您好，您能否先尝试一下 readme.md 中的微调示例案例，看看是否可以获取模型检查点文件。

Hello expert:
May I ask if you have made any minor adjustments using the process and code in the official README file? It's strange that after fine-tuning with my own data according to this operation, the effect is even worse than not fine-tuning at all. Did you make any specific changes to hyperparameter settings or other content during the fine-tuning process? Also, may I ask if you only fine tuned certain layers or all layers during the fine-tuning process?
Looking forward to your reply, thank you very much!

marcopeix added the bug Something isn't working label Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to load a fine-tuned model for predictions #183

How to load a fine-tuned model for predictions #183

marcopeix commented Feb 6, 2025

chenghaoliu89 commented Feb 7, 2025

marcopeix commented Feb 7, 2025

chenghaoliu89 commented Feb 8, 2025

liyujia011025 commented Feb 16, 2025

How to load a fine-tuned model for predictions #183

How to load a fine-tuned model for predictions #183

Comments

marcopeix commented Feb 6, 2025

chenghaoliu89 commented Feb 7, 2025

marcopeix commented Feb 7, 2025

chenghaoliu89 commented Feb 8, 2025

liyujia011025 commented Feb 16, 2025