How to do model fine tuning? #12

colemanhindes · 2024-03-15T19:33:02Z

Really cool project! Enjoy the paper and have had fun testing it out. Will instructions on fine tuning be released?

Thanks for your time

abdulfatir · 2024-03-15T20:49:12Z

@colemanhindes Thanks for your interest. We are planning to release the training scripts soon but due to some other engagements there's no ETA yet. In the meantime, @canerturkmen and @shchur are working towards integrating Chronos into AutoGluon-TimeSeries (autogluon/autogluon#3978) and they're also planning to offer ways of fine-tuning the models.

Saeufer · 2024-03-30T13:57:19Z

+1 for this, if possible please mind #22 too for some custom data. Thanks!

HALF111 · 2024-04-11T07:59:53Z

+1, looking forward to releasing the scripts of training and fine-tuning!

TPF2017 · 2024-04-15T11:07:12Z

+1, looking forward to releasing the scripts of training and fine-tuning!

0xrushi · 2024-05-01T13:47:02Z

I caught a glimpse of it and noticed it's utilizing a torch.nn model. I've put together this notebook for training/finetuning. Could someone verify if it's set up correctly? The losses seem unusual, but I suspect it's due to the dataset being quite small and my use of:

sequence_length = 10
prediction_length = 5

notebook: here

iganggang · 2024-05-03T10:51:24Z

+1 for this, if possible please mind #22 too for some custom data. Thanks!

lostella · 2024-05-09T16:00:55Z

Training and fine-tuning script was added in #63, together with configurations that were used for pretraining the models on HuggingFace. We still need to add proper documentation, but roughly speaking:

required dependencies can be installed with pip install ".[training]" (or pip install "chronos[training] @ git+https://github.com/amazon-science/chronos-forecasting.git"
python scripts/training/train.py --help lists all available options
the config files in .scripts/training/config can be adapted by
- changing data files to other GluonTS-compatible files (arrow format recommended for efficiency, but parquet and json lines also supported)
- pointing to Chronos models (instead of the original T5), setting random_init: false, and adjusting learning rate and number of steps for fine-tuning

Happy training! cc @colemanhindes @Saeufer @HALF111 @TPF2017 @0xrushi @iganggang

lostella · 2024-05-10T14:38:39Z

More detailed examples at: https://github.com/amazon-science/chronos-forecasting/tree/main/scripts

Alonelymess · 2024-05-30T14:48:42Z

I get this error when training chronos-t5-small:
ValueError: --tf32 requires Ampere or a newer GPU arch, cuda>=11 and torch>=1.7

abdulfatir · 2024-05-30T22:54:34Z

@Alonelymess that means your GPU does not support TF32 floating point format. Please run training/fine-tuning with the --no-tf32 flag or set tf32 to false in your yaml config.

teshnizi · 2024-06-04T22:12:26Z

I only have a single timeseries, and I want to do forecasting on it. Does it make sense to do fine-tuning in this case?

I was thinking maybe I could split the data chronologically (use data from 2022 to 2023 for training and data from 2023 to 2024 for testing), but I'm not sure if that makes sense.

lostella · 2024-06-05T07:24:16Z

@teshnizi answered you in #98

sagivphilipp · 2024-12-23T11:10:39Z

@lostella does the training script #63 also support the Chronos-Bolt family of models?

lostella · 2024-12-23T11:46:29Z

@sagivphilipp no, that will need to be updated in order to support also the Bolt models. I cannot give an ETA for that at the moment, but we will update these FAQ threads when that happens.

MoradLaglil · 2025-01-12T19:14:49Z

Hi I hope you are doing well. I wanted to ask which method you used to fine-tune Chronos on specific data. Did you fine-tune all the model parameters, or did you use a parameter-efficient fine-tuning method (e.g Lora) ? Thank you!

lostella · 2025-01-13T14:00:21Z

@moradMIRO in the fine-tuning experiments presented in the Chronos paper, as well as in the code in this repository and the fine-tuning that AutoGluon offers, all model parameters are fine-tuned without using parameter-efficient tuning (LoRA).

ammahmoudi · 2025-02-13T16:45:57Z

I tried to fine tune using the training scripts but after fine tuning i got high value inference results.my data is 6,6 windows and is about 1811 rows. i set the train and test seed fixed to 2021.is there someting that i missed? some results after finetuning:
inputs,ground_truth,predictions
"174, 173, 173, 172, 170, 169","16 9, 168, 167, 166, 166, 165","6014.1665, 6014.1665, 6014.1665, 6014.1665, 6014.1665, 6014.1665"
"173, 173, 172, 170, 169, 169","168, 167, 166, 166, 165, 165","5985.0, 5985.0, 5985.0, 5985.0, 5985.0, 5985.0"
"173, 172, 170, 169, 169, 168","167, 166, 166, 165, 165, 165","5955.8335, 5955.8335, 5955.8335, 5955.8335, 5955.8335, 5955.8335".
saved config.json: {
"_name_or_path": "amazon/chronos-t5-base",
"architectures": [
"T5ForConditionalGeneration"
],
"chronos_config": {
"context_length": 6,
"eos_token_id": 1,
"model_type": "seq2seq",
"n_special_tokens": 2,
"n_tokens": 4096,
"num_samples": 20,
"pad_token_id": 0,
"prediction_length": 6,
"temperature": 1.0,
"tokenizer_class": "MeanScaleUniformBins",
"tokenizer_kwargs": {
"high_limit": 500,
"low_limit": 35
},
"top_k": 50,
"top_p": 1.0,
"use_eos_token": true
},
"classifier_dropout": 0.0,
"d_ff": 3072,
"d_kv": 64,
"d_model": 768,
"decoder_start_token_id": 0,
"dense_act_fn": "relu",
"dropout_rate": 0.1,
"eos_token_id": 1,
"feed_forward_proj": "relu",
"initializer_factor": 0.05,
"is_encoder_decoder": true,
"is_gated_act": false,
"layer_norm_epsilon": 1e-06,
"model_type": "t5",
"n_positions": 512,
"num_decoder_layers": 12,
"num_heads": 12,
"num_layers": 12,
"pad_token_id": 0,
"relative_attention_max_distance": 128,
"relative_attention_num_buckets": 32,
"torch_dtype": "float32",
"transformers_version": "4.47.1",
"use_cache": true,
"vocab_size": 4096
}
saved training_info.json:
{
"training_config": {
"training_data_paths": "['./data/formatted/6_6/596-ws-training.arrow']",
"probability": "[1.0]",
"context_length": 6,
"prediction_length": 6,
"min_past": 6,
"max_steps": 10000,
"save_steps": 1000,
"log_steps": 500,
"per_device_train_batch_size": 32,
"learning_rate": 0.001,
"optim": "adamw_torch_fused",
"shuffle_buffer_length": 100000,
"gradient_accumulation_steps": 1,
"model_id": "amazon/chronos-t5-base",
"model_type": "seq2seq",
"random_init": false,
"tie_embeddings": true,
"output_dir": "./logs/logs_2025-02-12_15-14-31/chronos-t5-base",
"tf32": true,
"torch_compile": true,
"tokenizer_class": "MeanScaleUniformBins",
"tokenizer_kwargs": "{'low_limit': 35,'high_limit': 500}",
"n_tokens": 4096,
"n_special_tokens": 2,
"pad_token_id": 0,
"eos_token_id": 1,
"use_eos_token": true,
"lr_scheduler_type": "linear",
"warmup_ratio": 0.0,
"dataloader_num_workers": 1,
"max_missing_prop": 0.9,
"num_samples": 20,
"temperature": 1.0,
"top_k": 50,
"top_p": 1.0,
"seed": 2021
},
"job_info": {
"cuda_available": true,
"device_count": 2,
"device_names": {
"0": "NVIDIA GeForce RTX 4080 SUPER",
"1": "NVIDIA GeForce RTX 4080 SUPER"
},
"mem_info": {
"0": [
10790895616,
17170956288
],
"1": [
12411559936,
17170956288
]
},
"torchelastic_launched": false,
"python_version": "3.12.8 (main, Dec 4 2024, 08:54:12) [GCC 11.4.0]",
"torch_version": "2.4.1+cu121",
"numpy_version": "1.26.4",
"gluonts_version": "0.16.0",
"transformers_version": "4.47.1",
"accelerate_version": "0.34.2"
}
}

lostella added the FAQ Frequently asked question label Mar 18, 2024

lostella pinned this issue Mar 18, 2024

lostella changed the title ~~Fine tuning?~~ How to do model fine tuning? Mar 19, 2024

abdulfatir unpinned this issue Mar 26, 2024

abdulfatir mentioned this issue Mar 26, 2024

How to finetune on custom loss function? #29

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to do model fine tuning? #12

How to do model fine tuning? #12

colemanhindes commented Mar 15, 2024

abdulfatir commented Mar 15, 2024

Saeufer commented Mar 30, 2024

HALF111 commented Apr 11, 2024

TPF2017 commented Apr 15, 2024

0xrushi commented May 1, 2024

iganggang commented May 3, 2024

lostella commented May 9, 2024 •

edited

Loading

lostella commented May 10, 2024

Alonelymess commented May 30, 2024

abdulfatir commented May 30, 2024

teshnizi commented Jun 4, 2024

lostella commented Jun 5, 2024

sagivphilipp commented Dec 23, 2024

lostella commented Dec 23, 2024

MoradLaglil commented Jan 12, 2025

lostella commented Jan 13, 2025

ammahmoudi commented Feb 13, 2025

How to do model fine tuning? #12

How to do model fine tuning? #12

Comments

colemanhindes commented Mar 15, 2024

abdulfatir commented Mar 15, 2024

Saeufer commented Mar 30, 2024

HALF111 commented Apr 11, 2024

TPF2017 commented Apr 15, 2024

0xrushi commented May 1, 2024

iganggang commented May 3, 2024

lostella commented May 9, 2024 • edited Loading

lostella commented May 10, 2024

Alonelymess commented May 30, 2024

abdulfatir commented May 30, 2024

teshnizi commented Jun 4, 2024

lostella commented Jun 5, 2024

sagivphilipp commented Dec 23, 2024

lostella commented Dec 23, 2024

MoradLaglil commented Jan 12, 2025

lostella commented Jan 13, 2025

ammahmoudi commented Feb 13, 2025

lostella commented May 9, 2024 •

edited

Loading