
Caformer finetuning. Adapter/LoRa? #7

Open
kacwin opened this issue Jan 10, 2023 · 5 comments

@kacwin

kacwin commented Jan 10, 2023

Hey,
great job with this repo. CAFormer with 100M parameters is really powerful, but I am struggling with finetuning it due to hardware limitations. Have you already run experiments with something like adapter finetuning or LoRA? At first glance, it looks like one would need to rewrite a lot of the code for this.

@yuweihao
Collaborator

Hi @kacwin ,

Many thanks for your attention. I did not conduct experiments with adapter finetuning or LoRA. Regarding the hardware limitations: since none of the models in the paper use Batch Norm, you can set --grad-accum-steps to accumulate gradients over several smaller batches while keeping the effective batch size.
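
For reference, a minimal sketch of what gradient accumulation looks like in a plain PyTorch training loop (the toy model, loader, and accum_steps value are illustrative, not the repo's actual training script):

```python
import torch
import torch.nn as nn

# toy setup just to make the sketch runnable
model = nn.Linear(16, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()
loader = [(torch.randn(8, 16), torch.randint(0, 2, (8,))) for _ in range(8)]

accum_steps = 4  # effective batch size = 8 * 4 = 32

optimizer.zero_grad()
for step, (inputs, targets) in enumerate(loader):
    # scale the loss so the accumulated gradient matches one large batch
    loss = criterion(model(inputs), targets) / accum_steps
    loss.backward()
    if (step + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```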

@kacwin
Author

kacwin commented Jan 12, 2023

Thanks for the info,
using gradient accumulation would be kind of a last resort; there is simply too much data. I will try some LoRA experiments in the near future and can give an update on that if you want.

@yuweihao
Collaborator

Many thanks!

@kacwin
Author

kacwin commented Feb 13, 2023

Hello,
we did some experiments with LoRA finetuning.

  • We started with Caformer_b36_384 and did linear probing (freezing the network aside from the MLP head) on a classification task with domain shift ---> accuracy ~30%
  • We repeated the experiment with full finetuning ---> accuracy ~65%
  • Lastly, we implemented parallel LoRA layers for all linear layers in CAFormer (see the sketch below). We then froze the original linear weights and trained only the LoRA layers as well as the Conv2d layers (in the downsampling stages and in the SepConv blocks). This left us with 10M trainable parameters, compared to 100M at full finetuning ---> accuracy 61%

So with a fraction of the trainable parameters, we achieved similar results. However, training time and GPU memory usage sadly did not decrease that much (maybe by a factor of 0.66). I think there is some potential here :)
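
A minimal sketch of the kind of parallel LoRA wrapper described above, assuming plain PyTorch (the class name, rank, scaling, and the add_lora helper are illustrative, not our exact implementation):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """nn.Linear with a parallel low-rank branch; only the branch is trainable."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the original linear weights
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init: starts as identity to the base layer
        self.scale = alpha / rank

    def forward(self, x):
        # frozen path plus low-rank update: W x + scale * B A x
        return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)

def add_lora(module: nn.Module, rank: int = 8):
    """Recursively replace every nn.Linear in a model with a LoRALinear wrapper."""
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            setattr(module, name, LoRALinear(child, rank=rank))
        else:
            add_lora(child, rank=rank)
```

With all nn.Linear layers wrapped like this, passing only the parameters with requires_grad=True to the optimizer trains just the LoRA branches plus whatever else (e.g. the Conv2d layers) one deliberately leaves unfrozen.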

@yuweihao
Collaborator

Thanks for sharing these valuable experiment results.
