
[Flux] Add advanced training script + support textual inversion inference #9434

Merged

Conversation

@linoytsaban (Collaborator) commented on Sep 13, 2024:

This PR adds an advanced version of the DreamBooth LoRA Flux script, with an accompanying update to FluxPipeline (a usage sketch follows the list below):

  • adds an advanced script with a pivotal tuning feature for the CLIP & T5 encoders

    • --train_text_encoder_ti enables CLIP pivotal tuning
    • --enable_t5_ti adds T5 to the mix
    • --train_text_encoder_ti_frac the fraction of epochs to train the embeddings for (when using CLIP only)
    • --train_transformer_frac the fraction of epochs to train the transformer for; train_transformer_frac==0 triggers a "pure textual inversion" run (i.e. "classical" textual inversion, with no optimization of the transformer LoRA layers)
    • --initializer_token the token used to initialize the textual inversion embeddings (random initialization by default)
    • --lora_blocks the blocks/layers to apply LoRA training to

  • modifies FluxPipeline (and related pipelines) to allow textual inversion inference (which is also required by LoRAs trained with pivotal tuning)
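
A minimal usage sketch (not taken from this PR) of how these flags might be combined when launching the advanced script; the script filename, model id, dataset path, prompt, and output directory below are illustrative assumptions:

```python
# Hedged sketch: launch the advanced DreamBooth LoRA Flux script with the new
# pivotal-tuning flags. All paths, the model id, and the prompt are placeholders.
import subprocess

subprocess.run(
    [
        "accelerate", "launch",
        "train_dreambooth_lora_flux_advanced.py",  # assumed script filename
        "--pretrained_model_name_or_path=black-forest-labs/FLUX.1-dev",
        "--instance_data_dir=./dog",               # placeholder dataset
        "--instance_prompt=a photo of TOK dog",    # "TOK" stands in for the new token(s)
        "--train_text_encoder_ti",                 # CLIP pivotal tuning
        "--enable_t5_ti",                          # also train T5 embeddings
        "--train_text_encoder_ti_frac=0.5",        # train embeddings for half the epochs
        "--train_transformer_frac=1.0",            # 0 would mean pure textual inversion
        "--initializer_token=dog",                 # init embeddings from an existing token
        "--output_dir=flux-lora-pivotal",          # placeholder output path
    ],
    check=True,
)
```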

Motivation: allow for fast iteration and experimental features. I think it'd be good to fork the canonical script and bring the above changes into the advanced folder, in a similar manner to what we have for SDXL.

cc @apolinario

linoytsaban and others added 28 commits August 12, 2024 17:30
@linoytsaban (Collaborator, Author) commented:

@sayakpaul @apolinario what do you think about closing #9160 and moving the changes here, as I suggest above?

@sayakpaul (Member) commented:

I would prefer #9160 as it helps to review the changes to the canonical script in isolation. Would that work for you?

@linoytsaban linoytsaban requested a review from sayakpaul October 14, 2024 19:26
@sayakpaul (Member) left a comment:


Thanks!

@sayakpaul sayakpaul requested a review from yiyixuxu October 15, 2024 01:19
@sayakpaul (Member) commented:

@yiyixuxu could you also give the changes in the pipelines a look? It's just about adding TextualInversionLoaderMixin so that we can enable pivotal tuning on Flux.
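
For context, a rough sketch of the inference pattern this mixin enables; the repo id, weight filename, and token name below are placeholders and not part of this PR:

```python
# Hedged sketch of textual-inversion inference on Flux once the mixin is in place.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

# LoRA weights produced by pivotal tuning (placeholder repo id).
pipe.load_lora_weights("your-username/your-flux-pivotal-lora")

# The learned embeddings are stored separately and loaded via the new mixin
# (placeholder filename and token name).
pipe.load_textual_inversion(
    "your-username/your-flux-pivotal-lora",
    weight_name="learned_embeds.safetensors",
    token="<s0>",
)

image = pipe("a photo of <s0> dog", num_inference_steps=28).images[0]
image.save("flux_ti.png")
```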

@yiyixuxu (Collaborator) left a comment:


thanks!

@apolinario (Collaborator) left a comment:


Let's go! 🚀

@linoytsaban linoytsaban merged commit 9a7f824 into huggingface:main Oct 17, 2024
15 checks passed
@linoytsaban linoytsaban deleted the dreambooth-lora-flux-exploration branch November 26, 2024 10:18
sayakpaul added a commit that referenced this pull request Dec 23, 2024
…ence (#9434)

* add ostris trainer to README & add cache latents of vae

* add ostris trainer to README & add cache latents of vae

* style

* readme

* add test for latent caching

* add ostris noise scheduler
https://github.com/ostris/ai-toolkit/blob/9ee1ef2a0a2a9a02b92d114a95f21312e5906e54/toolkit/samplers/custom_flowmatch_sampler.py#L95

* style

* fix import

* style

* fix tests

* style

* --change upcasting of transformer?

* update readme according to main

* add pivotal tuning for CLIP

* fix imports, encode_prompt call,add TextualInversionLoaderMixin to FluxPipeline for inference

* TextualInversionLoaderMixin support for FluxPipeline for inference

* move changes to advanced flux script, revert canonical

* add latent caching to canonical script

* revert changes to canonical script to keep it separate from #9160

* revert changes to canonical script to keep it separate from #9160

* style

* remove redundant line and change code block placement to align with logic

* add initializer_token arg

* add transformer frac for range support from pure textual inversion to the orig pivotal tuning

* support pure textual inversion - wip

* adjustments to support pure textual inversion and transformer optimization in only part of the epochs

* fix logic when using initializer token

* fix pure_textual_inversion_condition

* fix ti/pivotal loading of last validation run

* remove embeddings loading for ti in final training run (to avoid adding huggingface hub dependency)

* support pivotal for t5

* adapt pivotal for T5 encoder

* adapt pivotal for T5 encoder and support in flux pipeline

* t5 pivotal support + support fo pivotal for clip only or both

* fix param chaining

* fix param chaining

* README first draft

* readme

* readme

* readme

* style

* fix import

* style

* add fix from #9419

* add to readme, change function names

* te lr changes

* readme

* change concept tokens logic

* fix indices

* change arg name

* style

* dummy test

* revert dummy test

* reorder pivoting

* add warning in case the token abstraction is not the instance prompt

* experimental - wip - specific block training

* fix documentation and token abstraction processing

* remove transformer block specification feature (for now)

* style

* fix copies

* fix indexing issue when --initializer_concept has different amounts

* add if TextualInversionLoaderMixin to all flux pipelines

* style

* fix import

* fix imports

* address review comments - remove necessary prints & comments, use pin_memory=True, use free_memory utils, unify warning and prints

* style

* logger info fix

* make lora target modules configurable and change the default

* make lora target modules configurable and change the default

* style

* make lora target modules configurable and change the default, add notes to readme

* style

* add tests

* style

* fix repo id

* add updated requirements for advanced flux

* fix indices of t5 pivotal tuning embeddings

* fix path in test

* remove `pin_memory`

* fix filename of embedding

* fix filename of embedding

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>