
batch inputs for compute_clip_text_embedding #263

Merged: 1 commit merged into finegrain-ai:main from batch-sd on Feb 21, 2024

Conversation

@piercus (Collaborator) commented Feb 7, 2024

Context

I need this for #165, to do GPU-efficient evaluation (batch_size > 1); a sketch of what batching means here follows below. This extends #213, and it is also a baby step in the context of #255 @limiteinductive.
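As an aside, a minimal pure-torch sketch of what a batched text embedding looks like once assembled for classifier-free guidance; the shapes (77 tokens, dim 768, the usual SD 1.5 CLIP text encoder ones) are assumptions here, not taken from this PR:

```python
import torch

batch_size = 2  # two prompts evaluated in a single forward pass

# One CLIP text embedding per prompt and per negative prompt
# (assumed SD 1.5 shapes: 77 tokens, embedding dim 768).
cond = torch.randn(batch_size, 77, 768)
uncond = torch.randn(batch_size, 77, 768)

# For classifier-free guidance, the negative and positive halves are stacked
# along the batch axis, so the UNet runs once on 2 * batch_size items.
clip_text_embedding = torch.cat([uncond, cond], dim=0)
print(clip_text_embedding.shape)  # torch.Size([4, 77, 768])
```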

One question

In compute_clip_image_embedding there is a concat_batches bool; I'm torn on whether we should also put it in compute_clip_text_embedding or not. There are two contradictory points of view on this.

@piercus (Collaborator, Author) commented Feb 7, 2024

@limiteinductive I just added the corresponding changes + unit tests for SDXL.

@piercus piercus force-pushed the batch-sd branch 2 times, most recently from 0164a98 to 2a03655 Compare February 8, 2024 16:43
@deltheil (Member) commented:

> In compute_clip_image_embedding there is a concat_batches bool; I'm torn on whether we should also put it in compute_clip_text_embedding or not.

@piercus no, you can ignore it IMO. It was added mainly for documentation purposes, in the context of IP-Adapter with multiple image prompts (see #218). In that context, you want to create a longer sequence of tokens (e.g. double the size for two images); see the sketch below.
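A minimal pure-torch illustration of that difference (the shapes are hypothetical, loosely modeled on a CLIP image encoder output, and are not taken from the refiners API):

```python
import torch

# Two image-prompt embeddings, one per image (hypothetical shapes).
emb_a = torch.randn(1, 257, 1280)
emb_b = torch.randn(1, 257, 1280)

# concat_batches=True style: concatenate along the token (sequence) axis,
# producing one longer conditioning sequence for a single sample.
longer_sequence = torch.cat([emb_a, emb_b], dim=1)  # shape (1, 514, 1280)

# Plain batching instead stacks along the batch axis: one conditioning per sample.
batched = torch.cat([emb_a, emb_b], dim=0)  # shape (2, 257, 1280)
```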

@limiteinductive (Contributor) left a comment:

@deltheil lgtm

@deltheil (Member) commented:

> @deltheil lgtm

Thanks! @piercus will do the final round shortly, stay tuned.

@deltheil (Member) left a comment:

Some comments/suggestions, please take a look.

piercus added a commit to piercus/refiners that referenced this pull request Feb 20, 2024
@piercus (Collaborator, Author) commented Feb 21, 2024

Batch stability on GPU suffers from the effect below.

Source: Torch double Conv2d

In the example below, a pure PyTorch double Conv2d already leads to a 2e-6 max absolute difference between the batched and single-sample outputs.

```python
import torch
from torch.nn import Conv2d

device = "cuda:0"

def distance(x: torch.Tensor, y: torch.Tensor) -> float:
    # Max absolute difference between two tensors.
    return torch.max((x - y).abs()).item()

with torch.no_grad():
    torch.cuda.manual_seed_all(0)
    x_b2 = torch.randn(2, 4, 32, 32).to(device)  # batch of 2 inputs
    conv2d_1 = Conv2d(in_channels=4, out_channels=320, kernel_size=3, padding=1, device=device)
    conv2d_2 = Conv2d(in_channels=320, out_channels=640, kernel_size=3, padding=1, device=device)

    output_b2 = conv2d_2(conv2d_1(x_b2))       # run on the full batch
    output_b1 = conv2d_2(conv2d_1(x_b2[0:1]))  # run on the first sample alone

    print(distance(output_b2[0], output_b1[0]))
```

This outputs 2e-06.

Amplification

This batch discrepancy is then amplified by the subsequent layers:

| Level                | Error `max((x - y).abs())` |
|----------------------|----------------------------|
| Double Conv          | 2e-6                       |
| Conv + ResidualBlock | 5e-5                       |
| DownBlocks           | 2e-3                       |
| Unet                 | 1e-3                       |
| SD                   | 2e-3                       |

Details of the analysis can be found in https://gist.github.com/piercus/07d03f258907542d312c0c735445e793

Result

As a result, the batch-stability check with torch.allclose is only performed with a tolerance of 5e-3.
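A minimal sketch of that kind of check (the tensors below are stand-ins for the batched and single-sample SD outputs, not the actual test fixtures):

```python
import torch

# Stand-ins for the batched (b2) and single-sample (b1) outputs.
output_b2 = torch.randn(2, 3, 512, 512)
output_b1 = output_b2[0:1] + 1e-4  # simulate a small GPU batch discrepancy

# Loose absolute tolerance accounting for the amplified Conv2d discrepancy.
assert torch.allclose(output_b2[0:1], output_b1, atol=5e-3)
```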

piercus added a commit to piercus/refiners that referenced this pull request Feb 21, 2024
@deltheil comments on finegrain-ai#263, torch.allclose with tolerance of 5e-3, compact code following @limiteinductive suggestion
Co-authored-by: Cédric Deltheil <[email protected]>
@piercus piercus force-pushed the batch-sd branch 3 times, most recently from d3432ad to b3d53f1 Compare February 21, 2024 12:16
@piercus piercus requested a review from deltheil February 21, 2024 12:17
@deltheil previously approved these changes Feb 21, 2024

@deltheil (Member) left a comment:

See final nits. Please squash the extra commit afterwards. Thanks!

(Two review threads on tests/e2e/test_diffusion.py, since outdated and resolved.)
@piercus (Collaborator, Author) commented Feb 21, 2024

@deltheil please review.

@piercus piercus requested a review from deltheil February 21, 2024 13:58
@deltheil deltheil added run-ci Run CI and removed run-ci Run CI labels Feb 21, 2024
@deltheil deltheil added run-ci Run CI and removed run-ci Run CI labels Feb 21, 2024
@deltheil deltheil merged commit d199cd4 into finegrain-ai:main Feb 21, 2024
2 checks passed
deltheil added a commit that referenced this pull request Feb 21, 2024
deltheil added a commit that referenced this pull request Feb 21, 2024
rodSiry pushed a commit to rodSiry/refiners that referenced this pull request Feb 28, 2024