-
Notifications
You must be signed in to change notification settings - Fork 48
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Unifies SD pipeline APIs, adds sd3 support, punet integration (#706)
- Introduces a new sd_pipeline.py that handles inference for sd1.5, sd2.1, sdxl, sdxl-turbo, sd3. The pipeline is a child of the new pipeline_base.py that provides a comprehensive starting point to bringing up new pipelines. - Generally moves SDXL away from the "scheduled unet" approach, instead compiling small scheduler models that fit around a standalone unet module. - Reworks pipeline API to enable deployment / compatibility APIs - Adds multi-device pipelining support to SD pipeline - Carries flag updates for key targets - file management improvements - integrates sharktank int8 partitioned unet. Signed-off-by: aviator19941 <[email protected]> Signed-off-by: monorimet <[email protected]> Co-authored-by: Ian <[email protected]> Co-authored-by: dan <[email protected]> Co-authored-by: IanNod <[email protected]> Co-authored-by: aviator19941 <[email protected]> Co-authored-by: saienduri <[email protected]>
- Loading branch information
1 parent
4f5f31f
commit e46a2a2
Showing
61 changed files
with
10,347 additions
and
3,135 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -28,3 +28,7 @@ wheelhouse | |
*.safetensors | ||
*.gguf | ||
*.vmfb | ||
*.mlir | ||
*.npy | ||
*.png | ||
*tmp* |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,11 +1,16 @@ | ||
protobuf | ||
sentencepiece | ||
shark_turbine | ||
gguf | ||
transformers==4.37.1 | ||
torchsde | ||
accelerate | ||
diffusers @ git+https://github.com/nod-ai/[email protected] | ||
peft | ||
diffusers @ git+https://github.com/nod-ai/[email protected] | ||
brevitas @ git+https://github.com/Xilinx/brevitas.git@6695e8df7f6a2c7715b9ed69c4b78157376bb60b | ||
# turbine tank downloading/uploading | ||
azure-storage-blob | ||
# microsoft/phi model | ||
einops | ||
pytest | ||
scipy | ||
shark-turbine @ git+https://github.com/iree-org/iree-turbine.git@main | ||
-e git+https://github.com/nod-ai/sharktank.git@main#egg=sharktank&subdirectory=sharktank |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
169 changes: 169 additions & 0 deletions
169
models/turbine_models/custom_models/llama_argmax_td_spec.mlir
Large diffs are not rendered by default.
Oops, something went wrong.
Oops, something went wrong.