Stable Diffusion, Segment Anything models for replacing background in generated images

This project uses the Stable Diffusion (SD) model and Meta's Segment Anything Model (SAM) to remove the background of a generated image and replace it with a new one. The two models are combined in a multi-model pipeline.

Stable Diffusion model

Hugging Face link: CompVis/stable-diffusion-v1-4

Model Link : Model

Stable Diffusion is a diffusion model that generates images from text input. It was created by researchers from CompVis, Stability AI and LAION.

The model is trained on data from the LAION-5B dataset, which contains 5.85 billion CLIP-filtered image-text pairs. In this project it generates both the image and the background to be placed behind the extracted object. We use the Stable Diffusion v1-4 checkpoint, which was initialized from the v1-2 checkpoint and fine-tuned for 225k steps at a resolution of 512x512.
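As a rough illustration of how the v1-4 checkpoint can be used to generate the image and the background, here is a minimal sketch based on the `diffusers` library. It is not taken from the notebook: the helper names `check_size` and `generate_images`, the seed, the step count and the guidance scale are all assumptions chosen for the example.

```python
MODEL_ID = "CompVis/stable-diffusion-v1-4"  # checkpoint named in this README


def check_size(height, width):
    """Stable Diffusion pipelines expect sizes divisible by 8."""
    if height % 8 != 0 or width % 8 != 0:
        raise ValueError("height and width must be divisible by 8")


def generate_images(prompts, height=512, width=512, seed=0):
    """Generate one image per prompt (hypothetical helper, illustrative defaults)."""
    check_size(height, width)
    # Heavy imports are deferred so the module can be imported without a GPU stack.
    import torch
    from diffusers import StableDiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32
    pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID, torch_dtype=dtype)
    pipe = pipe.to(device)

    images = []
    for i, prompt in enumerate(prompts):
        # A fixed seed per prompt keeps the outputs reproducible.
        generator = torch.Generator(device=device).manual_seed(seed + i)
        result = pipe(
            prompt,
            height=height,
            width=width,
            num_inference_steps=50,  # assumed value
            guidance_scale=7.5,      # assumed value
            generator=generator,
        )
        images.append(result.images[0])  # PIL.Image
    return images
```

A call such as `generate_images(["a photo of a red car", "a beach at sunset"])` would return two PIL images, one to serve as the subject image and one as the new background. The prompts here are only placeholders.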

Segment Anything (SAM)

Hugging Face link: facebook/sam-vit-base

Model Link : Model

The Segment Anything Model (SAM) produces object masks for an input image. SAM was created by researchers from Meta and is trained on 11 million images and 1.1 billion masks. The model type is SAM-ViT, which consists of a ViT encoder, a prompt encoder, a mask decoder and a neck.

The model is used for extracting the object from the SD output.
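The extraction step could look roughly like the sketch below, which uses the `SamModel` and `SamProcessor` classes from the `transformers` library. The README does not say how SAM is prompted, so prompting with a single point at the image center is purely an assumption, as are the helper names `center_point_prompt` and `extract_object_mask`.

```python
SAM_ID = "facebook/sam-vit-base"  # checkpoint named in this README


def center_point_prompt(width, height):
    """Build a single-point prompt at the image center (assumed strategy).

    The nesting is: one image, one point, (x, y) coordinates.
    """
    return [[[width // 2, height // 2]]]


def extract_object_mask(image):
    """Return a boolean H x W mask for the object under the prompt point.

    `image` is expected to be an RGB PIL.Image, e.g. an SD output.
    """
    # Heavy imports are deferred so the helper above stays lightweight.
    import torch
    from transformers import SamModel, SamProcessor

    model = SamModel.from_pretrained(SAM_ID)
    processor = SamProcessor.from_pretrained(SAM_ID)

    points = center_point_prompt(*image.size)
    inputs = processor(image, input_points=points, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # Resize the low-resolution mask logits back to the original image size.
    masks = processor.image_processor.post_process_masks(
        outputs.pred_masks.cpu(),
        inputs["original_sizes"].cpu(),
        inputs["reshaped_input_sizes"].cpu(),
    )
    # SAM proposes several candidate masks per prompt; keep the one with
    # the highest predicted IoU score.
    best = outputs.iou_scores[0, 0].argmax().item()
    return masks[0][0, best].numpy()
```

If the object is not centered, a different point or a bounding-box prompt would be needed. The center point is used here only to keep the example short.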

Steps

1. Generate the image and the new background using SD.
2. Extract the object from the generated image using SAM.
3. Merge the extracted object with the SD-generated background.
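The final merge step can be sketched as a simple mask-based composite in NumPy. This is one possible approach and not necessarily what the notebook does. The function name `composite` is hypothetical, and it assumes the object image, the background and the mask all share the same height and width.

```python
import numpy as np


def composite(foreground, background, mask):
    """Paste the masked object from `foreground` onto `background`.

    foreground, background: H x W x 3 arrays with values in 0..255.
    mask: H x W array, True (or 1) where the object is.
    """
    fg = np.asarray(foreground, dtype=np.float32)
    bg = np.asarray(background, dtype=np.float32)
    alpha = np.asarray(mask, dtype=np.float32)[..., None]  # H x W x 1

    if fg.shape != bg.shape or fg.shape[:2] != alpha.shape[:2]:
        raise ValueError("foreground, background and mask sizes must match")

    # Object pixels come from the foreground, everything else from the background.
    out = alpha * fg + (1.0 - alpha) * bg
    return np.clip(np.rint(out), 0, 255).astype(np.uint8)
```

With PIL images, `Image.fromarray(composite(np.array(obj_img), np.array(bg_img), mask))` would turn the result back into an image, where `obj_img`, `bg_img` and `mask` stand for the outputs of the earlier steps.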

References

Stable Diffusion model

@InProceedings{Rombach_2022_CVPR,
    author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
    title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2022},
    pages     = {10684-10695}
}

Segment Anything by Meta

@article{kirillov2023segment,
  title={Segment anything},
  author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C and Lo, Wan-Yen and others},
  journal={arXiv preprint arXiv:2304.02643},
  year={2023}
}

mahendran-narayanan/SD-SAM-for-Background

Folders and files

Latest commit

History

Repository files navigation

Stable Diffusion, Segment Anything models for replacing background in generated images

Stable Diffusion model

Segment Anything (SAM)

Steps

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages