ImagetoPrompts

ImagetoPrompts is our solution for the Kaggle competition Stable Diffusion - Image to Prompts. It currently ranks 237/1231, with a public score of 0.58145 and a private score of 0.57859.

Project Structure

|___data_output
    |_____output.csv
|___temp_test
|___test_models
    |____test_clip_interrogator.py
    |____test_coca.py
    |____test_ofa.py
    |____test_vit_gpt2.py
    |____test.ipynb
|___cal_cv_model.py
|___main.ipynb
|___README.md

data_output stores the generated output.
temp_test holds scratch tests only and can be ignored.
test_models uses different pretrained models to generate prompts.
cal_cv_model.py finds the best weights for combining the features.
main.ipynb is the notebook uploaded to the competition.
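
The weight search described for cal_cv_model.py can be illustrated with a small sketch. This is not the code in the repository: it assumes each model outputs one embedding per sample, that the quality measure is mean cosine similarity against target embeddings, and that a coarse grid over non-negative weights summing to 1 is sufficient. The names `search_weights`, `mean_cosine`, and `combine` are hypothetical.

```python
import math
from itertools import product


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def combine(preds, weights):
    """Weighted sum of the per-model embeddings for one sample."""
    dim = len(preds[0])
    return [sum(w * p[i] for w, p in zip(weights, preds)) for i in range(dim)]


def mean_cosine(model_preds, targets, weights):
    """Mean cosine similarity of the blended predictions.

    model_preds[m][n] is model m's embedding for sample n (assumed layout).
    """
    scores = []
    for n, target in enumerate(targets):
        blended = combine([mp[n] for mp in model_preds], weights)
        scores.append(cosine(blended, target))
    return sum(scores) / len(scores)


def search_weights(model_preds, targets, steps=10):
    """Grid search over weights that are non-negative and sum to 1."""
    n_models = len(model_preds)
    best_w, best_s = None, -1.0
    for combo in product(range(steps + 1), repeat=n_models):
        if sum(combo) != steps:
            continue  # keep only points on the simplex
        w = [c / steps for c in combo]
        s = mean_cosine(model_preds, targets, w)
        if s > best_s:
            best_w, best_s = w, s
    return best_w, best_s
```

The grid grows quickly with the number of models, so for more than three or four models a numerical optimizer would likely be a better fit than this exhaustive search.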

Project Core

Generate images from prompts using Stable Diffusion.
Train our own ViT model on this dataset.
Fine-tune the pretrained BLIP, CLIP, and OFA models on the dataset.
Apply data augmentation at prediction time.
Combine features from the models to improve the score. (Weighted accumulation ✅, MLP [TODO])
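
Prediction-time augmentation, as listed above, can be sketched as averaging the embeddings a model produces for several views of the same image. The example below is an illustration under assumptions, not the notebook's code: the image is a plain list of pixel rows, the only augmentation is a horizontal flip, and `toy_embed` is a hypothetical stand-in for a real model's embedding function.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def hflip(image):
    """Horizontally flip an image stored as rows of pixel values."""
    return [list(reversed(row)) for row in image]


def predict_with_tta(image, embed_fn, augmentations):
    """Average normalized embeddings over the original and augmented views."""
    views = [image] + [aug(image) for aug in augmentations]
    embeddings = [l2_normalize(embed_fn(v)) for v in views]
    dim = len(embeddings[0])
    mean = [sum(e[i] for e in embeddings) / len(embeddings) for i in range(dim)]
    return l2_normalize(mean)


def toy_embed(image):
    """Hypothetical embedding: sums of the left and right pixel columns."""
    left = float(sum(row[0] for row in image))
    right = float(sum(row[-1] for row in image))
    return [left, right]


if __name__ == "__main__":
    image = [[3, 0], [3, 0]]
    print(predict_with_tta(image, toy_embed, [hflip]))
```

Normalizing each view before averaging keeps any single view from dominating the mean, which seems a reasonable choice when the final score depends only on direction, as with cosine similarity.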

Reference

OFA
CoCa
pharmapsychotic/interrogator
Other public notebooks and code from Kaggle.

  @misc{stable-diffusion-image-to-prompts,
      author = {Ashley Chow, inversion, Will Cukierski},
      title = {Stable Diffusion - Image to Prompts},
      publisher = {Kaggle},
      year = {2023},
      url = {https://kaggle.com/competitions/stable-diffusion-image-to-prompts}
  }
