Skip to content

Latest commit

 

History

History
63 lines (41 loc) · 4.59 KB

README.md

File metadata and controls

63 lines (41 loc) · 4.59 KB

VLMs zero-to-hero

coming: january 2025...

hello

Welcome to VLMs Zero to Hero! This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.

tutorials

notebook open in colab video paper
01.01. Word2Veq: Distributed Representations of Words and Phrases and their Compositionality link soon link

roadmap

natural language processing (NLP) fundamentals

computer vision (CV) fundamentals

early vision-language models

scale and efficiency

modern vision-language models

extra

contribute and suggest more papers

Are there important papers, models, or techniques we missed? Do you have a favorite breakthrough in vision-language research that isn't listed here? We’d love to hear your suggestions!