Stable Diffusion is a text-to-image machine learning model released by Stability AI. By default it generates images from text, but because the model is open source, it can do much more. In this video I explain how Stable Diffusion works at a high level, briefly talk about how it differs from other diffusion-based models, compare it to DALL-E 2, and mess around with the code.
Outline
0:00 - Intro
1:58 - Stable Diffusion Explained
4:32 - Why The Buzz?
6:46 - Starting The Code
12:45 - DALL-E 2 Comparison
20:15 - Custom Pipeline
32:01 - Making Diffusion Videos
37:30 - Generating Similar Images
40:23 - Image-to-Image
53:42 - Wrapping it Up
Social Media
YouTube - https://youtube.com/c/EdanMeyer
Twitter - https://twitter.com/ejmejm1
Sources
Colab Notebook - https://colab.research.google.com/drive/1_kbRZPTjnFgViPrmGcUsaszEdYa8XTpq?usp=sharing
Blog post - https://stability.ai/blog/stable-diffusion-public-release
Source Code - https://github.com/CompVis/stable-diffusion
Hugging Face Models - https://huggingface.co/spaces/stabilityai/stable-diffusion
Paper - https://arxiv.org/pdf/2112.10752.pdf