Stable Diffusion is based on the latent diffusion model of Rombach et al. (2022), which runs the diffusion process in a compressed latent space rather than directly on pixels. This cuts compute enough to generate high-resolution images on a single consumer GPU. Its open release spawned a large ecosystem of tools and fine-tuned variants.
It is the canonical example of open-weight image generation, in contrast to closed systems such as DALL·E.
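
To give a rough sense of why working in latent space saves compute, the sketch below compares the number of values in a pixel-space image with the number in its latent representation. The 8x spatial downsampling factor and the 4 latent channels are assumptions that match the commonly cited Stable Diffusion v1 configuration; they are not stated in the text above. The function names are hypothetical, and element count is only a proxy for compute, not a measurement.

```python
# Rough size comparison between pixel space and latent space.
# Assumptions (not from the text above): the autoencoder downsamples
# each spatial dimension by 8 and produces 4 latent channels.

def latent_shape(height, width, downsample=8, latent_channels=4):
    """Return (height, width, channels) of the latent for a given image size."""
    return (height // downsample, width // downsample, latent_channels)


def element_count(height, width, channels):
    """Number of scalar values in a tensor of the given shape."""
    return height * width * channels


# A 512x512 RGB image in pixel space.
pixel_elems = element_count(512, 512, 3)

# The corresponding latent under the assumed configuration.
lat_h, lat_w, lat_c = latent_shape(512, 512)
latent_elems = element_count(lat_h, lat_w, lat_c)

reduction = pixel_elems / latent_elems

print(f"pixel values:  {pixel_elems}")
print(f"latent values: {latent_elems}")
print(f"reduction:     {reduction:.0f}x")
```

Under these assumptions the denoising network operates on 16,384 values per step instead of 786,432, a 48-fold reduction in the size of the tensor being processed. Actual speed and memory savings depend on the network architecture and are not captured by this count.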