1 Making our First Image: “A damn fine cup of coffee”
This chapter introduces Stable Diffusion through a creative lens, comparing its remixing of visual culture to Grandmaster Flash’s early turntable experimentation. It frames text-to-image generation as an accessible way to turn daydreams into images, while emphasizing that mastery still requires iteration, technique, and tooling. Because Stable Diffusion is open source, learners benefit from a fast-evolving ecosystem, community knowledge, and the freedom to customize the software—all achievable on consumer hardware.
Readers are guided to create their first image with CompVis Stable Diffusion using the prompt “A damn fine cup of coffee,” learning core concepts along the way. The chapter stresses generating many candidates, clarifies the difference between iterations and batch size (with limited practical speed gains from batches and higher VRAM demands), and shows how seeds enable both reproducibility and variety (default 42). It demonstrates how changing width and height—especially aspect ratio—does more than reshape the canvas; it meaningfully shifts composition and subject emphasis.
Foundational prompt-engineering habits follow: favor clear, descriptive language over poetic phrasing, supply scene context to steer composition, and specify stylistic cues (for example, “surrealist painting” or “wood etching”) to avoid the uncanny valley and better match intent. The chapter closes by showcasing the power of open source through a small code edit that disables the built-in NSFW “rick-roll” replacement, underscoring user control over model behavior. Overall, it equips readers with practical commands, mental models, and iterative tactics to move from vague ideas to images that feel intentional and personal.
Getting my imagination on the screen
Browsing an infinite library of Pulp Sci-Fi that never was.
Who knew monks were such avid readers of sci-fi?
Envisioning ancient aliens.
The initial 6 images created by our prompt: “A damn fine cup of coffee.”
Average seconds to create an image, comparing iterations and batch.
Generating 30 images at once.
Creating 6 different images with seed 12345.
Images with a 5:3 landscape aspect ratio.
Images with a 3:5 aspect ratio using the same seed.
Images with a 3:7 aspect ratio using the same seed.
Images with a 4:1 aspect ratio using the same seed.
A poetic prompt does not always yield poetic images.
A straight forward prompt yields more cups of black coffee.
Adding a scene to an image can help provide context.
Choosing a landscape aspect ratio helps display the counter.
Creating surrealistic images.
Images in the style of a wood etching.
I would say that’s a damn fine cup of coffee!
Being “Rick-rolled” by Stable Diffusion.
Summary
- Generating with Stable Diffusion is an iterative process, in which we are constantly revising our settings and prompts.
- Despite the many ways to improve images, it’s always a good idea to generate a variety of images to see if we find a particular one that stands out to us as pleasing.
- Our prompts should be clear and descriptive. Giving some context for the object we’re prompting can change the image dramatically. Describing the style of the image can further let us change the feeling of the images we’re generating.
- The aspect ratio that we use to generate an image can have a major impact on the way the image looks. Consider whether the image you want to create would look better as a square, a landscape or portrait.
- Because Stable Diffusion is open source we (as well as the entire community of users) can change and extend its behavior.
A Damn Fine Stable Diffusion Book ebook for free