Prompting Image Generation Clearly

From Reading Images to Making Them 🎯

Reading images is one half of the job. The other half is making them. When you ask AI to generate an image, the model is guessing at thousands of choices you didn't make: where the camera sits, who's in the shot, what time of day it is, what's lying on the desk in the background. Leave those choices to chance and you get a generic, vaguely-stock-photo result that doesn't fit the slide, the post, or the doc. This unit gives you a structure for making those choices on purpose.

Apply the Image Prompt Anatomy 🦴

The Image Prompt Anatomy is a five-part checklist for what to put into an image prompt: Subject, Style, Composition, Mood/Lighting, and Constraints. If any one is missing, the model fills the gap with whatever its training data suggests, which is usually a cliché. Subject is what is literally shown: who or what is in the frame, doing what. Style is the visual treatment: photo, line illustration, 3D render, watercolor, or a reference to a known brand look. Composition is how the shot is framed: close-up or wide, eye-level or overhead, centered or off-axis, where the negative space sits. Mood/Lighting sets atmosphere: warm afternoon light, cool overcast, dramatic side-lighting, neutral studio. Constraints are the guardrails: aspect ratio (16:9 for slides, 1:1 for social), exclusions (no readable text, no fake logos, no specific stereotypes), and any "must include" or "must avoid" notes. Run through all five every time, even when one feels obvious. "Obvious" is exactly where models default to the average. Here's how the checklist works in practice. Milo is helping Jessica turn a vague workplace image idea into a usable prompt before she opens the image tool: Milo: What's the subject? Jessica: A team collaborating around a laptop. Milo: That's a stock-photo prompt. Be specific. Where? Who's in the shot? What are they actually doing? Jessica: Three people, side-by-side at a kitchen-style office counter, looking at one laptop, mid-conversation. Hands gesturing, not posed. Milo: Better. Now: style, composition, lighting, constraints. Skip any of those and you'll get the LinkedIn version of your idea. Notice Milo's pressure isn't on creativity, it's on specificity. Vague subject equals vague output.

Draft a Prompt from a Workplace Goal 🥅

A prompt starts with the communication goal, not the image. Ask yourself: what does this image need to do for the reader? Reinforce a slide's theme? Set a tone on a wiki page? Anchor a social post? The goal tells you the constraints before you touch subject or style. Once you know the goal, translate it into a concrete scene. "Customer obsession" is a theme, not a subject; a model can't draw a theme. Push it into something you could photograph: a support specialist on a video call with a customer, sticky-note walls covered in user quotes, a designer watching someone use a prototype. Pick one. Then layer the other four anatomy elements on top. Write the prompt as one block of clear sentences, not a list of tags. Something like: a wide-angle photograph of three colleagues at a standing desk reviewing customer feedback on a laptop, mid-conversation; warm afternoon light from a window on the left; muted earth tones; 16:9 aspect ratio with empty space on the upper right for slide title text; no readable signage, no fake logos, no clip-art-style poses. That's all five elements, on purpose, in one paragraph.

Refine by Tightening Specificity and Exclusions 🔧

Your first output will almost never be the one you ship. Treat it as a diagnostic. Look at what's off and trace it back to an anatomy element. Faces look uncanny? Tighten Style (specify "documentary photo, natural skin texture") or add an exclusion ("no airbrushed faces"). Background a mess? Tighten Composition ("clean background, shallow depth of field"). Image feels generic? Subject is probably still too abstract. Title text won't fit? You forgot a safe-area Constraint. The move is targeted edits, not rewrites. Change one or two elements, regenerate, compare. Keep a short list of exclusions you find yourself adding every time (no readable text, no warped hands, no stereotyped depictions, no fake brand logos), and reuse it as a baseline. The throughline of this unit: a usable image prompt is a deliberate set of five choices, not a wish. Vague in, generic out; specific in, slide-ready out. The test of any of this is whether it holds up live, so the next step is a conversation with a design partner where you'll walk through each anatomy element for a real hero image before you ever open the tool. Bring a concept that's already concrete, and expect to leave the call with it sharper.

Summary

This lesson introduces the Image Prompt Anatomy, a five-part framework for creating specific, professional visuals. You will learn how to translate abstract communication goals into concrete scenes and layer details like style, composition, and constraints to avoid generic results. By mastering this structured approach, you can move from vague requests to high-quality, on-brand images through deliberate drafting and iterative refinement.

Previous Lesson

Next Lesson: Responsible Image Editing

Join the 1M+ learners on CodeSignal

Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal