DEV Community

Biricik Biricik
Biricik Biricik

Posted on • Originally published at zsky.ai

The 5-Element Prompt Formula Every Photographer Knows (And AI Users Should Steal)

The 5-Element Prompt Formula Every Photographer Knows (And AI Users Should Steal)

I spent a decade as a professional photographer before I ever typed a text-to-image prompt. When I started using AI tools, I realized most people were missing the one framework every photo pro already knows: every image is five decisions.

I'm Cemhan. I run ZSky AI. Before that I shot fashion editorial work and was shortlisted for the Sony World Photography Awards. This post is the crossover guide I wish existed when I first started prompting — the exact formula I use to write prompts that actually produce what I'm picturing.

(Well, what I'm "picturing" — I have aphantasia, so my mental imagery is text-based. Which is ironically perfect training for prompt writing.)

The formula

[SUBJECT] + [COMPOSITION] + [LIGHTING] + [STYLE] + [MOOD]
Enter fullscreen mode Exit fullscreen mode

Every real photograph is answering five questions:

  1. Subject — what is the image of?
  2. Composition — how is the camera positioned and framed?
  3. Lighting — where is the light coming from and what's its quality?
  4. Style — what visual tradition is the image working inside?
  5. Mood — what should the viewer feel?

Every AI image generator works better when you address all five explicitly. And here's the secret: you don't have to be fancy. You just have to not skip any of them.

Let me show you.

A bad prompt vs a good prompt

Bad prompt:

a woman walking in the city
Enter fullscreen mode Exit fullscreen mode

This is what 90% of first-time AI users write. It's three of the five elements at best — and they're all at the weakest level of specificity.

Good prompt using the formula:

a woman in her 60s with silver hair wearing a long wool coat (SUBJECT),
medium wide shot from slightly below eye level, she's walking
left-to-right across the frame (COMPOSITION),
late afternoon golden hour sunlight raking in from the left,
warm amber tones, shadows stretching across the pavement (LIGHTING),
shot on 35mm film with gentle grain, cinematic aspect ratio,
Kodak Portra 400 color palette (STYLE),
contemplative, peaceful, a quiet moment of independence (MOOD)
Enter fullscreen mode Exit fullscreen mode

This is the same idea. It's also ~10x the fidelity in output. You can copy-paste it into ChatGPT DALL-E, Midjourney, Runway, Pika, ZSky, Kling, Stable Diffusion — anything — and get back something recognizably in the same family across tools.

Why this works (the photography background)

Professional photographers don't walk into a shoot thinking "I want a nice picture." They walk in with answers to all five questions before they press the shutter. The lighting diagram is drawn. The shot list is written. The mood board is pinned. The composition is blocked out on set before the model arrives.

When you're writing a prompt, you're doing the same mental work — just at a keyboard instead of on a set. The camera equivalent is already in the AI. You just have to tell it what you would have told a gaffer, a DP, a wardrobe stylist, and a retoucher.

The reason AI image generators feel random to beginners is that the beginners are leaving four of the five decisions up to the model. Random output is exactly what should happen. Once you make the decisions yourself, the output stops being random.

The shorthand version

If the full formula feels long, here's the 1-line shorthand that captures 80% of the value:

[SUBJECT], [LIGHTING WORD], [STYLE WORD], [MOOD WORD]
Enter fullscreen mode Exit fullscreen mode

Examples:

  • A young dancer mid-leap in a rehearsal room, harsh window light, 35mm film grain, triumphant
  • An old fisherman mending a net on a rocky beach, overcast diffuse light, oil painting style, weary but patient
  • A stray cat sitting in an alley, neon backlight from a restaurant sign, comic book ink style, curious and suspicious

You can type these in 15 seconds and they'll outperform most 200-word "mega prompts" because they hit all the important decisions without wasting tokens on decoration.

What to skip

I see beginners stuffing prompts with things like:

  • beautiful, stunning, amazing, masterpiece
  • high quality, 8k, photorealistic, ultra detailed
  • professional photography, award winning, trending on artstation

None of these are prompt elements. They're hopes. They don't tell the model what to do, they just tell it to be good. The model is already trying to be good. Wishing harder doesn't help.

Replace hope words with decisions. Instead of beautiful write soft side light, face partially in shadow. Instead of amazing write from below, slightly off-center, dramatic perspective. Instead of masterpiece write Vermeer-style chiaroscuro. Decisions, not hopes.

The 3-tool portability test

Here's a practical exercise: write one prompt using the 5-element formula, then run it on 3 different AI tools. You should see three recognizably different images — different because the tools have different stylistic defaults — but all three should land in the same conceptual neighborhood.

If your results are wildly different (one is a portrait, one is a landscape, one is an abstract), your prompt is too vague. Go back and tighten the composition and subject elements.

If your results are identical (all three tools produced the same mood and style), your prompt is too specific — probably naming a specific artist or technique that all three tools have in their training data. That's sometimes what you want and sometimes a sign you're being lazy. Real photography is more specific than "like Van Gogh" and more generalizable than naming a single painter.

Negative prompts: the opposite of the formula

If the 5-element formula tells the model what to do, the negative prompt tells it what not to do. I use a standard negative prompt that I add to almost everything:

deformed hands, extra fingers, misplaced features,
blurry, low resolution, watermark, text overlay,
duplicate subjects, inconsistent anatomy, poorly drawn face
Enter fullscreen mode Exit fullscreen mode

Negative prompts work best when they're specific to the kinds of errors the model you're using actually makes. If you're getting weird hands, you put "deformed hands" and "extra fingers" first. If you're getting weird eyes, "asymmetric eyes" and "cross-eyed" go in.

The camera-as-prompt analogy

The deeper reason the formula works is that a prompt is a camera setting. You're telling a virtual camera what to look at, where to stand, how to light, what lens filter to use, and what mood to aim for. The AI is doing the image processing (what used to happen in a darkroom or in Photoshop), but you are still the one making the photograph.

I say this a lot because I think it matters: AI didn't replace photographers. It replaced cameras. Which means the photographer's craft is now the craft of prompting. Photographers who embrace this will have an edge. Photographers who resist it will lose relevance.

If you're reading this as a photographer wondering whether to learn AI tools: yes, learn them. Your visual decision-making is exactly the skill the prompt layer needs. You already know how to answer the five questions. You just have to type them instead of setting up a tripod.

And if you're reading this as an AI user who wishes your outputs were more consistent and more intentional: steal the formula. Make all five decisions before you hit generate. The randomness goes away.

TL;DR

  • Every image is 5 decisions: Subject + Composition + Lighting + Style + Mood
  • Bad prompts leave 2-4 of those undecided and blame the AI for being random
  • Replace "beautiful" and "high quality" with actual decisions
  • Test portability by running the same prompt on 3 tools
  • Negative prompts handle the specific error patterns of the tool you're using
  • Photographers already know this formula — they just call it "the shot"

Copy this post, bookmark it, use it. It's MIT-licensed in spirit. Go make something.

— Cemhan


I run ZSky AI, a free AI image and video platform built by a photographer with aphantasia. The prompt library on zsky.ai has ~1,700 tested prompts across every category. MIT licensed. Use them with any tool.

Top comments (0)