How to Write AI Image Prompts: The Complete Guide for Every Model (2026)

Q: What makes a good AI image prompt?

A good AI image prompt includes up to six elements: a specific subject, a visual style, lighting description, composition details, mood or atmosphere, and technical specifications like aspect ratio. Being specific about each element — such as naming an art style, a lighting technique, or a camera angle — gives you control over the output instead of hoping for something usable.

Q: Do different AI image models need different prompts?

Yes. DALL-E 3 works best with natural, conversational descriptions. Midjourney responds better to concise, keyword-driven prompts with parameters like --ar and --stylize. Stable Diffusion and Flux use weighted token syntax. Tailoring your prompt style to the model produces noticeably better results.

Q: How do you describe lighting in AI image prompts?

Use specific lighting terms that photographers use: golden hour for warm directional light, Rembrandt lighting for dramatic portrait shadows, rim lighting for silhouette effects, or volumetric lighting for visible light rays through fog. Lighting is the most underused element in image prompts and has a major impact on the mood and quality of the generated image.

SurePrompts Team

How to Write AI Image Prompts: The Complete Guide for Every Model

Most people type "a cool dragon" and wonder why the output looks generic. The difference between mediocre AI images and stunning ones isn't the model — it's the prompt. This guide breaks down exactly how image prompts work across every major model, with 15+ examples you can steal and modify. For instant image prompts, try our image prompt generator or model-specific tools for Midjourney.

85%

Of AI image quality is determined by the prompt — not the model, not the settings, not the upscaler

The Anatomy of a Good Image Prompt

Every effective image prompt contains up to six building blocks. You don't need all six every time, but knowing the full toolkit means you can control the output instead of hoping for something usable.

1. Subject

The core of what you want to see. Be specific. "A dog" gives you a generic stock photo dog. "A weathered border collie with heterochromia, mid-leap over a fallen log" gives you something worth keeping.

✗Before

a woman in a city

✓After

a Japanese woman in her 60s crossing a rain-slicked Shibuya intersection at dusk, carrying a yellow umbrella, shot from street level

Principles:

Name the subject precisely — species, age, gender, action, expression
Describe what they're doing, not just what they are
Include spatial relationships ("standing in front of," "reflected in," "towering above")
Quantity matters — "two" is very different from "a crowd of"

2. Style

The visual language. This is where most prompts either shine or collapse into "generic AI art."

Major style categories:

Category	Examples
Art movements	Impressionism, Art Deco, Bauhaus, Ukiyo-e, Art Nouveau
Photography styles	Street photography, editorial fashion, product photography, photojournalism
Media	Oil painting, watercolor, charcoal sketch, digital illustration, 3D render
Artist reference	"In the style of Edward Hopper" / "Reminiscent of Studio Ghibli"
Era	1970s Polaroid, Victorian illustration, Y2K aesthetic, retro-futurism
Genre	Noir, cyberpunk, cottagecore, brutalism, vaporwave

Tip

Combine styles for unique results. "Ukiyo-e woodblock print + cyberpunk Tokyo" produces something far more interesting than either style alone.

3. Lighting

Lighting is the single most underused element in image prompts. Photographers spend years learning this. You can describe it in five words.

Key lighting terms:

Golden hour — warm, directional, long shadows. The Instagram favorite.
Blue hour — cool, ambient, moody. Right before sunrise or after sunset.
Rembrandt lighting — dramatic, triangle shadow on cheek. Portraiture classic.
Rim/backlighting — subject outlined in light, dramatic silhouette effect.
Overcast/diffused — soft, even, no harsh shadows. Natural documentary feel.
Neon/artificial — colored light sources, urban night photography, cinematic.
Chiaroscuro — extreme contrast between light and dark. Caravaggio territory.
Volumetric lighting — visible light rays, fog, atmosphere. God rays through forest canopy.
Studio lighting — controlled, professional, clean. Product shots, headshots.

✗Before

a portrait of a chef

✓After

a portrait of a chef, Rembrandt lighting from the left, warm kitchen glow in the background, steam rising from a pan catching the backlight

4. Composition

How the image is framed. Most people skip this entirely, which is why AI defaults to center-framed, eye-level shots.

Composition terms that work:

Camera angle: bird's eye view, worm's eye view, Dutch angle, over-the-shoulder
Shot type: extreme close-up, medium shot, wide establishing shot, macro
Lens effect: fisheye, tilt-shift (miniature effect), bokeh (blurred background), lens flare
Framing: rule of thirds, symmetrical, leading lines, frame within a frame
Depth: shallow depth of field, deep focus, foreground/midground/background layering

5. Mood and Atmosphere

The emotional temperature of the image. This is often the difference between a technically correct image and one that feels like something.

Mood descriptors:

Ethereal, haunting, serene, ominous, whimsical, melancholic, chaotic, intimate
Nostalgic, surreal, clinical, raw, dreamlike, tense, celebratory, lonely

Atmosphere descriptors:

Foggy, dusty, smoky, rainy, hazy, crisp, humid, frozen
Dense, sparse, cluttered, minimal, lush, barren, vibrant, muted

6. Technical Specifications

Model-specific parameters that control output quality and format.

Aspect ratio: 16:9 (landscape/cinematic), 9:16 (vertical/mobile), 1:1 (square), 4:5 (Instagram portrait)
Quality terms: highly detailed, 8K, photorealistic, ultra-sharp, high resolution
Rendering: octane render, unreal engine, ray tracing, V-Ray (for 3D)
Camera: shot on Hasselblad, 35mm film, Fujifilm color science, Canon 85mm f/1.4

Model-Specific Tips

Each model interprets prompts differently. Here's what works (and what doesn't) for each.

DALL-E 3 (ChatGPT)

DALL-E 3 is integrated into ChatGPT, which means you write natural language and ChatGPT actually rewrites your prompt before sending it to the image model. This is both a strength and a limitation.

What works well:

Natural, conversational descriptions
Complex scenes with multiple elements and spatial relationships
Text in images (DALL-E 3 handles text better than most)
Specific art styles and cultural references

Tips:

Be descriptive but don't keyword-stuff — write like you're describing the image to someone
If ChatGPT's rewrite isn't producing what you want, ask it to use your exact prompt
For text in images, put the text in quotes and keep it short (under 6 words works best)
Specify what you DON'T want in natural language ("without any text overlay," "no people in the background")

For 50 copy-paste DALL-E prompts, see our ChatGPT image prompts guide.

Example prompt:

code

A cozy independent bookshop interior, late afternoon. Warm golden light streaming through tall windows, casting long shadows across wooden shelves overflowing with books. A ginger cat sleeping on a stack of hardcovers near the register. Dust motes visible in the light beams. Shot from a low angle, looking up at the shelves. Watercolor illustration style with loose, expressive brushwork and a warm ochre and sage color palette.

Midjourney

Midjourney excels at aesthetic quality and tends to produce "beautiful" images by default. It responds well to concise, keyword-driven prompts and has its own parameter system.

What works well:

Shorter prompts (Midjourney often performs better with less text)
Stylistic keywords and aesthetic references
Parameters like --ar, --style, --chaos, --stylize
Photography terms and camera references

Key parameters:

--ar 16:9 — aspect ratio
--s 250 — stylize level (0-1000, higher = more Midjourney aesthetic)
--c 30 — chaos (0-100, higher = more variation)
--no text, borders — negative prompts
--style raw — less Midjourney "beautification," more literal

Tips:

Front-load the most important elements
Use :: for weighted sections — portrait of a woman::2 neon cityscape background::1 emphasizes the portrait
Keep prompts under 60 words for best results
Use --style raw when you want the model to follow your prompt literally

For a complete Midjourney guide with advanced techniques, read our Midjourney v7 prompting guide.

Example prompt:

code

weathered lighthouse on a rocky cliff, stormy ocean, dramatic clouds breaking to reveal sunset, cinematic lighting, shot on medium format film, moody atmosphere --ar 16:9 --s 400

Stable Diffusion (and SDXL)

Stable Diffusion gives you the most control but requires the most prompt engineering. It's an open-source model that runs locally or through services like Stability AI, Automatic1111, or ComfyUI.

What works well:

Keyword-heavy, comma-separated prompts
Quality boosters ("masterpiece, best quality, highly detailed")
Negative prompts (essential — often more important than the positive prompt)
LoRA and model-specific syntax
Weighted tokens using (parentheses) for emphasis

Tips:

Start with quality keywords: "masterpiece, best quality, highly detailed, sharp focus"
Use negative prompts aggressively: "low quality, blurry, deformed, watermark, text, bad anatomy, extra fingers"
Parentheses increase weight: (golden hour lighting:1.3) emphasizes the lighting
Order matters — put the most important elements first
Checkpoint model choice matters more than in other platforms — a photorealistic checkpoint handles photos better than an anime checkpoint

Negative prompt template (use as a starting base):

code

(worst quality:1.4), (low quality:1.4), (normal quality:1.4), blurry, bad anatomy, bad hands, extra digits, fewer digits, cropped, watermark, text, signature, jpeg artifacts, deformed, disfigured, mutation, extra limbs

Example positive prompt:

code

masterpiece, best quality, a Japanese tea house in autumn, maple leaves in brilliant red and orange, koi pond reflecting the trees, soft morning mist, traditional architecture, highly detailed, warm color palette, Studio Ghibli aesthetic, (volumetric lighting:1.2), serene atmosphere

Flux (by Black Forest Labs)

Flux is the newer model gaining serious traction for its prompt adherence and photorealistic capabilities. Available in Flux Pro, Flux Dev, and Flux Schnell variants.

What works well:

Detailed, natural language descriptions (similar to DALL-E 3)
Precise spatial relationships and complex compositions
Photorealistic outputs with accurate human anatomy
Text rendering (among the best of any model)
Consistent style across multiple generations

Tips:

Flux follows long, detailed prompts accurately — don't fear length
Natural language outperforms keyword-stuffing (unlike Stable Diffusion)
Specify camera and lens for photorealistic shots
For maximum control, describe foreground, midground, and background separately
Flux handles multiple subjects and their relationships better than most models

For Flux-specific techniques and advanced prompts, see our Flux Pro prompting guide.

Example prompt:

code

A photorealistic aerial view of a coastal Italian village at golden hour. Terracotta rooftops cascade down a hillside toward a small harbor with colorful fishing boats. Narrow cobblestone streets are visible between buildings. Mediterranean cypress trees dot the hillside. Warm sunset light bathes the scene in amber, with long shadows stretching east. Shot on a DJI Mavic with a wide-angle lens, slight tilt-shift effect making the village look miniature. 16:9 aspect ratio.

15 Example Prompts With Explanations

Here are prompts across different use cases, with annotations explaining why each element works.

Product Photography

Prompt 1 — Minimalist Product Shot:

code

A frosted glass perfume bottle on a smooth marble surface, soft diffused studio lighting from above, water droplets on the glass, a single orchid petal in the foreground slightly out of focus, clean white background, product photography, shot on Phase One IQ4 150MP, 4:5 aspect ratio

Why it works: Specifies material (frosted glass), surface (marble), lighting (diffused, from above), details (water droplets), depth (orchid petal out of focus), and technical specs (camera, aspect ratio). Every element serves the "premium product" goal.

Prompt 2 — Lifestyle Product Context:

code

Flat lay of a leather journal, brass pen, and espresso in a ceramic cup on a worn oak desk, morning sunlight from a window on the left creating soft shadows, a smartphone showing a calendar app slightly cropped at the edge, warm earth tones, editorial photography style for a lifestyle brand

Portraits

Prompt 3 — Environmental Portrait:

code

An elderly Vietnamese fisherman mending nets on a wooden boat at dawn, Hoi An river, golden hour sidelight illuminating weathered hands, shallow depth of field with lanterns blurred in the background, National Geographic documentary style, shot on 85mm f/1.4, warm color grade

Why it works: Specific subject (not "old man"), specific location, specific time, specific action, specific lens. The "National Geographic documentary style" anchors the entire aesthetic.

Prompt 4 — Stylized Fashion Portrait:

code

Editorial fashion portrait, a model with shaved head and geometric face paint in cobalt blue, wearing an oversized deconstructed blazer, standing in a concrete brutalist parking garage, hard directional flash creating sharp shadows, high contrast black and white with only the face paint in color, Helmut Newton meets Hype Williams, 3:4 aspect ratio

Landscapes

Prompt 5 — Natural Landscape:

code

A glacial lake in Patagonia at blue hour, mirror-still water reflecting jagged snow-capped mountains, single dead tree in the foreground breaking the symmetry, thin clouds streaking across a deep blue sky, long exposure effect on the water, shot on large format film, Ansel Adams tonal range, panoramic 21:9 aspect ratio

Prompt 6 — Urban Landscape:

code

Tokyo alleyway at night in the rain, neon signs in Japanese reflecting off wet asphalt, a single figure with a transparent umbrella walking away from camera, steam rising from a ramen shop vent, cyberpunk color palette of magenta and teal, shallow depth of field, street photography, Liam Wong inspired

Illustration Styles

Prompt 7 — Vintage Botanical:

code

Scientific botanical illustration of a Venus flytrap (Dionaea muscipula) in various stages — closed trap, open trap, and catching a fly. Detailed cross-section showing trigger hairs and digestive glands. Aged parchment background, hand-drawn with fine ink line work and delicate watercolor tinting, 18th century natural history style, labeled in elegant cursive handwriting

Prompt 8 — Children's Book:

code

A bear cub and a fox kit building a treehouse together in an autumn forest, the bear holding a plank while the fox hammers a nail, scattered tools and acorns around the base of the oak tree, golden afternoon light filtering through red and orange leaves, warm and inviting children's book illustration style, gouache on paper texture, slightly flat perspective, inspired by Jon Klassen

Prompt 9 — Concept Art:

code

A floating marketplace on a terraformed Mars, domed glass structures connected by covered bridges over red dust plains, vendors selling hydroponic produce, mixed human and android shoppers, two pale moons visible in a pinkish sky, hard science fiction aesthetic, detailed concept art for a film production, wide establishing shot, Greg Rutkowski lighting

Abstract and Artistic

Prompt 10 — Abstract:

code

Abstract fluid art composition, deep ocean blues and liquid gold merging and separating in organic patterns, microscopic detail in the paint cell structures, high contrast between dark prussian blue depths and bright metallic gold peaks, macro photography of actual fluid pour, dramatic studio lighting from the side creating depth in the paint surface, 1:1 square format

Prompt 11 — Surrealist:

code

A massive antique pocket watch melting over the edge of a cliff overlooking the ocean, the clock face filled with water and small fish swimming among the gears, surrealist painting, muted earth tones with a single vivid turquoise ocean, soft sfumato edges, Dalí meets Magritte, oil on canvas texture, museum quality

Architecture and Interior

Prompt 12 — Architectural Visualization:

code

A modern Japanese-Scandinavian fusion house, floor-to-ceiling windows framing a bamboo garden, exposed timber ceiling beams, polished concrete floors with a sunken living area, minimal furniture in natural oak and bouclé fabric, late afternoon sun creating warm rectangles of light on the floor, architectural photography by Hiroshi Sugimoto, wide angle, clean and serene

Prompt 13 — Interior Design:

code

A maximalist home library, floor-to-ceiling bookshelves on every wall, rolling ladder in aged brass, a tufted emerald velvet reading chair with a floor lamp, stacks of books on a Persian rug, warm incandescent lighting, afternoon light through a single arched window creating god rays in the dusty air, overhead view looking down at the chair, rich and moody color palette

Food Photography

Prompt 14 — Overhead Food Shot:

code

Overhead flat lay of a homemade pasta dinner spread, hand-cut pappardelle with wild mushroom ragu in a rustic ceramic bowl, torn crusty bread, half-empty wine glass casting a red shadow, scattered parmesan shavings and fresh thyme, worn wooden table surface with flour dusting, moody food photography, natural window light from the top, dark and warm color grade, shot on Fuji GFX 100S

Graphic Design and Branding

Prompt 15 — Logo Concept:

code

A minimal, geometric logo mark for a sustainable architecture firm called "Verdant." The mark should merge a leaf shape with architectural blueprint lines, using clean vectors with no gradients. Single color — deep forest green (#2D5016) on white. Scalable from favicon to billboard. Modern, clean, and immediately readable. Show the mark alone, no text, centered on white background, vector illustration style

Common Mistakes (and How to Fix Them)

Mistake 1: Being Too Vague

"A beautiful landscape" tells the model nothing. Beautiful how? Landscape where? What season? What time of day?

✗Before

a beautiful landscape

✓After

a misty Scottish highland at dawn, heather in bloom across rolling hills, a single stone cottage with smoke from the chimney, low clouds clinging to the valleys, atmospheric landscape photography, shot on medium format film

Mistake 2: Keyword Stuffing Without Structure

"Beautiful amazing stunning gorgeous incredible breathtaking epic masterpiece" is wasted tokens. These words all mean roughly the same thing to the model. Use specific descriptors instead of stacking superlatives.

Mistake 3: Ignoring Negative Prompts

Especially for Stable Diffusion: if you don't specify what you don't want, the model will give you its default — which often includes watermarks, extra fingers, blurry backgrounds, and text overlays. Always use negative prompts for SD/SDXL.

Mistake 4: Fighting the Model's Strengths

Midjourney makes everything beautiful. If you want raw, ugly, or documentary-style, use --style raw or switch to a model better suited to realism. DALL-E 3 handles text well. SD gives you technical control. Flux does photorealism. Use the right tool.

Mistake 5: Forgetting Aspect Ratio

The default is usually square (1:1). If you want a landscape scene, specify 16:9 or wider. If you want a phone wallpaper, specify 9:16. If you want an Instagram portrait, specify 4:5. Aspect ratio dramatically affects composition.

Mistake 6: Not Iterating

Your first prompt is a draft. Generate, evaluate, adjust. Move the lighting. Change the camera angle. Add a foreground element. Remove a detail that's cluttering the composition. The best images come from 3-5 iterations, not one lucky shot.

Advanced Techniques

Multi-Subject Composition Control

When your image has multiple subjects, describe their spatial relationship explicitly:

code

In the foreground: a child's hand reaching up to release a red balloon. In the midground: the balloon rising past the branches of a cherry blossom tree. In the background: the balloon as a tiny dot against a vast cloudy sky. Shot from below looking up, vertical composition, each layer at a different focus distance.

Style Mixing

Combine two unexpected styles for unique results:

code

Art Deco poster design meets Japanese woodblock print. A great wave (Hokusai-inspired) rendered in geometric Art Deco patterns, gold and deep navy color palette, strong linework, limited color print aesthetic, vintage poster format with aged paper texture

Time and Sequence

Describe temporal elements for dynamic images:

code

A coffee cup on a desk, time-lapse style showing the coffee at three stages: full and steaming on the left, half-empty in the center, empty with lipstick stain on the right. Same desk, same angle, changing natural light from morning (cool) to afternoon (warm). Triptych layout, editorial photography.

Texture and Material Focus

When material matters more than subject:

code

Extreme macro photography of rust patterns on an abandoned ship hull. Iron oxide in deep oranges and browns, flaking paint revealing layers of color underneath — red, then green, then bare metal. Water droplet trapped in a rust crater reflecting the overcast sky. Shot with extension tubes, razor-thin depth of field, every metallic crystal visible.

Negative Space and Minimalism

Less is more — but you have to describe the "less":

code

A single red maple leaf resting on a vast, smooth grey stone surface, positioned in the lower right third of the frame. The remaining 80% of the image is empty stone texture with subtle grain variation. Minimal, contemplative, Japanese wabi-sabi aesthetic. Muted colors except the leaf. Overhead shot, soft overcast light.

Consistent Character Across Generations

For projects needing the same character in multiple images, create a detailed character brief and reuse it:

code

Character brief — ALWAYS include in prompts for this character:
"A woman in her mid-30s with short silver-grey hair in a textured pixie cut, warm brown skin, prominent cheekbones, small scar above left eyebrow, wearing a navy blue mechanic's jumpsuit with rolled sleeves, oil-stained hands, confident posture."

Scene-specific prompt: [CHARACTER BRIEF] + standing in a neon-lit garage, leaning against a vintage motorcycle she's restoring, warm tungsten lighting mixing with cool neon, cinematic medium shot, shallow depth of field

The Prompt Formula Cheat Sheet

When you're staring at a blank prompt field, start with this formula and fill in what matters:

code

[SUBJECT] + [ACTION/POSE] + [SETTING/ENVIRONMENT] + [STYLE/MEDIUM] + [LIGHTING] + [COMPOSITION/CAMERA] + [MOOD] + [TECHNICAL SPECS]

You don't need every element. A strong subject + style + lighting prompt often beats a prompt with all eight elements but weak specifics. Quality of description beats quantity of keywords.

1

Start with the subject — be ruthlessly specific about what you see in your head

2

Add the style — one clear reference is better than five vague ones

3

Set the lighting — this single addition will improve 80% of prompts

4

Frame the shot — camera angle and lens choice control the viewer's experience

5

Set the mood — two or three atmosphere words anchor the emotional tone

6

Add technical specs — aspect ratio at minimum, camera/render engine if relevant

Build Prompts Faster

Writing image prompts from scratch every time is slow. These tools speed up the process:

Image Prompt Generator — Describe what you want in plain English, get a structured image prompt for any model
Midjourney Prompt Generator — Build Midjourney-optimized prompts with parameters and style controls
SurePrompts Builder — Save your best image prompt structures as reusable templates with customizable fields

Keep Reading

ChatGPT Image Prompts: 50 Copy-Paste Templates — Ready-to-use prompts for DALL-E 3 via ChatGPT
Midjourney v7 Prompting Guide — Deep dive into Midjourney's latest model
Flux Pro Prompting Guide — Master prompting for Black Forest Labs' Flux
Prompt Engineering Basics — Fundamentals that apply to both text and image prompts
AI Prompt Frameworks — Structured approaches to writing any kind of prompt
Prompt Templates — Browse 320+ templates including image generation

How to Write AI Image Prompts: The Complete Guide for Every Model (2026)

How to Write AI Image Prompts: The Complete Guide for Every Model

The Anatomy of a Good Image Prompt

1. Subject

2. Style

3. Lighting

4. Composition

5. Mood and Atmosphere

6. Technical Specifications

Model-Specific Tips

DALL-E 3 (ChatGPT)

Midjourney

Stable Diffusion (and SDXL)

Flux (by Black Forest Labs)

15 Example Prompts With Explanations

Product Photography

Portraits

Landscapes

Illustration Styles

Abstract and Artistic

Architecture and Interior

Food Photography

Graphic Design and Branding

Common Mistakes (and How to Fix Them)

Mistake 1: Being Too Vague

Mistake 2: Keyword Stuffing Without Structure

Mistake 3: Ignoring Negative Prompts

Mistake 4: Fighting the Model's Strengths

Mistake 5: Forgetting Aspect Ratio

Mistake 6: Not Iterating

Advanced Techniques

Multi-Subject Composition Control

Style Mixing

Time and Sequence

Texture and Material Focus

Negative Space and Minimalism

Consistent Character Across Generations

The Prompt Formula Cheat Sheet

Build Prompts Faster

Keep Reading

Get ready-made ChatGPT prompts

Related Resources

Midjourney V7 Product Shot Template

Midjourney V7 Cinematic Still Template

Midjourney V7 Character Portrait Template

Midjourney V7 Video Hero Shot Template

Related Articles

50 Best ChatGPT Image Prompts: Copy-Paste Templates That Actually Work (2026)

Midjourney V7 Prompting: Everything New + 50 Best Prompts

Flux Pro Prompting Guide: Open Source Image AI Mastery