"Cinematic" as a Sora keyword stopped meaning anything by the second day of v1 — the model would return a generic wide shot with a lens flare and call it done. In Sora 2, you actually get film looks by doing what a DP does: specifying shot type, camera move, lens character, lighting motivation, color grade, and audio direction. These 30 copy-paste prompts are written that way — each one is a director brief, not a vibe request.
What Makes a Sora 2 Prompt Cinematic
Aspect ratio sets format expectations before a single frame renders. 2.39:1 tells the model you're shooting widescreen anamorphic — the frame is wide, the negative space is intentional, the letterbox is part of the composition. 9:16 reads vertical, which shifts the visual grammar entirely toward social and documentary. State it explicitly and the model treats it as a production constraint, not decoration.
Naming the camera move beats the word "cinematic" every time. A slow dolly-in to a close-up and a locked-off wide shot are cinematically opposite choices. "Cinematic camera move" is meaningless — "slow dolly-in from medium to tight close-up, ending on eyes" is an instruction the model can execute. The same applies to crane-up, orbit-around-subject, push-in handheld, and tracking alongside. Named moves produce consistent motion; adjectives produce noise.
Lens choice drives compression, bokeh character, and spatial relationships. A 35mm anamorphic widescreen shot has horizontal lens flares, slight barrel breathing, and a wide field of view that keeps the environment in play. An 85mm portrait compression shot collapses depth, separates subject from background, and keeps the frame intimate regardless of the location behind it. Vintage 16mm grain means a different shutter behavior and a specific halation around highlights. Specify the glass, not just the mood.
Lighting is a set of motivated decisions, not a single adjective. Rembrandt lighting has a specific triangular catchlight placement on the shadow side of the face — say that and you get a sculpted key without filling the shadows. Motivated practicals means every light source in the frame has a visible reason: a bare bulb, a monitor glow, a neon sign. Sodium-vapor street means orange-cast, low CRI, with hard shadows from overhead sources. Naming these setups tells the model both the quality of light and its implied source.
Color grade language closes the loop between image and genre. Teal-and-orange is the dominant blockbuster grade: pushed skin tones toward orange, pushed everything else toward teal. Bleach-bypass compresses the blacks, desaturates the mids, and gives a silver-retention look associated with gritty war films. Name the grade explicitly and pair it with matching audio direction — "wind through dry grass, distant diesel engine" added to the frame sells the world the image implies. For a full breakdown of Sora 2 prompt construction, see the Sora 2 prompts guide and the complete AI video prompting guide.
Film Genre Cinematic Clips (1–5)
1. Neo-Noir City Night
A wide shot of a lone figure in a rain-soaked trench coat walking away
from camera down a slick urban alley, night. Neon signs reflect in
puddles — red, pink, blue. Camera: locked-off static with a subtle rack
focus from foreground rain droplets to the retreating figure. Lens:
40mm anamorphic, moderate depth of field, visible horizontal flares on
neon sources. Lighting: motivated practicals only — neon signage and a
single sodium-vapor streetlamp casting hard shadows at 45 degrees from
above-left. Color grade: crushed blacks, desaturated mids, neon hues
pushed into saturation while skin and concrete remain cool. Audio: low
rain ambience, distant traffic hum, wet footsteps on concrete, no
dialogue. Aspect: 2.39:1. Duration: 10 seconds.
2. Modern Western
A medium shot of a weathered rancher in a duster coat standing at the
edge of a dry canyon at late afternoon, squinting into sun. Camera:
slow dolly-in from medium to tight close-up, ending on eyes under the
hat brim. Lens: 85mm with slight portrait compression, shallow depth
of field — canyon wall behind goes soft. Lighting: hard side-key from
setting sun at screen-right, deep shadow on screen-left side of face,
no fill, golden-orange color temp 3200K. Color grade: warm tungsten
shadows, desaturated greens in the scrub brush, skin tones pushed
slightly orange-amber. Audio: dry canyon wind, distant hawk cry,
leather creak, no dialogue. Aspect: 2.39:1. Duration: 8 seconds.
3. Slow-Burn Horror
A tight over-the-shoulder shot of a child standing at the far end of
a long, dimly lit institutional hallway at night, facing away from
camera, not moving. Camera: extremely slow push-in handheld, almost
imperceptible movement, the hallway does not appear to get closer.
Lens: 50mm spherical, deep focus — both the child and the hallway
lights sharp. Lighting: single overhead fluorescent tube at mid-hall
flickering cold white 5600K, long shadow toward camera, no other
source. Color grade: desaturated greens and yellows in the walls,
crushed blacks in doorways, clinical cool cast across the entire frame.
Audio: fluorescent hum with intermittent buzz, distant institutional
HVAC, complete absence of footsteps, no dialogue. Aspect: 2.39:1.
Duration: 12 seconds.
4. Near-Future Sci-Fi
A wide establishing shot of a massive arcology megastructure rising
from a fog-covered coastal city at blue-hour dusk, rain falling.
Camera: slow crane-up from street level, rising to reveal the full
structure disappearing into cloud cover. Lens: 24mm rectilinear,
slight barrel distortion keeping verticals slightly converging.
Lighting: city ambient underlight bouncing upward off fog 3000K warm,
structure lit from internal blue-white LEDs 6500K, rain catching light
as vertical streaks. Color grade: teal-and-orange split — warm amber
street-level practicals against cool cyan upper structure, midrange
desaturated. Audio: low-frequency sub drone, heavy rain, distant
automated traffic, synth texture rising with crane move. Aspect: 2.39:1.
Duration: 12 seconds.
5. Period Drama
A medium close-up of a woman in 19th-century dress seated at a
candlelit writing desk in a stone manor library at night, quill moving
across paper. Camera: locked-off static on a slight Dutch tilt, slow
pull-focus from inkwell in foreground to her face at mid-frame. Lens:
vintage 75mm with visible halation around candle highlights, soft
spherical rendering. Lighting: motivated candle practicals — warm
tungsten 2000K, flickering, key falling from screen-right, left side
of face in deep shadow, no fill. Color grade: warm amber shadows,
desaturated blues in the stone walls, no teal push, organic grain
overlay. Audio: quill scratching parchment, distant fireplace crackle,
rain against stone window, no dialogue. Aspect: 2.39:1. Duration:
10 seconds.
Director-Style Cinematic Clips (6–10)
6. Wes Anderson Symmetrical Pastel Motion
A perfectly symmetrical wide shot of a vintage pastel-painted hotel
lobby — coral pink walls, mint green wainscoting, a uniformed bellhop
standing dead-center — at soft afternoon interior. Camera: locked-off
static, then a single lateral dolly-right moving a precise measured
distance to reveal a second identical bellhop mirroring the first,
then stopping. Lens: 27mm spherical, zero distortion, no depth of
field — everything tack sharp front to back. Lighting: flat, even
soft-box quality interior light 3800K, no hard shadows, every surface
uniformly lit. Color grade: pastel palette — pinks, mints, creams,
powder blue accents pushed into saturation, no blacks, lifted shadows
throughout. Audio: precise comedy foley — single footstep, paper
shuffle, muted lobby ambient. Aspect: 1.85:1. Duration: 8 seconds.
7. Denis Villeneuve Epic Minimalism
An extreme wide shot of a solitary figure crossing an enormous salt
flat under a white overcast sky, the horizon perfectly bisecting the
frame — sky above, cracked earth below. Camera: locked-off static,
absolute stillness, figure crossing left to right in extreme long shot
over 10 seconds. Lens: 65mm spherical on large-format sensor,
ultra-shallow depth of field in sky haze, heat shimmer off the flat.
Lighting: overcast diffuse daylight from directly above — no direction,
no shadows, the entire world equally lit, total flatness. Color grade:
desaturated near-monochrome — whites pushed to off-white, cracked earth
to ash-grey, figure a dark silhouette, bleach-bypass approach. Audio:
profound silence broken only by wind across open flat, distant
atmospheric pressure, no foley, no dialogue. Aspect: 2.39:1. Duration:
12 seconds.
8. David Fincher Cool Cyan Precision
A medium shot of two people seated across a conference table in a
glass-walled corporate office at night, city visible behind them.
Camera: extremely slow push-in dolly from medium to medium-close, so
precise it reads as almost static. Lens: 35mm spherical with clinical
sharpness, minimal depth of field difference between subjects and
background. Lighting: cold overhead practical fluorescents 4100K as
fill, subtle cyan bounce off glass walls, no warm sources anywhere —
every light source cool. Color grade: cyan-shifted mids and shadows,
compressed blacks, slightly desaturated skin tones into grey-pink,
the look associated with Greig Fraser's harder corporate interiors.
Audio: air conditioning hum, near-inaudible city traffic through glass,
precise ambient silence, no dialogue. Aspect: 2.39:1. Duration:
10 seconds.
9. Wong Kar-wai Saturated Longing
A tight close-up of a man's hands holding a cigarette in a neon-lit
Hong Kong corridor at night, smoke rising in slow motion. Camera:
handheld, slightly restless, slow rack focus between the cigarette
tip and the man's face, then back, then the blurred neon behind.
Lens: 28mm with close-focus, slight lens breathing, warm focus fall-off
at frame edges reminiscent of Christopher Doyle's approach. Lighting:
motivated practicals — red and gold neon light at 45 degrees from
above-right, warm orange bounce from below, deep shadow on the left
side, high contrast. Color grade: saturated reds and ambers pushed
hard, skin tones warm, shadows red-shifted, the entire frame bathed
in warm neon excess. Audio: low bass texture, distant city —
indistinct voices, scooter engine fading, slow rain — no dialogue.
Aspect: 1.85:1. Duration: 10 seconds.
10. Christopher Nolan IMAX Expansion
A wide exterior shot of a man standing on the nose of a massive dam,
water rushing below, mountain range stretching to the horizon,
overcast sky above. Camera: crane-up from tight on the man to a
sweeping wide reveal of the full landscape, continuous move, ending
on a locked-off wide where the man is small against the environment.
Lens: 15mm ultra-wide spherical IMAX character, slight barrel,
exaggerating the scale relationship between human figure and
infrastructure. Lighting: flat natural overcast daylight, the dam
structure grey and industrial, Hoyte van Hoytema-style desaturated
natural palette. Color grade: cool and desaturated — blue-grey skies,
grey concrete, muted greens, skin tones cool and natural, no grade
artifice. Audio: roar of rushing water building with crane move, wind
at height, no music, no dialogue. Aspect: 1.43:1 (IMAX). Duration:
14 seconds.
Named Camera Move Prompts (11–15)
11. Slow Dolly-In to Close-Up
A medium shot of an aging boxer sitting alone in the corner of an
empty ring at dawn, training gloves on lap, head down. Camera: slow
dolly-in on a rail from medium full shot to tight close-up of face,
the move lasting the full duration — 12 seconds of continuous approach.
Lens: 50mm spherical, the dolly-in breathing the depth of field
progressively shallower as we close. Lighting: single motivated
practical — a bare overhead ring light, hard top-key only, deep shadows
under eyes and jaw, no fill. Color grade: desaturated warm with slight
bleach-bypass compression in the shadows, grain. Audio: distant gym
ambience, fabric settling, very faint arena echo, no music.
Aspect: 2.39:1. Duration: 12 seconds.
12. Crane Down From Sky
A wide aerial establishing shot of a medieval village at golden hour —
thatched rooftops, cobblestone square, a single market stall. Camera:
crane-down from directly above the village descending slowly to eye
level at street height, the final frame a medium wide of the market
stall and a merchant. Lens: 24mm rectilinear, verticals straightening
as we descend to eye level. Lighting: golden-hour practical sun from
screen-left at low angle, long shadows across cobblestones, warm
3000K color temp throughout, rim light on rooftop edges. Color grade:
warm amber throughout, pushed golden-hour saturation on stone and thatch,
clean shadows with organic warmth. Audio: village ambient — distant
church bell, crowd murmur rising as we descend, market sounds
sharpening. Aspect: 2.39:1. Duration: 12 seconds.
13. Orbit Around Subject
A medium shot of an astronaut in a full EVA suit floating in open
space with Earth visible below, arms outstretched. Camera: a full
360-degree orbit around the astronaut — starting from face-on, moving
to profile, continuing to back, returning to face-on, the camera
maintaining a constant distance while Earth rotates slowly below
throughout. Lens: 35mm anamorphic, horizontal lens flares catching
the sun during the orbit's sun-facing quarter. Lighting: hard
unfiltered sunlight from a single direction — one side of the suit
bright white, the other side completely black, zero ambient fill.
Color grade: no grade intervention — the vacuum of space dictates the
palette. Audio: absolute silence broken by EVA suit breathing and
radio static hiss. Aspect: 2.39:1. Duration: 14 seconds.
14. Push-In Handheld
A medium shot of a protest leader stepping onto a concrete podium to
address a crowd at dusk, the crowd stretching back out of focus.
Camera: push-in handheld from behind a crowd member — the operator
moving through the crowd, slight organic weave, breathing into
tighter and tighter on the speaker's face over the full duration.
Lens: 85mm portrait compression, the crowd's bokeh rings pressing in
from the frame edges. Lighting: mixed late dusk — blue sky ambient
above, sodium-vapor street practicals warming faces orange from below,
high contrast between sky and ground sources. Color grade: warm orange
skin tones against cool blue sky, high contrast, pushed street grit.
Audio: crowd ambient building, distant chant, wind, a microphone
feedback squeal at the moment the speaker reaches the podium.
Aspect: 2.39:1. Duration: 10 seconds.
15. Locked-Off Wide-to-Tight Reveal
A locked-off static wide shot of a dark kitchen at 3am — every
detail sharp — then a slow rack focus from the far background (a
window with street light) progressively pulling toward the foreground,
revealing a figure sitting at the kitchen table who was invisible in
the background focus. The final focus rests on the figure's still
silhouette. Camera: absolutely static, zero camera movement, all
motion achieved through focus rack only. Lens: 35mm spherical,
deep focus start transitioning to shallow. Lighting: single motivated
street-light practical through window — hard parallel beams, cold
sodium-vapor orange 2200K, everything else in near-total dark.
Color grade: orange sodium cast on all surfaces, deep blacks, no fill.
Audio: refrigerator hum, distant traffic, a single chair creak at
the reveal, silence. Aspect: 2.39:1. Duration: 10 seconds.
Lens and Optics Cinematic Prompts (16–20)
16. 35mm Anamorphic Widescreen Action
A wide shot of a motorcycle courier weaving at speed through congested
city traffic at late afternoon, golden light. Camera: tracking shot
running parallel to the motorcycle, keeping pace, the city blurring
into bokeh behind. Lens: 35mm anamorphic — oval bokeh rings on
background light sources, visible horizontal lens flares from sun
backlighting, slight breathing on the focus. Color grade: teal-and-orange
blocking — asphalt pushed teal, skin and the motorcycle's warm surfaces
pushed orange, high contrast. Lighting: hard backlight sun from screen-
right producing rim light on the rider, street-level reflections.
Audio: engine note dominating, tire noise on asphalt, rush of air,
city ambient compressed behind. Aspect: 2.39:1. Duration: 10 seconds.
17. 85mm Portrait Compression Dialogue
A tight two-shot of two characters facing each other in a cramped
apartment doorway at night, about to separate. Camera: locked-off
static on both faces, no movement. Lens: 85mm portrait compression —
the hallway behind both characters collapsed to flat bokeh, extreme
compression making the door frame and the gap between them feel
claustrophobic. Lighting: single motivated practical — overhead
hallway bulb 2700K, top-key only, catching upper faces, eyes in
slight shadow, jawlines lit. Color grade: warm amber interior tones,
slight crush in the blacks of the hallway, skin tones natural.
Audio: apartment building ambient — distant TV, muffled voices,
air vent — the silence between the characters palpable. Aspect:
1.85:1. Duration: 10 seconds.
18. Vintage 16mm Grain Documentary
A medium observational shot of an elderly fisherman repairing nets
on a weathered dock at early morning, fog over the water. Camera:
handheld observational, slight drift and reframe as if the operator
is watching rather than directing — occasional loss of perfect
horizontal. Lens: vintage 16mm character — visible grain structure
in the frame, softer resolving power than modern glass, halation
around the brightest dock lights and the sky. Lighting: flat pre-
dawn grey ambient, no direct sun, diffuse fog scatter — everything
lit softly from above with no directionality. Color grade: 16mm
print character — slightly washed-out whites, warm shifted mid-tones,
organic grain that reads as film. Audio: dock creak, water lap,
net fiber dragging on wood, distant buoy bell, no dialogue.
Aspect: 1.33:1. Duration: 10 seconds.
19. Fish-Eye Distortion Subjective POV
A tight POV shot from the perspective of someone falling backward
through a trapdoor — the room above receding rapidly as the opening
shrinks. Camera: locked to the falling subject's perspective,
tracking down, the room above framed by the square trapdoor opening.
Lens: extreme 8mm fish-eye — dramatic barrel distortion warping the
ceiling and walls of the room above, the trapdoor opening circular
rather than square at the frame edges. Lighting: interior practical
from the room above — single bulb 3000K providing the only light,
the space below the door in increasing darkness. Color grade:
pushed contrast and vignette pressing in from the barrel edges,
harsh falloff toward black frame corners. Audio: rush of air,
sudden silence as the trapdoor closes, darkness. Aspect: 1:1.
Duration: 6 seconds.
20. Tilt-Shift Miniature City
A wide elevated shot of a dense city intersection at golden hour,
traffic flowing, pedestrians crossing. Camera: locked-off static
from a high angle. Lens: tilt-shift lens with the focus plane
tilted to run horizontally through the mid-distance of the frame —
creating a narrow band of sharp focus while the near-ground and
the distant buildings both fall into progressive bokeh, making
everything read as a scale model. Lighting: actual golden-hour
sunlight raking across the scene from low screen-right, long
shadows stretching across the intersection. Color grade: saturation
pushed hard — toy-like colors, rich stop-sign red, taxi yellow,
deep blue sky — the miniaturization illusion reinforced by color pop.
Audio: the sound design treated to match the visual — miniaturized,
slightly pitched up, toy-like. Aspect: 2.39:1. Duration: 8 seconds.
Lighting and Mood Cinematic Prompts (21–25)
21. Rembrandt Key Light Interior
A medium close-up of an artist in a sparse studio, facing three-
quarters to camera, working on an unseen canvas at afternoon interior.
Camera: locked-off static. Lens: 75mm spherical, shallow depth of
field — the studio behind soft. Lighting: Rembrandt setup — single
key from upper screen-left at approximately 45 degrees, creating
the triangular catchlight on the shadow side of the face, shadow
side in deep falloff, no fill, no reflector. Color temp: warm
tungsten practical 2700K. Color grade: warm amber with rich dark
shadows, no shadow detail lifted, the Rembrandt triangle glowing
against the dark side. Audio: brush on canvas, paint jar settling,
no ambient, absolute silence of concentration. Aspect: 1.85:1.
Duration: 8 seconds.
22. Motivated Practicals Only Night Exterior
A wide shot of a woman standing on a rain-wet fire escape outside
a tenement building at 2am, city below and above. Camera: locked-off
static at medium distance. Lens: 35mm anamorphic. Lighting:
motivated practicals only — no artificial fill or key added beyond
what exists in the scene: window light from apartment behind her
(warm 2700K casting forward), a neon laundry sign one floor below
(pink-red, intermittent), sodium street lamp below-right at low
angle (orange, hard-edged). No light source introduced that isn't
visibly motivated. Color grade: all three light colors competing —
orange street, pink neon, warm window — rich and unreconciled.
Audio: distant traffic, rain, neon sign electrical buzz at its cycle
interval, muffled apartment TV. Aspect: 2.39:1. Duration: 10 seconds.
23. Golden-Hour Magic-Hour Exterior
A wide shot of two children running toward camera through a field
of tall grass at magic hour, sun directly behind them. Camera:
locked-off static, then a slow rack focus from the grass in the
foreground to the children as they approach. Lens: 50mm spherical,
backlight creating natural lens flare halation. Lighting: pure magic-
hour backlight — sun just above the horizon, 1800K deep amber,
both children silhouetted with golden rim light catching hair and
shoulder edges, the grass lit from behind into luminous gold,
zero frontal illumination. Color grade: push the warm — amber skies,
golden grass, rich organic color. No desaturation, no teal. Audio:
children's laughter at distance growing closer, grass rustling,
warm wind, birds clearing. Aspect: 2.39:1. Duration: 10 seconds.
24. Harsh Sodium-Vapor Street Night
A wide shot of an empty underpass at 1am — a lone figure walking
away from camera along the center of the road, walls graffitied.
Camera: locked-off static. Lens: 28mm spherical, moderate depth
of field. Lighting: overhead sodium-vapor streetlamps only — the
classic municipal orange 2100K, monochromatic, rendering colors as
shades of amber and brown, no blues or greens surviving the color
shift. Shadows hard and directly downward from each lamp's overhead
position. Color grade: allow the sodium cast to dominate — the entire
frame in amber-orange monochrome, desaturated everything except the
sodium tones, crushed shadows between lamp pools. Audio: distant
highway drone through concrete, water drip, distant train, the
figure's footsteps receding. Aspect: 2.39:1. Duration: 12 seconds.
25. Single Rim With Deep Shadow Noir
A tight medium shot of a detective in a dark office, seated, profile
to camera, smoke rising from an ashtray on the desk. Camera:
locked-off static. Lens: 50mm spherical, shallow depth of field.
Lighting: single rim light from behind at screen-right — a bare
practical desk lamp behind the figure, illuminating only the back
shoulder, the back of the head, and the edge of the smoke rising
from the ashtray. The entire face is in deep shadow with no
catchlight. A slight cool spill from a street window at screen-
left fills almost nothing. Color grade: extreme contrast — near-black
shadows consuming 60% of the frame, the rim catching warmth 2700K,
smoke rendered in grey against dark. Audio: ticking clock, distant
rain, paper rustle, cigarette crackle. Aspect: 2.39:1. Duration:
10 seconds.
Color Grade Cinematic Prompts (26–30)
26. Teal-and-Orange Thriller
A medium wide shot of a government operative moving through a
crowded airport terminal at midday — fast walk, civilian clothes,
eyes forward. Camera: handheld tracking shot following slightly
behind and to the side, keeping pace. Lens: 35mm spherical.
Lighting: airport overhead fluorescents at 4000K as base fill,
warm skin tones from the crowd, hard glass-ceiling daylight from
above supplementing. Color grade: full teal-and-orange treatment —
skin tones pushed warm into orange-amber, all neutral surfaces
(floor, walls, signage backgrounds) pushed toward teal, shadows
cold teal, highlights warm. Midrange desaturated. The definitive
commercial thriller look. Audio: airport terminal ambient —
announcement PA, crowd din, hard footsteps on tile. Aspect: 2.39:1.
Duration: 10 seconds.
27. Bleach-Bypass War Film
A wide shot of soldiers moving through a destroyed urban street at
overcast daylight — rubble, blown-out building facades, smoke.
Camera: handheld, documentary urgency, weave and drift. Lens:
28mm spherical, slight vignette from lens barrel. Lighting: overcast
war-zone ambient — flat diffuse daylight from above, the smoke
scattering all directional quality out of the light. Color grade:
full silver-retention bleach-bypass — compressed blacks that lift
into heavy grain rather than pure black, desaturated color
throughout (greens muted, skin tones grey-pink), high contrast in
the luminance channel while chroma is suppressed. The look of
Saving Private Ryan's opening and countless post-2000 war films.
Audio: distant artillery report, falling debris, boots on rubble,
wind through empty windows, no score. Aspect: 2.39:1. Duration:
12 seconds.
28. Warm Tungsten Interior
A medium shot of a family gathered at a dinner table during a
holiday meal — candles lit, food spread, mid-conversation, evening
interior. Camera: locked-off static from slight above-eye angle.
Lens: 35mm spherical, medium depth of field, background room
soft. Lighting: motivated candles on the table as key practicals
2000K, supplemented by a warm overhead tungsten chandelier 2700K —
all light sources warm, no cool or daylight sources competing.
Color grade: embrace the tungsten warmth — deep amber shadows,
warm skin tones, orange candle highlights, wood and fabric
rendered in their warmest read. No teal counter-push. Audio:
overlapping conversation, silverware on china, a child's laugh,
ambient warmth. Aspect: 1.85:1. Duration: 10 seconds.
29. Cool Fluorescent Corporate
A wide shot of an open-plan office at night — a single employee
remaining, working at a screen surrounded by empty desks,
overhead fluorescents. Camera: slow push-in dolly from wide to
medium, ending on the lone employee's face. Lens: 40mm spherical.
Lighting: overhead commercial fluorescents 4100K as sole source —
cool, flat, uniform, casting multiple overlapping soft shadows
downward from every overhead panel. No warmth, no directionality.
Color grade: lean into the fluorescent green-shift — push the mids
slightly green, desaturate skin tones to grey-pink, lift the
shadows slightly to preserve the anemic midrange, suppress all
warmth. Audio: fluorescent hum, keyboard clicks, distant building
HVAC, the specific silence of an empty office at midnight.
Aspect: 2.39:1. Duration: 12 seconds.
30. Monochrome Black-and-White Film
A wide shot of a jazz musician playing double bass alone on a
small stage in an empty club at night — a single spotlight, chairs
on tables in the background. Camera: slow orbit from face-on to
three-quarter profile, moving around the musician over the full
duration. Lens: 75mm with period-appropriate soft rendering, slight
glow on specular highlights. Lighting: single hard spotlight from
above — top-key only, harsh falloff to shadow, the dark club beyond
the light pool completely black. Designed specifically for mono
rendering: high contrast, no dependence on color differentiation.
Color grade: full desaturation — processed as if shot on Kodak
Double-X black-and-white negative film, pushed contrast, grain
present, whites not clipped, rich blacks. Audio: upright bass
resonance filling the empty room, bow hair on string, room
ambience, no other musicians. Aspect: 1.85:1. Duration: 14 seconds.
Cinematic Sora 2 Power Tips
State 2.39:1 or 21:9 for genuine widescreen. The aspect ratio declaration is a production constraint, not decoration. It changes frame composition, negative space, and how the model distributes visual weight. Default output reads square or broadcast — specify widescreen if that's the format you're building for.
Name the camera move — don't say "cinematic." Dolly-in, push-in, orbit, crane-up, locked-off, handheld, tracking — each of these is a specific instruction with a specific visual result. "Cinematic camera movement" tells the model nothing. "Slow dolly-in from medium to tight close-up over 10 seconds" tells it exactly what to execute.
Specify the lens — anamorphic and 85mm portrait are opposite choices. Anamorphic 35mm delivers oval bokeh, horizontal lens flares, and a wide spatial relationship between subject and environment. 85mm portrait compression collapses depth, separates subject from background, and creates intimacy regardless of the location. These two lenses produce entirely different motion and spatial grammar — picking the wrong one contradicts your shot type.
Describe lighting like a DP. "Motivated practical" means every light source has a visible reason in the frame. "Rembrandt key" means a specific triangular catchlight placement with deep shadow-side falloff. "Sodium-vapor street" means 2100K orange monochromatic with hard downward shadows. Named setups tell the model both the quality of light and its implied source — adjectives like "dramatic lighting" tell it neither.
Name the color grade explicitly. Teal-and-orange, bleach-bypass, warm tungsten, cool fluorescent, monochrome double-X — each of these has a specific tonal relationship and a genre association the model can map to. "Cinematic color grade" could mean anything and usually means nothing. The grade name closes the loop between image and genre intent.
Audio direction sells the cinematic frame. Sora 2 generates synchronized audio, and it matters. "Wind through dry mesquite, a distant windmill turning" is a Western sound design cue; "western ambient audio" is not. Specific foley and ambient direction — a refrigerator hum, a brass section at distance, wet footsteps on concrete — tells the model what world the image lives in and completes the sensory frame the visuals establish.
Cinematic shot of a city at night.
A wide locked-off shot of a rain-soaked urban alley at 2am, a lone figure in a trench coat walking away from camera. Camera: static, rack focus from foreground puddle reflection to the figure. Lens: 40mm anamorphic, horizontal neon flares. Lighting: motivated practicals only — sodium-vapor streetlamp overhead at 2200K, neon signage at 45 degrees, no fill. Color grade: crushed blacks, desaturated mids, neon hues pushed while concrete remains cool. Audio: low rain ambience, distant traffic, wet footsteps on concrete. Aspect: 2.39:1. Duration: 10 seconds.
Build Your Own Sora 2 Director Brief
These 30 prompts follow a strict production format because that's what Sora 2 responds to. The model has the capability for coherent camera motion, native audio, and character consistency across a clip — the ceiling isn't the model, it's the prompt. A vague brief returns vague output. A brief that reads like a DP's shot specification returns output that actually looks like a film.
Use the AI prompt generator to build structured Sora 2 prompts automatically — describe what you're trying to shoot and get a production-ready director brief. For more on Sora 2 across every use case, see the best Sora 2 prompts for 2026, the full Sora 2 prompts guide, and the complete AI video prompting guide.