Most "Sora prompts" circulating online were written for Sora 1 — vague vibes, no camera direction, no audio intent, hope for the best. Sora 2 is a different model. It rewards a director's brief: shot framing, lens choice, motivated lighting, explicit camera move, and a sound layer the model can actually render. These 50 copy-paste Sora 2 prompts are built that way.
What Changed in Sora 2
Coherent clips at real length. Sora 2 can hold a scene together across longer clips — up to roughly 20 seconds — without the physics degrading or characters drifting. That's the headline capability. It means you can prompt a full action sequence, a complete product reveal, or a two-person exchange without Sora losing the thread halfway through.
Native audio in the model. This is Sora 2's signature differentiator. The model generates ambient sound, foley, and dialogue-level audio as part of the video — not as a separate post-processing step. Describe the sound you want (the hiss of an espresso machine, the crack of a basketball on hardwood, a character saying a short line) and the model renders it. The prompts in this list use explicit audio direction throughout. It's not optional — it's one of the main levers you have over the output.
Explicit camera control. Sora 2 responds to camera language with much more precision than its predecessor. Dolly-in, handheld, locked-off, pan-left, push-in, orbit — these aren't decorative words. They change the output meaningfully. Writing "cinematic" tells the model nothing useful. Writing "slow dolly-in on a 50mm lens from chest to close-up" tells it exactly what you want.
Character and scene consistency. Subjects hold across cuts. If you describe a specific person, environment, or prop in a multi-shot prompt, Sora 2 maintains those visual anchors through transitions. This makes it viable for narrative work — not just isolated clips. For a deep dive on building consistent characters and using Sora 2's prompting formula from scratch, read the complete Sora 2 prompting guide.
Multi-shot framing. Sora 2 understands shot transitions. You can describe "Shot 1: wide establishing" and "Shot 2: medium push-in on character" and get a coherent cut, not two disconnected clips stitched together. Eight of the prompts below use this explicitly. For the broader craft of structuring multi-shot AI video sequences, see the AI video prompting complete guide.
Cinematic Wide & Establishing Shots (1–8)
1. Neon-Night Cityscape
Wide establishing shot of a rain-slicked urban street at 2 a.m. Neon signs in pink, cyan, and amber reflect in standing puddles on the asphalt. A lone figure in a dark coat walks away from camera down the center of the empty street. Camera: locked-off on a tripod at eye level, 24mm lens, slight underexposure for atmosphere. Lighting: practical neon sources only — no fill, deep shadows between pools of color. Audio: light rain on pavement, distant traffic, a passing train two blocks over, the low hum of a neon transformer. Color grade: teal and orange with deep blacks. Aspect: 2.39:1. Duration: 10 seconds.
2. Vast Desert at Midday
Extreme wide shot of a red sandstone desert stretching to a heat-haze horizon. A single unpaved road bisects the frame. No vehicles. A dust devil spins slowly in the middle distance. Camera: locked-off high angle from a mesa ledge, 16mm ultra-wide, deep focus. Lighting: harsh midday sun directly overhead, bleached sky, no shadow relief — oppressive heat visible in the shimmer. Audio: wind across open rock, sand granules skittering, the absence of any human sound. Color grade: burnt sienna, bone white, overexposed sky. Aspect: 2.39:1. Duration: 8 seconds.
3. Sci-Fi Establishing Shot
Aerial wide shot descending slowly toward a sprawling space station orbiting a gas giant. The station rotates at a steady pace. Solar panels catch orange light from the distant sun. Camera: slow dolly-down from orbit perspective, rotating with the station, 35mm equivalent lens. Lighting: hard directional sunlight from frame left, deep shadow on the station's dark side, the gas giant's reflected amber glow as fill. Audio: low mechanical hum of rotating station, faint internal systems, the silence of space broken only by structural vibration. Aspect: 2.39:1. Duration: 12 seconds.
4. Period Drama Exterior
Wide shot of a Victorian terraced street in winter, early morning, gas lamps still lit against a grey dawn sky. Light snow dusts the cobblestones. A horse-drawn milk cart moves slowly left to right across the far background. Camera: locked-off at street level across the road, 50mm lens, slight elevation on a low riser. Lighting: soft grey overcast, warm amber glow from gas lamps, no strong shadows. Audio: horse hooves on cobblestone, glass milk bottles clinking, a distant church bell striking six, someone's breath visible in cold air. Color grade: desaturated with warm amber accents. Aspect: 16:9.
5. Snow Mountain Summit
Wide shot looking down from a Himalayan ridge at a sea of clouds below. The sun rises behind camera, casting long blue shadows across snow and ice. A climber in red high-altitude gear stands at the ridgeline looking out, back to camera. Camera: locked-off handheld with minimal movement on a 24mm lens, slight upward angle to show scale. Lighting: alpenglow on the snow — pink to gold transition over 8 seconds, no artificial sources. Audio: wind gusting at altitude, the creak of a crampon on ice, absolute stillness between gusts. Color grade: cold blue shadow, warm golden peak light. Aspect: 16:9. Duration: 10 seconds.
6. Crowded Street Market
Wide tracking shot through a dense outdoor market in Southeast Asia at midday. Camera moves at walking pace down the central aisle, handheld, medium height, 28mm lens. Vendors call out, colorful produce and fabric on either side. Depth of crowd visible through 15 meters of frame. Lighting: overhead sun filtered by canvas canopies, dappled light on the ground, strong practical shadows. Audio: overlapping vendor calls in Thai, sizzling from a nearby grill, motorcycle passing, children laughing, the ambient hum of dense human activity. Color grade: saturated greens, reds, and saffron. Aspect: 16:9. Duration: 8 seconds.
7. Ocean Storm at Dusk
Epic wide shot from a cliff overlooking a stormy North Atlantic. Waves crash 40 feet below. Heavy rain and sea spray in the air. The last light of a storm-filtered sunset turns the water dark green and steel grey. Camera: locked-off on a high cliff ledge, 24mm wide, deep focus, no movement. Lighting: stormy overcast with one break in the clouds allowing a single dramatic shaft of orange light across the far water. Audio: roaring surf, wind howling, rain on rock, thunder rolling in from distance, the structural creak of the cliff edge. Color grade: desaturated green-grey with one warm accent beam. Aspect: 2.39:1. Duration: 12 seconds.
8. Golden-Hour Countryside
Wide shot of rolling English countryside at golden hour. Hedgerows divide fields of wheat and green pasture. A narrow lane disappears over a hill to the right. Two sheep graze in the near foreground. Camera: locked-off at field height, 50mm lens, slight upward angle toward the amber sky. Lighting: sun at 15 degrees above horizon, backlit wheat glowing gold, long warm shadows stretching left. Audio: birdsong, a light breeze through wheat, distant sheep, an occasional bee. Color grade: warm amber and deep green, soft contrast. Aspect: 16:9. Duration: 8 seconds.
Character Action Prompts (9–15)
9. Person Walking and Talking
Medium shot of a woman in her late 30s, shoulder-length dark hair, wearing a tailored navy blazer, walking briskly through a glass-walled corporate corridor. She speaks on the phone — engaged, nodding, occasionally gesturing with her free hand. Camera: lateral tracking shot at walking pace, handheld, 50mm lens, slightly below eye level. Lighting: soft overhead fluorescent fill, floor-to-ceiling window light from the left, natural daylight streaming in. Audio: her side of a confident business call ("...we need to close this by Friday, not negotiate it"), heel-clicks on polished floor, ambient office hum. Aspect: 16:9. Duration: 8 seconds.
10. Athlete in Full Sprint
Medium-wide shot of a male sprinter in the final 30 meters of a 100m race, track stadium behind him. Full athletic effort — arms pumping, face in controlled exertion, body leaning forward at full speed. Camera: tracking shot at track level moving with the athlete, 85mm telephoto, slightly ahead of the sprinter. Lighting: bright stadium floodlights, hard key from above, motion blur on background crowd. Audio: crowd roar rising, lane markers rushing past, wind buffeting the camera mic, the snap of feet on synthetic track. Aspect: 16:9.
11. Two-Person Dialogue
Shot 1: Medium two-shot of two men in their 40s sitting across a diner booth, late night. One leans forward with intensity; the other leans back, arms crossed. Shot 2: Cut to close-up over-the-shoulder on the leaning-back man as he responds, expression shifting from skeptical to uncertain. Both shots: locked-off, 50mm lens. Diner booth with red vinyl, Formica table, coffee cups. Lighting: single overhead pendant lamp, hard shadows under eyes and jawline, neon glow from a window behind. Audio: coffee cup set down, ambient diner noise low, the tense pause between lines, then the second man quietly says "You knew the whole time, didn't you." Aspect: 16:9. Duration: 12 seconds.
12. Child and Dog Moment
Close-up of a 5-year-old girl sitting cross-legged on a back porch in summer, laughing as a golden retriever licks her face repeatedly. She tries to hold the dog still but keeps dissolving into giggles. Camera: locked-off at child's eye level, 85mm telephoto, shallow depth of field blurring backyard garden. Lighting: soft afternoon sun from frame left, warm fill from light-colored house wall. Audio: child's uncontrollable laughter, dog panting enthusiastically, tail thumping on wood porch, the girl saying "Stop it, Biscuit!" between laughs. Color grade: warm, saturated summer. Aspect: 9:16. Duration: 6 seconds.
13. Craftsperson at Work
Close-up of a glassblower's hands at a furnace, shaping molten orange glass on a steel blowpipe. Slow rotation of the pipe, the glass stretching and holding form. Sweat visible on forearms. Camera: locked-off at table level, 100mm macro-equivalent, tight on hands and glass, face softly out of focus behind. Lighting: the molten glass itself as practical key light — intense orange glow on hands and face, industrial forge dark behind. Audio: the roar of the furnace, glass scraping pipe, occasional breath through the blowpipe, the creak of the craftsperson's stool. Aspect: 1:1. Duration: 8 seconds.
14. Skateboard Trick
Wide-to-medium shot of a skateboarder executing a kickflip at the lip of an empty concrete drainage channel, late afternoon. The board flips cleanly, feet reconnect, she rolls away fakie. Camera: starts wide locked-off showing the full run-up, then cuts mid-air to a tight follow tracking shot at board level. 24mm wide for approach, 50mm for trick. Lighting: low sun from behind camera, long shadow ahead of skater, warm backlight on the trick. Audio: wheels on concrete, the sharp pop of the kickflip, board slap on landing, wheels rolling away on rough concrete. Aspect: 16:9.
15. Fight Choreography
Wide shot of two martial artists in a controlled sparring match on a hardwood dojo floor, morning light. Clean, technical exchanges — no contact to the head, visible control, mutual respect in movement. Camera: locked-off at distance, 50mm lens, full bodies always in frame. Lighting: natural morning light through dojo windows at a 45-degree angle, hard directional shadows, no fill. Audio: the snap of a gi during movement, feet on hardwood, controlled exhales on strikes, the ambient quiet of a focused space. Aspect: 16:9. Duration: 10 seconds.
Product & Commercial Prompts (16–22)
16. Beverage Hero Shot with Condensation
Extreme close-up of a dark glass bottle of craft beer being lifted slowly from a bed of crushed ice. Condensation runs down the label. The cap is off. The mouth of the bottle is at frame top as it clears the ice. Camera: locked-off at near eye-level with the bottle, 100mm macro, shallow depth of field, bottle sharp, background bokeh. Lighting: single backlight through translucent white diffusion panel creating a glowing product-lit look; no shadows on label. Audio: ice shifting as bottle is removed, the light drip of condensation, refrigerator hum fading. Color grade: cool blue tones, dark amber of the beer catching backlight. Aspect: 9:16. Duration: 5 seconds.
17. Food Being Prepared
Overhead locked-off shot of a chef's hands slicing a ripe mango on a dark marble board. Each cut is precise, the fruit opening to show its deep orange interior. Steam rises faintly from a pan at the top of frame. Camera: directly overhead, 50mm equivalent, locked position, centered on the board. Lighting: single overhead softbox giving even, shadow-free illumination; slight specular highlight on the marble. Audio: the clean tap of the knife on marble with each cut, the faint sizzle of the pan, the soft sound of mango segments separating. Color grade: warm, saturated food-photography palette. Aspect: 1:1. Duration: 6 seconds.
18. Tech Device Demo
Medium close-up of a hand picking up a matte black wireless earbud case from a white table and opening it with a satisfying click. The earbuds illuminate briefly as the case opens. The hand tilts the case toward camera to show the interior detail. Camera: locked-off slightly above table level, 85mm, shallow depth of field. Lighting: clean soft-box key from the upper right, white bounce card fill from left, product photography white background. Audio: the precise click of the case mechanism, a soft digital tone as earbuds pair, ambient silence emphasizing product quality. Aspect: 1:1. Duration: 5 seconds.
19. Fashion Item in Motion
Medium shot of a linen summer dress in movement — the wearer walks away from camera along a whitewashed Mediterranean terrace overlooking the sea, then turns back. The fabric catches the breeze. Camera: locked-off at mid-distance, 85mm telephoto, subject moving in and out of depth zone, fabric always in focus. Lighting: natural midday sun, slightly overhead, fabric translucency visible at the hem, sea light reflecting off terrace. Audio: light breeze, fabric movement, the wearer's sandals on stone, distant water. Color grade: bright, airy — warm ivory and Mediterranean blue. Aspect: 9:16. Duration: 8 seconds.
20. Beauty Product Application
Close-up of a woman applying a pearl-drop serum to her cheekbone with two fingertips. Gentle tapping motion. The serum catches the light as it absorbs. Camera: locked-off at close distance, 100mm macro, tight on cheekbone and fingers, eyes softly present in upper frame. Lighting: single large softbox at 45 degrees for skin-flattering even illumination, faint catchlight in the eye. Audio: the soft tap of fingers on skin — intentional ASMR quality, the ambient quiet of a bathroom morning, no music. Color grade: clean warm skin tones, neutral grey-white background. Aspect: 9:16. Duration: 5 seconds.
21. Automotive in Motion
Wide-to-medium tracking shot of a dark blue sedan navigating a two-lane mountain road at speed, early morning. Pine trees blur in the background. The car takes a sweeping left-hand bend cleanly. Camera: car-mount tracking shot from a position slightly ahead and to the passenger side, moving with the vehicle, 24mm wide. Lighting: overcast dawn, flat natural light, headlights still on, warm interior light visible through windshield. Audio: engine note rising on acceleration, tire grip on asphalt through the corner, the rush of wind past the camera. Color grade: cool grey-blue dawn, dark navy car paint. Aspect: 16:9. Duration: 8 seconds.
22. Packaging Unboxing
Overhead locked-off shot of hands methodically unboxing a premium white product package. Tissue paper is folded back to reveal a matte black device nestled inside. One hand lifts the device out. Camera: directly overhead, 50mm equivalent, locked. Product packaging fills the frame. Lighting: dual softbox setup, even and shadow-free, clean commercial look. Audio: the deliberate crinkle of tissue paper, the slight squeak of dense foam holding the product, the quiet of an ASMR-quality unboxing pace. Color grade: pure white with black, minimal, clean. Aspect: 1:1. Duration: 7 seconds.
Lifestyle & Documentary Prompts (23–29)
23. Coffee Shop Morning
Medium shot of a barista in her late 20s pulling an espresso shot at a La Marzocco machine in a specialty coffee shop. She watches the extraction intently. Morning light through a large window. Behind her, the café is beginning to fill. Camera: locked-off across the counter at customer height, 50mm lens. Lighting: natural window light as key, warm café lamp fill behind the bar. Audio: the hiss and gurgle of the espresso machine, the ambient chatter of a waking café, the clink of cups, soft background music at low volume. Color grade: warm amber and cream. Aspect: 16:9. Duration: 8 seconds.
24. Family Dinner
Wide shot of a family of four around a round dinner table — two adults, two children roughly 8 and 12 — in a warm kitchen, evening. A serving bowl is being passed. Talking and laughing. Camera: locked-off from slightly above, 35mm lens, capturing the full table and surrounding kitchen context. Lighting: overhead pendant lamp as practical key, warm fill from kitchen lighting, window dark behind. Audio: overlapping conversation, the clink of cutlery on plates, the child asking "Can I have more?", genuine family noise. Color grade: warm, high-key domestic. Aspect: 16:9. Duration: 8 seconds.
25. Working From Home
Medium shot of a man in his early 40s at a standing desk in a home office, late morning. He's on a video call — engaged, speaking clearly, a mug in his left hand. Bookshelves and a window behind him, soft daylight. Camera: locked-off at desk height, 50mm lens. Lighting: window light as key from the left, soft fill from a monitor on the right, no overhead lights. Audio: his side of the call ("...that makes sense, let me check the deck"), the click of his keyboard as he references something, the ambient quiet of a home. Aspect: 16:9. Duration: 8 seconds.
26. Gym Workout
Medium wide shot of a woman in her 30s performing a clean set of dumbbell Romanian deadlifts in a well-lit, uncrowded gym. Full movement shown — hip hinge, loaded stretch, controlled return. Camera: lateral tracking shot staying level with the lifter's hips, 50mm lens, moving slowly with the motion. Lighting: gym overhead LEDs, slightly harsh but clean, no dramatic effect needed. Audio: the exhale on the concentric phase, rubber dumbbells touching the platform, ambient gym soundtrack in the background, a weight rack shifting in the distance. Aspect: 16:9. Duration: 8 seconds.
27. Weekend Hike
Wide-to-medium shot of two women hiking a trail ridge in autumn. Deciduous trees in peak color on either side. They walk side by side talking, not looking at the camera. The trail curves ahead. Camera: starts wide, locked on a 28mm, then slowly pushes in over 8 seconds to a medium two-shot. Lighting: overcast autumn light, even and soft, no strong directional shadows. Audio: boots on packed earth and leaf litter, birdsong in the canopy, wind through turning leaves, the conversation in the foreground — warm, casual, unintelligible but natural. Color grade: amber, orange, deep red. Aspect: 16:9. Duration: 10 seconds.
28. Urban Commute
Wide tracking shot of the interior of a subway car during morning rush hour. Passengers standing, seated, looking at phones or windows. The car sways slightly as it moves. Camera: handheld at seated height, 28mm wide, slight movement following the car's motion. Lighting: fluorescent overhead tube lights, mixed with flashes of tunnel darkness through the windows. Audio: the rhythmic clatter of the train on rails, the PA announcement of the next station, the compressed silence of strangers in proximity, a phone playing audio without headphones at low volume. Color grade: flat fluorescent, muted urban. Aspect: 16:9. Duration: 8 seconds.
29. Restaurant Kitchen
Wide shot of a professional restaurant kitchen at full service — four chefs moving simultaneously, pans on open flames, plating at the pass, the expediter calling tickets. Controlled chaos. Camera: locked-off from a high corner angle showing the full kitchen, 28mm wide, slight elevation looking down. Lighting: harsh overhead commercial kitchen LEDs, open gas flames as warm practical accents. Audio: the call-and-response of the expediter and line ("Yes, Chef!"), sizzle of fat in a pan, the metallic clatter of a hotel pan on a shelf, urgent footfall on rubber matting. Aspect: 16:9. Duration: 10 seconds.
Narrative Short / Multi-Shot Prompts (30–36)
30. Two-Shot Dialogue Scene
Shot 1: Medium close-up of a woman in her late 30s sitting at a park bench, looking just off-camera right. She pauses, then delivers a line — "I think I always knew." Her expression is a mix of relief and sadness. Camera 1: locked-off, 85mm telephoto, shallow depth of field, bokeh trees behind. Shot 2: Cut to a matching medium close-up of a man the same age on the same bench, reacting. He looks down, then back at her. Camera 2: matching 85mm setup from the opposite side. Both shots: afternoon golden-hour backlight, minimal fill, warm rim light on both subjects. Audio: park ambient — breeze, distant children, birdsong — continuous across the cut; no score. Aspect: 16:9. Duration: 12 seconds.
31. Action-Reaction Sequence
Shot 1: Extreme close-up of a hand pressing a red button on an industrial panel — tight on fingers and button, motion slightly slow. Shot 2: Wide shot of a large warehouse loading door beginning to roll upward as industrial machinery engages. Shot 3: Medium close-up of a worker watching the door rise, expression going from focused to satisfied. All shots: locked-off, matched industrial lighting — overhead sodium vapor, cool and slightly amber. Audio continuity: the mechanical click of the button triggering the relay (Shot 1), the door motor's deep hum starting (Shot 2), the chain drive rattling as it rolls up, the worker breathing out through the nose (Shot 3). Aspect: 16:9. Duration: 10 seconds total.
32. Character Entering a Room
Single-take, medium-wide shot of a man in his 50s opening a frosted-glass office door, stepping through, and pausing to read the room before moving further in. The room is empty. He expected someone to be there. Camera: locked-off across the room, 35mm, his figure entering from the left third. Lighting: interior office fluorescent, cold and flat, with warm hallway light briefly visible through the open door. Audio: the door handle click, the soft exhale of the pneumatic hinge, his footstep on carpet, a pause — then the faint sound of his watch ticking as he stands still. Aspect: 16:9. Duration: 8 seconds.
33. Pickup and Reveal
Shot 1: Overhead close-up of a table surface. A hand enters frame and picks up a folded note. Shot 2: Cut to medium close-up of a woman reading the note, expression unreadable. Shot 3: She lowers the note slightly, revealing her eyes — and the faintest smile. Three shots, each locked-off at corresponding angles. Lighting consistent: single warm desk lamp as key across all three shots, suggesting same room. Audio: the soft crinkle of paper unfolding (Shot 1), silence except for ambient room tone (Shot 2), and the almost-inaudible exhale of relief that preludes the smile (Shot 3). Aspect: 16:9. Duration: 10 seconds.
34. Before-and-After Transformation
Shot 1: Wide shot of a derelict urban lot — overgrown, fenced, debris visible. Grey overcast day. Locked-off on a 35mm lens at street level. Shot 2: Match cut to the identical camera position and framing — same lot, now a finished community garden. Raised beds, painted fence, people tending plants. Same overcast weather maintained for continuity. Audio: Shot 1 is near-silent — wind, one distant car; Shot 2 brings in people talking quietly, the scrape of a trowel, children, birdsong — the same space made alive. The contrast is purely auditory and visual — no dissolve, a clean cut. Aspect: 16:9. Duration: 10 seconds total.
35. Found-Footage Style
Handheld, shaky medium shot of two hikers moving quickly through dense forest at dusk, the person filming clearly running to keep up. Flashlight beams cut through underbrush. One hiker looks back at the camera and says "Come on, keep up!" before turning forward again. Camera: aggressively handheld, 24mm, slight motion blur on fast pans. Lighting: practical — two handheld flashlights as the only sources, harsh shadows, tree trunks momentarily illuminated. Audio: heavy breathing, footfall on fallen leaves, branches snapping, a flashlight battery rattle, the ambient darkness of a forest at dusk. Aspect: 9:16. Duration: 8 seconds.
36. Single-Take Walk Through Space
Single unbroken shot following a woman walking from a narrow back corridor through a fire door, down a wide exhibition gallery with art on the walls, and out through glass doors to an outdoor terrace. She walks at a confident, unhurried pace. Camera: handheld following shot, 28mm wide, staying at her shoulder level. The environments change completely — corridor to gallery to terrace — but the subject and camera motion are continuous. Lighting: dim corridor, then bright gallery track lighting, then natural daylight on the terrace — the transitions are motivated and physical. Audio: echoey corridor footfall, transitioning to the acoustic of the gallery, transitioning to open-air ambient on the terrace — all continuous. Aspect: 16:9. Duration: 14 seconds.
Sports, Motion & Physics Prompts (37–43)
37. Basketball Dunk
Medium shot of a basketball player launching from just inside the arc, rising through the air, and finishing with a two-handed slam dunk — rim shaking, net swishing. Camera: locked-off slightly below rim level at the baseline, 50mm lens, subject rising to fill upper frame. Lighting: sports arena floodlights, hard overhead key, slight rim backlight. Audio: sneaker squeak on hardwood on the last step, the hang in the air — near silence — then the violent slam of ball on rim, the net snapping, crowd eruption. Aspect: 16:9. Duration: 5 seconds.
38. Skateboard Line
Wide tracking shot following a skateboarder through a three-trick line in a concrete skate plaza: a boardslide on a low rail, into a manual on the flat section, into a pop shove-it to close. Camera: moving tracking shot alongside at board level, 28mm, keeping pace with the skater throughout. Lighting: late afternoon sun from behind camera, long shadows ahead, warm light on the skater's back and board. Audio: wheel speed on concrete, the scrape of a truck on the rail, the tap on manual, the pop and board slap on the shove-it landing. Aspect: 16:9. Duration: 8 seconds.
39. Soccer Goal
Wide shot of a soccer striker receiving a through-ball on the right side of the penalty area, taking one touch to set, and striking low into the far corner. The ball hits the back of the net. Camera: behind-goal wide angle, 24mm, keeper visible in frame. Lighting: late afternoon stadium flood, warm directional light on the pitch. Audio: the contact of boot on ball — a clean, solid thwack — the keeper diving and landing on grass, the net rustling as the ball hits it, the delayed crowd reaction building. Aspect: 16:9. Duration: 6 seconds.
40. Surfing a Wave
Medium wide shot of a surfer dropping into a steep overhead wave, driving a hard bottom turn, then projecting up through the face for a top turn that sends spray back over the lip. Camera: water-level position to the side of the wave, 50mm, moving with the surfer's line. Lighting: mid-morning sun from behind camera, backlit spray catching light, green-blue face of the wave translucent in the sun. Audio: the rush of moving water, the sharp crack of the board rail on the top turn, white water foaming behind, the hollow sound inside the wave's curl. Aspect: 16:9. Duration: 8 seconds.
41. Slow-Motion Water Splash
Extreme close-up of a smooth stone dropped into a still clear-water pool. Physics-correct fluid dynamics: the entry column forms and collapses, the crown splash rises symmetrically, droplets follow ballistic trajectories before falling back. Visible surface tension in the initial entry. Camera: locked-off at water surface level, 100mm macro, camera slightly tilted up at impact zone. Lighting: single backlight through white diffusion panel beneath the pool, creating translucent glowing water. Audio: the single soft plunk of entry, then slow-motion audio if possible — the low, stretched rumble of the splash. Aspect: 1:1. Duration: 6 seconds at high-speed frame rate.
42. Dropping Object Physics
Medium shot of a ceramic bowl falling from a kitchen counter edge — tipping point, brief weightlessness, impact on tile floor, and the bowl shattering into 5-8 large pieces radiating outward. Physics-correct: the bowl deforms slightly on contact before fracturing along stress lines. Camera: locked-off at near-floor level, 50mm, bowl dropping from upper frame. Lighting: warm kitchen overhead, practical key, even fill. Audio: the tap of the bowl on the tile edge before it goes over, the fraction-of-a-second silence of the fall, the sharp ceramic crack of impact, the pieces sliding and spinning to rest on the floor. Aspect: 16:9. Duration: 5 seconds.
43. Particle and Dust Motion
Close-up of a leather-bound book being opened on a wooden desk. As the pages fan apart, a visible cloud of fine dust particles catches a shaft of afternoon sunlight from a window at frame left. The particles follow correct Brownian motion — drifting upward and outward, slowing as they disperse, responding subtly to the air movement from the turning pages. Camera: locked-off, 85mm telephoto, shallow depth of field, book sharp, background of more books soft. Lighting: single shaft of warm afternoon sunlight as the only key; everything else in shadow. Audio: the dry creak of old leather, pages fanning with a papery whisper, the complete quiet of a study room. Aspect: 16:9. Duration: 8 seconds.
Abstract & Conceptual Prompts (44–50)
44. Abstract Liquid Motion
Extreme close-up of two liquids meeting and mixing in a glass tank — a deep cobalt blue oil and a transparent mineral solution. The oil moves in slow, deliberate tendrils through the water, never fully mixing. Physics-correct density separation: the oil maintains its form as it travels. Camera: locked-off at tank side, 100mm macro, full depth field. Lighting: single backlight from behind the tank on a white panel — the liquids are luminous. Audio: faint underwater gurgling, the ambient hum of a recording studio — total quiet that emphasizes the visual. Color grade: pure cobalt, pure clear, no color correction needed. Aspect: 1:1. Duration: 10 seconds.
45. Geometric Kaleidoscope
Overhead locked-off shot of a flat mirror surface onto which colored geometric tiles are slowly arranged by unseen hands — triangles, hexagons, squares — in expanding symmetrical patterns. New tiles slide in from the edges. Camera: directly overhead, 50mm equivalent, fixed position. Lighting: even overhead diffused white light, no shadows, maximum color fidelity. Audio: the soft click and slide of tiles on glass, no other sound, minimal reverb — the sound of deliberate construction. Color grade: primary colors plus jewel tones, white background. Aspect: 1:1. Duration: 12 seconds.
46. Particle Simulation
Wide shot of a simulation-style visualization: thousands of fine gold particles in a dark void, initially random, then coalescing under invisible force into the form of a sphere, holding for two seconds, then dispersing again. Physics-correct attraction curves — particles accelerate on approach, decelerate slightly as the form nears completion. Camera: locked-off in 3D space, slightly below center, looking slightly up at the forming sphere, 50mm equivalent. Lighting: particles are self-luminous — warm gold on black, no environmental light. Audio: a growing, resonant low frequency tone as particles converge — not musical, physical — then silence and the soft rush of dispersal. Aspect: 16:9. Duration: 12 seconds.
47. Dreamlike Morph
Close-up of a red rose at peak bloom. Over 8 seconds, the petals slowly transform — maintaining their volume and surface texture throughout — into the surface of a slowly spinning planet, the deep reds of the petals becoming the rust tones of a Mars-like surface. The transformation is seamless, no cuts, a continuous physical metamorphosis. Camera: locked-off, 85mm macro equivalent, subject filling the frame. Lighting: begins with warm studio key on the rose, transitions to the single hard directional light of a distant star on the planet. Audio: silence and ambient room tone throughout, just faint natural sounds of movement. Aspect: 1:1. Duration: 10 seconds.
48. Color-Field Motion
Abstract wide shot of a smooth surface — perhaps fabric, perhaps liquid, perhaps paint — slowly shifting through a gradient from deep ultramarine to warm magenta to gold. The surface has subtle physical texture that catches light as the colors move. No objects. No narrative. Pure color and texture in motion. Camera: locked-off directly overhead, 50mm equivalent. Lighting: large format diffused overhead source, even, designed to show surface texture without creating hot spots. Audio: a single sustained, warm low tone — like a cello harmonic — held for the full duration. Aspect: 16:9. Duration: 12 seconds.
49. Slow-Motion Pour
Extreme close-up of golden honey being poured from a wooden spoon onto a dark marble surface. The pour is slow, the honey folding onto itself in a coiling ribbon with accurate viscosity — the liquid maintains surface tension, folds rather than splashes, the coil gradually builds. Camera: locked-off at surface level and close, 100mm macro, honey filling the frame. Lighting: single backlight through honey creating amber translucency — the liquid glows from within. Audio: the near-silence of slow honey, a faint viscous folding sound, the tap of the spoon handle on the jar rim. Color grade: deep amber, black marble, high contrast. Aspect: 1:1. Duration: 8 seconds.
50. Time-Lapse Style
Wide locked-off shot of a city intersection from a high building — same framing, dawn to dusk compressed. The light transitions from grey pre-dawn, through golden morning, flat midday, golden late afternoon, and blue-hour dusk. Traffic, pedestrians, and shadows are all in motion. Camera: locked-off rooftop position, 28mm wide, minimal depth, everything in focus. Lighting: natural sky progression is the entire lighting design — no artificial intervention. Audio: the ambient city below, rising from near-silent pre-dawn to midday noise and back to quieter evening — compressed into a continuous texture. Aspect: 16:9. Duration: 12 seconds.
Sora 2 Power Tips
Describe action with verbs, not adjectives. "A woman walks briskly toward the camera" produces far better results than "a woman looking confident." Sora 2 executes movement instructions — it doesn't infer emotion from adjectives.
Name your camera move explicitly. "Cinematic" is not a camera instruction. "Slow dolly-in from chest to close-up over 6 seconds" is. The model responds to specific camera language: dolly, push-in, pull-back, pan-left, orbit, handheld, locked-off.
Write the audio layer like a sound designer. Sora 2 generates audio natively — it's not post-production. Describe what the world sounds like: ambient environment, specific foley events (a cup set down, a door hinge), and dialogue intent. "A soft espresso machine hiss" is actionable. "Good audio" is not.
State aspect ratio and duration at the top of the prompt. These are structural constraints that shape everything downstream — composition, pacing, motion design. Specify them before you describe the scene, not as an afterthought.
For multi-shot sequences, anchor what stays consistent. Describe the matching element explicitly: same lens, same lighting source, same character description word-for-word. The model uses these anchors to maintain continuity across cuts. If the subject is "a woman in a navy blazer," use that exact phrase in every shot description.
Physics-correct prompts outperform physically impossible ones. Sora 2's physics simulation is its genuine strength — real-world causality, fluid dynamics, material behavior. Lean into it. Describe the honey folding onto itself with correct viscosity, the wave breaking with the right fluid dynamics, the ceramic shattering along stress lines. Prompts that fight physics produce worse results than prompts that direct it.
Make a video of someone walking down a street at night.
Medium shot of a woman in her 30s walking with purpose down a rain-slicked city street at 2 a.m. Camera: lateral tracking shot at eye level, handheld with slight movement, 50mm lens. Lighting: neon signage as practical key — pink and cyan pools on wet asphalt, deep shadow between them. Audio: light rain on pavement, her heel-clicks on the wet sidewalk, a distant car passing, the hum of a neon sign she passes close to. Color grade: teal-orange with deep blacks. Aspect: 16:9. Duration: 8 seconds.
Start Building with These Prompts
These 50 templates are ready to paste into Sora 2's interface as-is, or adapt by swapping the subject, environment, or camera choice. The structure does the work — the model responds to specificity, and every prompt here is specific.
For a systematic breakdown of why each element works and how to build this structure from scratch, read the complete Sora 2 prompting guide. For a broader framework covering narrative structure, multi-model workflows, and AI video production pipelines, the AI video prompting complete guide goes deeper.
If you want prompts built to your exact brief rather than adapting a template, the AI prompt generator can construct a structured Sora 2 brief from a plain English description of what you're trying to make.
Wondering how Sora 2 stacks up against Veo 3 and Runway for different types of work? The Veo 3 vs Sora 2 vs Runway comparison breaks down which model wins for which use case.