Sora 2 product videos that fail look like AI hallucinated them — warped logos, objects that drift, lighting that makes no physical sense. The ones that work look like a Tier-2 production studio spent a half-day on them, and the difference almost always comes down to three things: specific camera direction, motivated audio, and platform-aware aspect ratios baked into the prompt itself. These 30 templates give you all three.
What a Working Product Video Prompt Looks Like
Most people start a Sora 2 prompt by describing the product. That's the right instinct, but description alone isn't enough. Sora 2 responds well to director-style language — you're not writing a caption, you're briefing a cinematographer. The model understands camera vocabulary, lighting terminology, and audio direction, and using that language consistently produces far more usable footage.
Start with the product, not the scene. Name the product, its material, finish, and color before you describe anything else. "A matte-black aluminum water bottle with a brushed-steel lid" gives the model something specific to render and preserve. Vague references like "a water bottle" invite reinterpretation. If you're working from a reference image, close the loop explicitly with a preservation cue: "the product should look exactly as photographed — no AI reinterpretation of the design."
Specify a camera move that flatters the product's shape. A slow 360-degree orbit works well for products with interesting silhouettes — perfume bottles, sneakers, packaged goods. A controlled push-in to a 10cm close-up earns its place when the detail is the selling point — texture, embossing, a fine weave. Top-down reveals suit flat-lay products: skincare, stationery, food. A locked-off shot with subtle product rotation works when you want the background to do the storytelling. Name the move and add a speed cue ("slow," "gentle," "crisp") so the model calibrates the pacing.
Describe lighting as if you're building the set. Soft top-light with a white reflector below is a workhorse setup for consumer packaged goods — it wraps the product in even light without harsh shadows. Hard rim lighting on a dark background reads as premium and editorial. Golden-hour outdoor light suits lifestyle products and apparel. Studio key-and-fill is neutral and clean for technical products. The more specific you are, the less likely Sora 2 will default to a generic, flat render that looks like a stock image.
Audio direction is the multiplier. A product video without motivated audio is a GIF with ambitions. Sora 2 supports native audio generation, and telling it exactly what to produce — the fizz of a carbonated drink being poured, the crisp click of a compact closing, the rustle of tissue paper in a gift box — is what separates footage that feels produced from footage that just looks like it was rendered. Specify both ambient background sound (coffee shop hum, outdoor birdsong, studio silence) and foreground foley tied to the product action.
Match aspect ratio to the platform before you render. Rendering everything at 16:9 and then cropping for TikTok or Reels is a waste of a generation — and the crop almost always cuts the product awkwardly. Build the platform target into the prompt from the start: 9:16 for TikTok, Reels, and YouTube Shorts; 1:1 for Instagram feed; 4:5 for Instagram portrait (performs better than square in the feed); 16:9 for YouTube and product-page embeds. Read more in the Sora 2 prompts guide and AI prompts for ecommerce.
Product Hero Shot Prompts (1–5)
1. Clean White-Background Hero With Rotation
A slow 360-degree orbit of a [matte-white ceramic candle jar with a
gold foil logo and a cream cotton wick] centered on a seamless white
surface. Camera: smooth clockwise orbit at eye level, completing one
full rotation over the clip duration. Lighting: soft top-light with
a white fill card below, no shadows, high-key studio setup. Audio:
near-silent studio ambience, faint hiss of the wick beginning to
catch flame at the 4-second mark (foley). Aspect: 1:1. Duration:
8 seconds. The product should look exactly as photographed — no AI
reinterpretation of the design or label artwork.
2. Gradient Background Product Orbit
A [dark-green glass perfume bottle with a faceted stopper and a
minimal black label] orbiting slowly against a gradient background
that transitions from deep navy at the base to soft rose at the top.
Camera: counter-clockwise orbit at 25 degrees above product level,
one-and-a-quarter rotations. Lighting: hard rim light from camera
right with a subtle warm fill, creating edge highlights on the glass
facets. Audio: refined studio silence with a low, resonant ambient
tone. Aspect: 4:5. Duration: 10 seconds. The product should look
exactly as photographed — preserve the label typography precisely.
3. Top-Down Flat-Lay Reveal
A top-down reveal of a [matte rose-gold skincare set — cleanser,
serum, and moisturizer in matching slim aluminum tubes] arranged on
a white marble surface. Camera: locked overhead shot, products are
placed one by one into frame from outside the shot edge, each
settling with a soft tap. Lighting: soft diffused daylight from a
north-facing window, gentle shadow detail. Audio: ambient kitchen
silence, clean crisp tap of each aluminum tube landing on marble
surface (foley). Aspect: 1:1. Duration: 9 seconds.
4. Dramatic Studio Rim Light
A [matte-black stainless-steel insulated flask with an embossed
mountain logo] on a polished black surface, backlit with hard studio
rim lighting. Camera: locked-off front-facing shot, product rotates
10 degrees left then 10 degrees right, pausing at center. Lighting:
twin hard rim lights at 45 degrees behind product left and right,
creating bright edge highlights against a near-black background, no
front fill. Audio: deep studio silence, single resonant metallic ring
as the flask completes a gentle tap on the surface (foley).
Aspect: 9:16. Duration: 6 seconds. The product should look exactly
as photographed — preserve the embossed logo detail.
5. Slow Dolly-In Detail Close-Up
A slow push-in to an extreme close-up of [a hand-stitched tan leather
wallet, crocodile-embossed, with brass corner hardware] resting on a
dark walnut surface. Camera: slow dolly-in from a medium shot
(full wallet visible) to a tight close-up on the stitching and
emboss texture, movement taking the full duration. Lighting: warm
studio key light from camera left at 30 degrees, casting soft shadow
that reveals the emboss texture; no fill. Audio: quiet leather-shop
ambience, subtle leather creak as the wallet rests and settles
(foley). Aspect: 16:9. Duration: 8 seconds.
Lifestyle Context Video Prompts (6–10)
6. Kitchen-Counter Context
A [stainless-steel pour-over coffee kettle with a gooseneck spout
and a bamboo handle] sitting on a marble kitchen counter in a
bright, minimal Scandinavian kitchen. Camera: slow dolly-in from a
medium establishing shot to a tighter frame showing the kettle and
steam curling from the spout. Lighting: soft morning daylight from
a window at camera left, warm 5,000K color temperature. Audio:
quiet kitchen ambience, coffee drip and gentle steam hiss, water
trickling into a ceramic cup (foley). Aspect: 9:16. Duration:
10 seconds.
7. Outdoor Lifestyle Context
A [slim ultralight running vest in deep forest-green ripstop nylon
with reflective trim] on a runner pausing on a forest trail at
golden hour. Camera: slow rack-focus from the blurred trail
background to the vest in sharp detail, then a gentle push-in to
show the chest pocket and zipper. Lighting: golden-hour backlight
creating a warm rim around the runner, dappled forest light on
the trail. Audio: birds and light breeze in pine trees, fabric
rustle as the runner shifts weight (foley). Aspect: 9:16.
Duration: 8 seconds.
8. On-Shelf Retail Context
A [brightly branded cereal box in cobalt blue with bold yellow
typography] facing forward on a clean grocery store shelf,
surrounded by competing products that are soft-focused. Camera:
slow push-in from a wide shelf view to a tight hero shot of the
product, finishing on the brand logo. Lighting: even supermarket
overhead fluorescent with a subtle warm key added to the hero
product. Audio: distant grocery store ambient hum, soft product
placement tap as the box is straightened on the shelf (foley).
Aspect: 4:5. Duration: 7 seconds. The product should look exactly
as photographed — preserve all label text and brand colors.
9. Bathroom Morning Routine
A [white frosted-glass serum bottle with a rose-gold dropper top]
on a clean white bathroom shelf during a morning routine. Camera:
locked-off medium shot of the shelf; a hand enters frame, picks
up the bottle, and dispenses two drops of serum, then sets it
back down. Lighting: soft diffused bathroom vanity light, clean
and neutral with no harsh shadows. Audio: bathroom morning
ambience — light tap of water, hum of ventilation — plus the
quiet pop and hiss of the dropper dispensing serum (foley).
Aspect: 9:16. Duration: 9 seconds.
10. Desk and Work Setup Context
A [slim wireless mechanical keyboard in off-white with PBT
keycaps and a USB-C charging port] on a clean oak desk beside
a monitor showing a blurred code editor. Camera: wide establishing
shot of the full desk, then a gentle push-in to center on the
keyboard. Lighting: soft indirect window light from the left,
supplemented by a warm desk lamp on the right. Audio: quiet
home-office ambience, satisfying crisp tactile key clicks as
fingers type a few keystrokes (foley). Aspect: 16:9.
Duration: 8 seconds.
Demo & In-Use Prompts (11–15)
11. Beauty Product Application Close-Up
An extreme close-up of [a matte-finish concealer wand in a neutral
champagne shade] being drawn across clean skin in a smooth single
stroke. Camera: locked-off macro, product enters frame from the top
and applies in a deliberate motion, blending at the end. Lighting:
beauty ring light from front, even and flattering with no shadows.
Audio: near-silent with a faint soft swatch sound as the concealer
meets skin (foley). Aspect: 1:1. Duration: 5 seconds.
12. Food Product Being Prepared
A [bottle of cold-pressed yuzu hot sauce in bright amber glass with
a kraft-paper label] being tipped to pour a thin stream of sauce
over a taco on a dark slate surface. Camera: side-angle locked-off
shot at counter height, focus on the pour and the sauce landing on
the food. Lighting: warm tungsten studio light from camera right,
creating appetizing highlight on the sauce stream. Audio: restaurant
kitchen ambience, the satisfying liquid pour and gentle sizzle of
the sauce on the food (foley). Aspect: 9:16. Duration: 6 seconds.
The product should look exactly as photographed — preserve all
label text and the amber glass color.
13. Tech Gadget in Use
A [compact gunmetal-gray wireless earbuds case with a matte finish
and an LED indicator] being picked up and opened by one hand, buds
removed and placed into ears. Camera: over-the-shoulder tracking
shot following the hand motion, ending on a tight shot of the open
case showing the buds inside. Lighting: cool 6,500K studio key
light, clean and tech-forward. Audio: quiet room tone, satisfying
magnetic snap of the case lid opening, the soft click of earbuds
lifting from their magnets (foley). Aspect: 9:16. Duration:
8 seconds.
14. Apparel on Model in Motion
A [structured oversized blazer in camel wool with notched lapels
and a single front button] on a model walking through a sunlit
urban courtyard. Camera: tracking shot from front at medium
distance, model walks toward camera, blazer falling naturally with
each stride; ends on a confident stop and direct look to camera.
Lighting: golden-hour afternoon sun from camera left, natural
urban fill from the courtyard walls. Audio: city ambient hum,
confident footsteps on stone, clean fabric movement as the blazer
swings with each step (foley). Aspect: 9:16. Duration: 10 seconds.
15. Kitchen Tool in Action
A [ceramic-coated non-stick skillet in deep navy with a riveted
stainless-steel handle] on a gas burner, being used to sear
vegetables with a clean flip. Camera: front-side locked-off shot
at burner height, showing the full skillet and the cook's hands;
push-in to a tighter shot as the flip happens. Lighting: warm
practical tungsten kitchen light, steam and heat haze adding depth.
Audio: kitchen cooking ambience — burner hiss, extractor fan in
background — followed by the sharp sizzle of vegetables hitting the
hot ceramic surface and the clean pan-scrape of the flip (foley).
Aspect: 16:9. Duration: 8 seconds.
Unboxing & Packaging Prompts (16–20)
16. Branded Box Unboxing
A [premium matte-black rigid gift box with a debossed white logo
and a white satin ribbon] being opened slowly on a clean white
surface. Camera: top-down locked-off overhead shot, hands enter
from both sides and lift the lid in a single smooth motion,
revealing [product in white tissue paper] inside. Lighting: soft
overhead diffused studio light, clean and shadow-free. Audio:
quiet room tone, satisfying whisper of the ribbon sliding, the
pop of the lid releasing from the base, and the rustle of tissue
paper inside (foley). Aspect: 1:1. Duration: 8 seconds. The
product should look exactly as photographed — preserve the
debossed logo detail precisely.
17. Gift-Wrap Reveal
A [glossy hunter-green gift-wrapped box with a gold wired ribbon
and a bow] being unwrapped on a rustic wooden table at Christmas.
Camera: 45-degree angle medium shot, hands pull the ribbon ends
and the bow releases, then the wrapping paper is folded back in
one deliberate motion. Lighting: warm tungsten room light,
practical candle glow in the background. Audio: cozy living-room
ambience — crackling fireplace in the background — the ribbon
swish and crisp paper crinkle as it unfolds (foley). Aspect: 4:5.
Duration: 10 seconds.
18. Tearing Tamper Seal
A close-up of a hand pulling the tamper-evident seal strip from
[a white high-gloss supplement bottle with a minimalist navy label].
Camera: macro locked-off shot, the hand enters from the right and
pulls the strip in a smooth continuous pull. Lighting: bright even
studio overhead, sharp product focus. Audio: near-silent with the
crisp tearing sound of the perforated seal strip as it releases
cleanly (foley). Aspect: 9:16. Duration: 5 seconds. The product
should look exactly as photographed — preserve the label design.
19. Drawer-Style Packaging Unboxing
A [slide-out drawer-style box in warm tan kraft board with a black
fabric pull-tab and a printed logo] being opened on a light
concrete surface. Camera: front-angle medium shot, the inner
drawer is pulled out with the fabric tab in a slow, smooth motion
revealing a product nested in a molded pulp insert. Lighting:
natural diffused daylight from a skylight, neutral and editorial.
Audio: ambient lifestyle silence, the smooth whisper of the
cardboard drawer sliding out on its track, followed by a soft
crinkle of the paper insert (foley). Aspect: 9:16. Duration:
7 seconds.
20. Retail Plastic-Bag Unboxing
A [resealable kraft-paper and clear-window retail pouch containing
[artisan chocolate bars]] being opened on a wooden café table.
Camera: 45-degree angle close-up, hands grip the reseal tab and
tear it open in a single pull, folding back the top to reveal
the contents. Lighting: warm café window light, natural and
slightly warm. Audio: café ambient background — faint coffee
machine — the crisp tearing of the reseal adhesive and the rustle
of the kraft paper as the bag is folded open (foley). Aspect: 4:5.
Duration: 6 seconds.
Before-and-After Transformation Prompts (21–25)
21. Skincare Before-and-After Split
A side-by-side split-screen video: left panel shows dry, uneven
skin texture under a close-up beauty light; right panel shows the
same skin zone looking hydrated, even, and smooth after [the hero
moisturizer] has been applied. Camera: both panels use the same
locked-off macro angle, identical framing. Lighting: same beauty
ring light in both panels for consistency. Audio: soft ambient
tone, subtle skin-care product dispense sound on the right panel
at the transition (foley). Aspect: 16:9. Duration: 8 seconds.
Note: this is a conceptual demonstration — individual results vary.
22. Organization Before-and-After
A split-screen showing a cluttered desk drawer on the left versus
the same drawer neatly organized with [a modular bamboo drawer
organizer set] on the right. Camera: top-down locked-off overhead
shot, identical framing both sides. A clean wipe transition moves
from left to right at the midpoint. Lighting: even overhead studio
light, clean and product-focused. Audio: quiet ambience, a
satisfying wooden placement tap as the organizer is set in
position (foley). Aspect: 1:1. Duration: 7 seconds.
23. Hair Styling Transformation
A close-up split-screen: left panel shows unstyled, frizzy hair
before application; right panel shows the same hair smooth, glossy,
and shaped after [a thermal repair hair serum] is applied and
worked through with a wide-tooth comb. Camera: locked-off macro
from a 30-degree angle, same framing both panels. A center-wipe
transition reveals the right panel at the midpoint. Lighting:
beauty top-light with a subtle back-rim to show hair texture and
sheen. Audio: calm beauty-shoot ambience, the soft dispense of
the serum pump and gentle comb-through sounds (foley). Aspect:
9:16. Duration: 9 seconds.
24. Cleaning Product Action Shot
A side-by-side comparison: left panel shows a grimy kitchen tile
surface; right panel shows the same tile gleaming after [a
citrus-scented multi-surface cleaning spray] is applied and wiped.
Camera: locked-off medium close-up showing a 30cm section of tile
surface in both panels. The right panel shows the spray application
and a single wipe pass. Lighting: bright overhead utility light
that shows the surface shine clearly. Audio: ambient kitchen
silence, the pressurized hiss of the spray trigger and the
clean squeaking wipe of the cloth on tile (foley). Aspect: 16:9.
Duration: 8 seconds.
25. Fitness Product Result
A before-and-after split-screen concept showing a workout mat
space: left panel shows the space empty and neutral; right panel
shows the same space with [an anti-fatigue thick-foam exercise
mat in slate gray with alignment guides] rolled out and in use.
Camera: wide-angle locked-off shot of the full workout corner,
same framing both panels, wipe transition at center. Lighting:
bright natural gym light. Audio: quiet room tone, clean slap of
the mat unrolling and settling flat on the floor (foley).
Aspect: 9:16. Duration: 7 seconds. Note: this is a conceptual
staging demonstration, not a fitness result claim.
DTC Ad Sequence Prompts (26–30)
26. Three-Shot Hook + Demo + CTA Sequence
A three-shot ad sequence for [a compact stainless-steel manual
coffee grinder in brushed silver]. Shot 1 (Hook, 3 seconds):
extreme close-up of ground coffee pouring from the grinder into
a cup, slow-motion — no text, no branding, pure product sensory
moment. Shot 2 (Demo, 6 seconds): medium shot of the grinder
being used at a kitchen counter, hand turning the burr handle
with smooth circular motion. Shot 3 (CTA, 4 seconds): locked-off
beauty shot of the grinder on a marble counter, product name
appearing as minimal text overlay. Camera: each shot is a
distinct cut with confident framing. Lighting: warm morning
kitchen light throughout for visual consistency. Audio: coffee
grind and pour throughout; foley: burr crunch of grinding beans,
crisp pour of grounds (foley). Aspect: 9:16. Total duration:
13 seconds.
27. Problem-Solution-Product Format
A three-act ad sequence. Act 1 (Problem, 4 seconds): close-up
of a tangled, overflowing cable drawer — relatable chaos, no
product visible. Act 2 (Solution, 5 seconds): hands quickly
organizing cables using [a set of magnetic cable clips in matte
black] — click-in-place motion, satisfaction visible. Act 3
(Product, 4 seconds): clean top-down beauty shot of the
organized cables with the clips in place, product clearly visible.
Camera: each act is a distinct cut. Lighting: warm home-office
light throughout. Audio: cable-drawer rustle in Act 1; satisfying
magnetic click of each cable clip in Act 2 (foley); calm
organizational silence in Act 3. Aspect: 9:16. Duration:
13 seconds.
28. Founder Talking-Head Plus Product Cutaway
A founder-style DTC ad: talking-head footage of a person in a
clean, simple home-studio setup speaking directly to camera for
6 seconds, then a hard cut to a 6-second beauty close-up of
[a clean-label daily probiotic in a white glass bottle with a
minimal sans-serif label], ending with a 3-second wide shot of
both the founder and product on a desk. The talking-head footage
should look authentic and shot-on-phone, not produced. Camera:
talking-head in a slight off-center frame; product cutaway uses
a slow push-in close-up. Lighting: talking-head uses natural
window light; product cutaway uses a soft studio setup.
Audio: natural room tone with clear voice in talking-head
sections; ambient soft music bed under product cutaway.
Aspect: 9:16. Duration: 15 seconds.
29. UGC-Style First-Person Product Use
A first-person POV video of someone using [a refillable stainless-
steel deodorant applicator in matte forest green] in a morning
bathroom routine. The camera (POV) looks down at a bathroom sink
counter; a hand picks up the deodorant, twists the base to advance
the stick, and applies it. The footage should look like authentic
UGC — slight handheld movement, natural lighting, no
professional production feel. Camera: handheld first-person POV,
gentle natural sway. Lighting: bathroom vanity light, warm and
slightly imperfect. Audio: morning bathroom ambient — water
dripping, fan hum — and a soft twist-click of the deodorant
mechanism advancing (foley). Aspect: 9:16. Duration:
10 seconds.
30. Multi-Product Family Campaign Shot
A campaign hero video for a [three-product skincare line —
cleanser, toner, and moisturizer in matching frosted white glass
bottles with gold caps]. All three products are arranged on a
wet marble surface with water droplets. Camera: a slow side-to-
side pan revealing all three products left to right, then pulling
back to a wide establishing shot of the full trio. Lighting:
soft overhead diffused studio light with a subtle warm fill from
below, creating a dewy, luminous atmosphere. Audio: spa-like
ambient silence, light water droplet sounds as the marble surface
glistens (foley). Aspect: 16:9. Duration: 10 seconds. The
products should look exactly as photographed — preserve all
label typography and the gold cap finish precisely.
Product Video Power Tips
Match aspect ratio to platform before you render. Generate 9:16 for TikTok, Reels, and YouTube Shorts; 1:1 for Instagram feed; 4:5 for Instagram portrait feed (this outperforms square in most campaign tests); 16:9 for YouTube pre-roll and product-page video embeds. Cropping after the fact almost always cuts the product awkwardly — build the target platform into the prompt from the start.
Choose a camera move that flatters the product's specific shape. A 360-degree orbit works for products with interesting silhouettes. A push-in to an extreme close-up earns its place when the detail is the selling point — texture, embossing, material quality. A locked-off shot with subtle product rotation works when the background context is the story. Name the move and add a speed cue so the model calibrates pacing.
Audio direction is the multiplier. The difference between footage that looks produced and footage that just looks rendered is motivated audio. Specify foley that matches the product action: the fizz of carbonation, the click of a magnetic closure, fabric rustle, a clean glass tap, the crisp tear of a seal. Sora 2 supports native audio — use it.
Use the preservation cue when your product has a specific design. Adding "the product should look exactly as photographed — no AI reinterpretation of the design" tightens fidelity considerably when logo art, label typography, or a specific colorway must survive the render. Use it on at least the first generation of any prompt; iterate from there.
Calibrate duration to context. For paid ads, 6–15 seconds is the practical range — long enough for a hook-demo-CTA structure, short enough to be used across platforms without trimming. For product-page hero videos, 3–6 seconds works: a single beauty shot with camera motion, no narrative arc needed.
Iterate the same scene at multiple aspect ratios to build a multi-channel campaign. Run the same core prompt three times — once at 9:16, once at 1:1, once at 4:5 — adjusting the camera framing description to match each format. You'll have a full set of platform-native assets from one brief rather than three.
Create a product video for my skincare serum.
A slow push-in from a medium shot to a tight close-up of a [white frosted-glass serum bottle with a rose-gold dropper top] on a wet white marble surface. Camera: controlled dolly-in over 8 seconds, ending on a macro of the dropper tip with one drop forming and falling. Lighting: soft diffused daylight from the left, warm catchlight on the glass. Audio: spa ambient silence, the quiet pop and hiss of the dropper dispensing two drops (foley). Aspect: 9:16. Duration: 8 seconds. The product should look exactly as photographed — no AI reinterpretation of the design.
Start Building Better Product Prompts
These 30 templates work because they give Sora 2 what it needs to produce footage that looks intentional: a named camera move, a specific lighting setup, motivated audio, a clear duration, and a platform-matched aspect ratio. Remove any one of those elements and the output regresses toward generic.
One honest note: AI-generated video may not meet the authenticity policies of certain marketplaces and retail platforms for product listing images and videos. These prompts are built for owned channels, ad accounts, DTC product pages, and concept testing — not for submissions to marketplaces with strict authenticity requirements. Check platform policies before deploying.
The AI prompt generator can build customized Sora 2 briefs from a plain-English description of your product and target platform. For more video prompt templates and techniques, see the best Sora 2 prompts guide, the Sora 2 prompts deep-dive, and AI prompts for ecommerce.