差:a woman
还行:a young woman with brown hair
最好:a woman in her late 20s, tight dark curls at ear length, small silver hoop in left ear, wearing a fitted black turtleneck, neutral expression
差:she looks happy and is enjoying the sunset
好:she slowly turns toward the camera, breeze lifting the hem of her skirt, eyes narrowing against the light
前者给了模型一张照片让它去凑,后者给了一个序列让它执行,差距非常大。
一条几乎没人遵守的规则:主体运动和镜头运动永远分开写。
“spinning camera around a dancing person” 是一条指令,模型根本不知道谁该转。改成 “the dancer spins slowly, camera holds fixed framing”,歧义拆成两条清晰指令,大部分抖动和混乱输出都会消失——这些毛病很多人一直怪模型,其实是提示词的锅。
[0-4s]: wide establishing shot, static camera, misty bamboo forest at dawn, golden hour light filtering through leaves
[4-9s]: medium shot, slow push-in, the fighter steps forward, white silk kimono billowing, determined expression
[9-15s]: close-up, orbit shot, the fighter strikes, slow motion, impact visible in the fabric ripple
格式 B(圆括号):
(0-3s) macro shot of perfume bottle among pink flowers, shallow depth of field, petals floating
(3-7s) camera glides closer, a feminine hand enters frame, touches the bottle
(7-12s) slow-motion spray, mist diffuses in air, particles catching rim light
(12-15s) pull-out to hero frame, product centered, volumetric lighting, minimal background
15 seconds UGC style review video, filmed on smartphone, natural bedroom
window lighting, casual handheld selfie angle, a young woman with brown
hair pulled back, natural skin with visible texture, wearing a casual grey
t-shirt, in her cozy bedroom, she holds a product up to the camera with
genuine excitement, quick jump cut slightly closer angle, she applies it
showing the texture, jump cut she leans into the camera with a natural
smile, the lighting is soft natural daylight no ring light no filters,
direct phone mic audio room ambience natural voice
2. 产品英雄片
ultra cinematic 15-second luxury product commercial, smooth continuous
sequence elegant pacing, fluid cinematic glide macro dolly plus soft
orbit plus gentle push-ins, seamless transitions masked by depth blur and
motion continuity no hard cuts everything flows organically,
(0-3s) macro shot of product on dark surface shallow depth of field rim light catching edges,
(3-7s) camera glides closer warm light rakes across surface revealing texture,
(7-11s) slow motion detail moment volumetric lighting,
(11-15s) pull-out to centered hero frame product isolated premium
minimalist background, sharp clarity no jitter stable picture
3. 电影感场景
cinematic film tone 35mm warm golden hour lighting, a man in his 40s
with weathered features sits at a wooden desk in a sun-drenched workshop
carefully carving walnut wood, slow push-in from medium shot to close-up
on his hands, dust motes float in the light beams from the window,
shallow depth of field background softly blurred, earthy color palette,
quiet ambient sound of wood shavings, avoid jitter avoid bent limbs,
stable picture no temporal flicker
4. 动作序列(带时间码)
high-intensity cinematic fight in a misty bamboo forest 15 seconds photorealistic,
[0-4s]: wide establishing shot static camera mist rolling between bamboo stalks golden hour light two fighters face each other,
[4-8s]: medium tracking shot the fighter in white lunges forward with a spinning strike fluid orbital tracking follows the motion,
[8-12s]: low-angle power shot impact moment slow motion bamboo leaves scatter,
[12-15s]: pull-out wide shot the fighter in white stands victorious rim light separating figure from mist, film grain anamorphic texture, avoid bent limbs maintain face consistency
5. 完整多模态制作
• @Image1 角色参考(保持面部和服装)
• @Image2 环境参考(匹配灯光和色调)
• @Video1 镜头运动参考(复刻慢速环绕)
• @Audio1 背景音乐(转场跟节拍对齐)
15-second cinematic sequence 16:9 2K resolution, character from @Image1
walks through the environment from @Image2, camera performs slow orbit
matching @Video1's motion arc, scene transitions align with beat positions
of @Audio1, golden hour rim lighting shallow depth of field, maintain
character identity across all frames, avoid identity drift avoid jitter
avoid temporal flicker, sharp clarity stable picture