Prompting Guide
Note: Special thanks to contrinsan for providing this writeup!
Overview
The key to prompting is to be specific without going over the token limit - include precise details about the scene, subject, style, and camera movements to ensure the AI accurately interprets your vision. Use the following guidelines to help compose a clear and coherent prompt that will get the results you want.
- Subject: Clearly define the main character or object.
- Scene/Environment: Describe the setting, including location, time of day, or weather.
- Action: Specify what the subject is doing, using dynamic verbs for motion.
- Style: Indicate the visual style (e.g., cinematic, anime, photorealistic).
- Atmosphere/Mood: Convey the emotional tone (e.g., serene, dramatic, eerie).
- Camera Movements: Include specific camera instructions (e.g., zoom in, pan left, tracking shot).
- Lighting: Describe lighting conditions (e.g., soft sunlight, neon glow).
- Shot Size: Specify framing (e.g., wide shot, close-up, medium shot).
- Verbs: Employ dynamic verbs like "running," "zooming," or "tilting" to guide the model in creating motion.
- Incorporate Metadata: Add tags like "hdr," "360-degree," or "fisheye" for specific image types.
- Model-Specific Considerations: Tailor prompts to the strengths of the model. For Hunyuan, use highly descriptive prompts with cinematic terminology (e.g., "wide-angle view," "lens flare") and detailed environmental cues like weather or time of day.
- Prompt Length: Aim for 60–100 words to provide sufficient context without overwhelming the model.
- Logical Sequence: Organize the prompt to paint a clear picture, starting with the subject and scene, followed by actions, camera work, and stylistic details.
- Avoid Overloading: Balance creativity with clarity to prevent confusing the AI.
Example
Prompt
A lone samurai in traditional armor stands on a misty cliffside overlooking a lush valley at dawn.
He unsheathes his katana, performing a slow, deliberate sword dance.
The scene is cinematic, with a serene yet intense atmosphere.
The camera starts with a wide-angle shot, slowly zooming in to a medium shot of the samurai’s focused expression.
Soft golden sunlight filters through the mist, casting gentle shadows.
The video is high-definition, emphasizing realistic textures and fluid motion.
Explanation
- Subject: "A lone samurai in traditional armor" clearly defines the main character.
- Scene/Environment: "Stands on a misty cliffside overlooking a lush valley at dawn" sets a vivid location and time of day.
- Action: "Unsheathes his katana, performing a slow, deliberate sword dance" uses dynamic verbs to describe the subject’s movement.
- Style: "Cinematic" specifies a professional, film-like visual style.
- Atmosphere/Mood: "Serene yet intense" conveys the emotional tone.
- Camera Movements: "The camera starts with a wide-angle shot, slowly zooming in to a medium shot" provides specific camera instructions.
- Lighting: "Soft golden sunlight filters through the mist, casting gentle shadows" details the lighting conditions.
- Shot Size: "Wide-angle shot" and "medium shot" clarify the framing.
- Metadata: "High-definition, emphasizing realistic textures and fluid motion" adds precision for quality output.
- Word Count: The prompt is 73 words, fitting the recommended 60–100 word range for sufficient detail without overloading.