Minimax Hailuo 02
Advanced text-to-video and image-to-video model with realistic physics.
Advanced text-to-video and image-to-video model with realistic physics.
Model Overview
A text-to-video and image-to-video model capable of generating 6s or 10s videos at 768p or 1080p resolution, excelling at realistic physics.
Best At
Generating videos with realistic physics and motion. It shines in complex scenarios requiring accurate physical interactions.
Limitations / Not Good At
Generation speed is an area for improvement. While enhanced, model alignment and stability are ongoing development focuses.
Ideal Use Cases
Creating dynamic video content, product visualizations with motion, illustrative clips for educational content, and artistic video pieces.
Input & Output Format
- Input: Text prompt, optional first frame image, optional last frame image, duration (seconds), resolution ('768p' or '1080p'), and a prompt optimizer toggle.
- Output: Video file (URI).
Performance Notes
Offers a 2.5x improvement in computational efficiency compared to conventional architectures. The 'pro' model (1080p) provides higher quality with better physics and coherency.
Prompt
StringText prompt for generation
First Frame Image
StringFirst frame image for video generation. The output video will have the same aspect ratio as this image.
Last Frame Image
StringLast frame image for video generation. The final frame of the output video will match this image.
Prompt
StringText prompt for generation
Duration
NumberDuration of the video in seconds. 10 seconds is only available for 768p resolution.
6Resolution
StringPick between standard 512p, 768p, or pro 1080p resolution. The pro model is not just high resolution, it is also higher quality.
1080pPrompt Optimizer
BooleanUse prompt optimizer
trueOutput
InferredOutput
Type
Node
Status
Official
Package
Nodespell AI
Category
AI / Video / MinimaxInput
Output