Kling 2.6 Text To Video
Top-tier text-to-video model generating cinematic videos with fluid motion and native audio.
Overview
Kling Video v2.6 is an advanced text-to-video model designed to generate high-quality cinematic videos from textual prompts. It produces fluid video motion paired with native audio generation, supporting English and Chinese voice output, with automatic translation of other languages into English speech. The model supports customizable parameters such as video duration (5 or 10 seconds), aspect ratio (16:9, 9:16, 1:1), and CFG scale to control adherence to prompts.
Strengths / What it does well
- Generates cinematic visuals with smooth video motion.
- Generates native audio, including English and Chinese speech, which adds realism.
- Flexible aspect ratios and video duration options.
- Accepts a negative prompt to reduce blur, distortion, and low-quality artifacts.
Best use cases
Ideal for creating short, evocative video clips from detailed textual descriptions, suitable for storytelling, marketing, and content creation when engaging cinematic visuals and synchronized audio are required.
Inputs
- Prompt (String)
- Duration (String, default: 5): The duration of the generated video in seconds.
- Aspect Ratio (String, default: 16:9): The aspect ratio of the generated video frame.
- Generate Audio (Boolean, default: true): Whether to generate native audio for the video. Supports Chinese and English voice output. Other languages are automatically translated to English. For English speech, use lowercase letters; for acronyms or proper nouns, use uppercase.
- Negative Prompt (String, default: blur, distort, and low quality)
- Cfg Scale (Number, default: 0.5): The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt.
Output
- Output (Inferred)
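The inputs listed above can be collected and checked before they are handed to the node. The following is a minimal sketch only, not the package's actual API. The class name `KlingTextToVideoInputs`, the snake_case field names, and the empty-prompt check are all hypothetical. The allowed durations and aspect ratios come from the Overview. The valid CFG range is not stated on this page, so it is not checked.

```python
from dataclasses import dataclass, asdict

# Option sets taken from the Overview text; all names below are hypothetical.
ALLOWED_DURATIONS = ("5", "10")
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16", "1:1")


@dataclass
class KlingTextToVideoInputs:
    """Hypothetical container mirroring the node's listed inputs and defaults."""

    prompt: str
    duration: str = "5"  # listed as a String input with default 5
    aspect_ratio: str = "16:9"
    generate_audio: bool = True
    negative_prompt: str = "blur, distort, and low quality"
    cfg_scale: float = 0.5  # valid range is not documented here, so not validated

    def validate(self) -> None:
        # Assumption: an empty prompt is treated as an error.
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        if self.duration not in ALLOWED_DURATIONS:
            raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
        if self.aspect_ratio not in ALLOWED_ASPECT_RATIOS:
            raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")

    def to_dict(self) -> dict:
        """Validate, then return the inputs as a plain dictionary."""
        self.validate()
        return asdict(self)


# Usage: override only the fields that differ from the listed defaults.
inputs = KlingTextToVideoInputs(
    prompt="a lighthouse at dusk, waves rolling in",
    duration="10",
)
payload = inputs.to_dict()
```

Keeping the defaults in one place makes it easy to see which values a given run actually changed. How the resulting dictionary is passed to the node depends on the host application, which this page does not describe.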
Type: Node
Status: Official
Package: Nodespell AI
Category: AI / Video / Kuaishou