Unified Multimodal Input in One Model
Kling O1 is a strong fit for creators who want one model that can work across text, image, video, and subject references.

Kling O1 is a better fit for workflows that combine text, image, video, and subject references in one more unified creation flow.
Kling O1 workflow dashboard
Choose the main signal first, keep continuity goals explicit, and then write a brief that combines text, image, video, and subject references without conflicting instructions.

Workflow preview
ReadyStart from written direction when story beats, pacing, and camera language should define the rest of the workflow.
Active brief
Use the product photo as the main reference, keep the bottle shape and label consistent, then generate a premium launch video with a slow push-in camera move.
Kling O1 was officially unveiled on December 1, 2025. In Kuaishou's announcement, the company positions Kling O1 as a unified multimodal creation tool and says it integrates text, video, image, and subject inputs in one engine. The same launch announcement says Kling O1 combines reference-based generation, text-to-video, start and end frame generation, video in-painting, video modification, style re-rendering, and shot extension in a single workflow. This makes Kling O1 the strongest page in the cluster for users looking beyond one narrow task and toward a more unified creation system.

Kling O1 is a strong fit for creators who want one model that can work across text, image, video, and subject references.
The official launch frames Kling O1 around solving consistency challenges, so this page can naturally target users who care about keeping scenes and subjects coherent.
O1 is the clearest page for creators who need generation, modification, in-painting, re-rendering, and shot extension in one model story.

Reference-based video generation with subject consistency
Creative projects that mix text, image, and video inputs
Ad and e-commerce videos that need unified editing
Film or social content workflows with scene continuity needs
Users comparing Kling O1 against earlier Kling version pages
01
Decide whether text, image, video, or subject reference is the primary signal before adding anything else.
02
If character or scene continuity matters, say that early and keep the reference instructions direct.
03
Start with generation, then add modification or extension goals so the creation path stays easy to control.
Use the related pages below to compare version-specific improvements with broader workflow-led entry points around Kling O1.
Use the O1 model page to understand the workflow, then move into the right creation path.