ByteDance’s next-gen AI model can generate clips based on text, images, audio, and video
Big Tech's race to leapfrog the latest AI models continues with the launch of ByteDance's next-gen video generator. In a blog post, ByteDance - the China-based company behind TikTok - says Seedance 2.0 supports prompts that combine text, images, video, and audio. The company claims it "delivers a su...
By Emma RothFebruary 12, 2026117 views
Image: The Verge
ByteDance says its new AI video model can more accurately follow prompts. | Image: ByteDance
Big Tech's race to leapfrog the latest AI models continues with the launch of ByteDance's next-gen video generator. In a blog post, ByteDance - the China-based company behind TikTok - says Seedance 2.0 supports prompts that combine text, images, video, and audio.
The company claims it "delivers a substantial leap in generation quality," offering improvements in generating complex scenes with multiple subjects and its ability to follow instructions. Users can refine their text prompts by feeding Seedance 2.0 up to nine images, three video clips, and three audio clips.
The model can generate up to 15-second clips with audio, while taking cam …
Be the first to receive the latest news, market analysis and updates — delivered straight to your inbox.
We value your privacy
We use cookies to run this site and, with your consent, to measure
traffic and improve our content. Necessary cookies are always on. You
can accept all cookies or choose which ones to allow.
Privacy policy.