Which organization will first release an open weights video generation model with fidelity as good as Sora?
2025
Meta: 17%
Stability AI: 17%
black forest labs: 12%
Kuaishou: 11%
Mistral: 11%
Other: 10%
Google: 6%
nvidia: 4%
Bytedance: 3%
ZhiPu AI: 1.8%
Alibaba: 1.5%
OpenAI: 1.4%
Natural Synthetics Inc.: 1.3%
Shanghai AI Lab: 1.3%

OpenAI has just announced a new text-to-video model known as Sora, which has unprecedented visual quality and object permanence.

This question asks: which company will first release an open-weights text-to-video model (or image-to-video model) with fidelity equal to or greater than Sora's?

For this question to resolve positively, the model must be open-weights, meaning anyone can download the model weights (possibly after signing a disclaimer), but it need not be open-source. For example, it could be research-only or carry restrictions on commercial use.

Notable existing open-weights video generation models include:
Stable Video Diffusion: Stability AI
Hotshot XL: Natural Synthetics Inc.
Animate LCM: Shanghai AI Lab
I2VGen-XL: Alibaba
MagicAnimate: ByteDance
ModelScope text-to-video: ModelScope

(new answers can be added to this question)

Quality will be assessed by my personal judgement, unless OpenAI releases official scores (for example, video FID) of Sora's performance. To resolve positively, a model must at a minimum: produce videos of length >=60s, demonstrate object permanence, and, most of the time, generate humans and animals with the correct number of arms, legs, and fingers.


CogVideoX-2b released by ZhiPu AI

black forest labs, creator of Flux, claims to be training a video model

https://blackforestlabs.ai/up-next/

new model by Kuaishou appears very good. Of course not open weights, but worth keeping an eye on.

StabilityAI CEO announces Stable Diffusion 3

why tf is mistral so high

@ashly_webb secretly hoping the answer is "insider trading"