Ten months after first tease, OpenAI launches Sora video generation publicly

thrillgore · Dec 9, 2024

Throw caution to the wind!

VoidWeaver · Dec 9, 2024

Looks great, but after waiting so long, I really expected it to be a step above the competition—and it's not. (I guess they focused on making it faster rather than better.)

The competition is just as good, less censored, and offers features not available from Sora.

There are also some open-source models that recently came out and seem just as capable as this one (but you need some monster hardware to run them.)

I was kind of hoping OAI would be the first to roll out a model that could generate both audio and video, but I guess we'll have to wait until next year for that.

jks05 · Dec 9, 2024

Looks like their sign in servers are down from the browser and Sora is not available in iOS app yet.

Looking forward to playing with Sora. Still ambivalent about what down stream effects Sora will entail; AI slop, misinformation, etc.

Flailsafe · Dec 9, 2024

Floodgates be openin'

JoHBE · Dec 9, 2024

How long will the tech coast along on boasting the quality of everything they CAN realistically incrementally improve, before inevitably crashing into the hard walls of longterm temporal consistency, in combination with finegrained detailed adjustments?

How close are we to the "95% there!"?

Artem S. Tashkinov · Dec 9, 2024

Marques Brownlee (I'm not subscribed to him but he was the first to post an extensive review) has reviewed Sora:

View: https://www.youtube.com/watch?v=OY2x0TyKzIQ

It might be useful for experimenting with creative ideas and very short very specific clips, but when it comes to motion, especially involving real world physics or limbs (for some reason), it struggles hard. It also struggles with continuity and hallucinates a lot.

And as far as I understood, it's silent at the moment, i.e. there's no audio generation.

Don't expect it to fake reality just yet. Maybe in a year or two.

buback · Dec 9, 2024

A music video by Canadian art collective Vallée Duhamel made with Sora-generated video. "[We] just shoot stuff and then use Sora to combine it with a more interesting, more surreal vision."

The music video felt a bit blah, in that the generated transitions just felt random. It makes me think that this is going to require a human drafting specific prompts very granularly to get anything compelling out of AI.

Y_R_U_Here · Dec 9, 2024

Considering these are the clips and prompts hand-picked by OpenAI, I am very underwhelmed.

The soap dragon doesn't really "breathe" bubbles the way a dragon breathes fire.

The hero-dog loops like a poorly composited greenscreen from a 1990s pet store advertisement. The background looks very generic and not like NYC at all. (Except if you look closely, there are four (4) buildings that kinda-sorta look like the Empire State Building). It's actually worse upon repeated viewings because the tail kind of disappears and reappears from under the cape in an unnatural manner that makes me think the AI lacks object permanence.

luckydob · Dec 9, 2024

buback said:
The music video felt a bit blah, in that the generated transitions just felt random. It makes me think that this is going to require a human drafting specific prompts very granularly to get anything compelling out of AI.

For now...2 years, 5 years, 10 years down the road and we are in something completely different. There is no putting the genie back in the bottle.

gotosoda · Dec 9, 2024

Just as I expected, OpenAI hyped Sora months ago, then vanished. Now that they have actually put something out, the reviews are not great. (High censored and struggling with image consistency seem to be the big complaints.) Meanwhile group like Kling and Minimax are offering more, for less.

Will OpenAI continue to be where the investment goes, or will people with piles of money to burn start looking elsewhere? There is an investment to be made here. Even Sora, with its shortcomings, is going to be a formidable tool in the hands of skilled human user. I just think OpenAI, perhaps due to the hubris of Altman, is slow walking themselves out of being competitive.

How close are we to the "95% there!"?

95%? Hell 90% is more than enough to be considered highy successful .

macphoenix · Dec 9, 2024

Regarding the hero image for this article, why do mushroom-headed creatures have breasts? They must be going for the fans of Jennifer Aniston from Friends.

mr_hoppy · Dec 9, 2024

A good use of energy? Given generating one still image uses a similar amount of energy to a full phone battery.

Joe the Spartan · Dec 9, 2024

Most Gen-Video is just edited stock footage

Oak · Dec 9, 2024

Y_R_U_Here said:
Considering these are the clips and prompts hand-picked by OpenAI, I am very underwhelmed.

The soap dragon doesn't really "breathe" bubbles the way a dragon breathes fire.

The hero-dog loops like a poorly composited greenscreen from a 1990s pet store advertisement. The background looks very generic and not like NYC at all. (Except if you look closely, there are four (4) buildings that kinda-sorta look like the Empire State Building). It's actually worse upon repeated viewings because the tail kind of disappears and reappears from under the cape in an unnatural manner that makes me think the AI lacks object permanence.

The AI was definitely getting confused over whether it was a tail or just an extra red flap beyond the blue cape. Also, the length of the tail varied from one unobscured appearance to the next.

coopster · Dec 9, 2024

Soulless text > soulless pictures > soulless videos

Burning up the planet for shitty knockoffs to save a few bucks.

MrTom · Dec 9, 2024

Wow, that music video was too busy.. I think the guy needed to slim it down a bit. Less is more, right?

the_great_ganonderp · Dec 9, 2024

mushroom people crowd mushroom woman in foreground mushroom beige stock footage poppin nips 4K UHD these nips are blastin beautiful sexy mushroom woman HDR full beams if you know what i mean trending on artstation trending on youtube high quality jennifer aniston type thing going on with the nipples photorealistic uncanny surreal directed by jj abrams wonderful

TaxiZaphod · Dec 9, 2024

coopster said:
Soulless text > soulless pictures > soulless videos

Burning up the planet for shitty knockoffs to save a few bucks.

And putting actual creative people out of work.

GKH · Dec 9, 2024

macphoenix said:
Regarding the hero image for this article, why do mushroom-headed creatures have breasts? They must be going for the fans of Jennifer Aniston from Friends.

"Mushroom nipples" was not on the list of things I expected to see today.

But it makes sense; porn-ification of all the things is far and away the leading use case for AI.

lurknomore · Dec 9, 2024

these types of limitations are based on what example videos were used to train AI models.

It must be hard to train a model to come up with 10-20s videos when anything filmed this century has a cut at most every 7 seconds, and generally a lot less than that.

gotosoda · Dec 9, 2024

TaxiZaphod said:
And putting actual creative people out of work.

Nah, it is just a tool, and it will be the creative people who are just learning the tool and how to push it that will put the soul back into it. It is the same evolution that occurred when the Macintosh landed on every graphic designer's desk.

Niles Gazic · Dec 9, 2024

GKH said:
"Mushroom nipples" was not on the list of things I expected to see today.

But it makes sense; porn-ification of all the things is far and away the leading use case for AI.

I find it both humorous and unsettling that if you expand that mushroom image, that what appeared to be nipples turn out to be either missing or severely inverted nipples.

Fatesrider · Dec 9, 2024

jks05 said:
Looks like their sign in servers are down from the browser and Sora is not available in iOS app yet.

Looking forward to playing with Sora. Still ambivalent about what down stream effects Sora will entail; AI slop, misinformation, etc.

The "playing" part should probably keep OpenAI executives awake at night.

Sure, they get their sub fees as people play, but once the curiosity is sated, what's left? A cancelled account because it's just not useful to, well, anyone for anything that's not directly related to an income or for things that will soon be, if not already is, illegal.

And I'd think that the vast majority of netizens have no shits to give about AI video generation if they have something HONEST to present.

That leaves the cheaters, propagandists, liars, haters, and other folks who pollute the gene pool for them to rely on for income. One wonders if that's a viable business model.

andocom · Dec 9, 2024

GKH said:
"Mushroom nipples" was not on the list of things I expected to see today.

But it makes sense; porn-ification of all the things is far and away the leading use case for AI.

I thought that at first but it is actually unrelated folds of the dress material, maybe thats what you get when AI knows there are often protrusions there but doesn't understand why.

CH77 · Dec 9, 2024

The heads of those mushrooms all appear similar, as does the folds in their clothing. Demo was impressive but this is meh.

Zeebee · Dec 9, 2024

TaxiZaphod said:
And putting actual creative people out of work.

Who do you think will use this instead of hiring people to make professional video? If you're using this it's because you either can't afford to, or simply will not pay someone to do it (e.g.: maybe I might to want a video of a dachshund juggling for funsies, I'm sure not paying someone hundreds of dollars to do it).

A band using generative AI to make a music video would have, instead, shot it themselves with a camera phone.

peppeddu · Dec 9, 2024

Where is Will Smith eating spaghetti?

somechar · Dec 9, 2024

gotosoda said:
Nah, it is just a tool, and it will be the creative people who are just learning the tool and how to push it that will put the soul back into it. It is the same evolution that occurred when the Macintosh landed on every graphic designer's desk.

"Just a tool" sure is doing a lot of heavy lifting here. Calling something "the same evolution" merely serves to obfuscate the important differences between the introduction of the Mac and the introduction of AI.

somechar · Dec 9, 2024

peppeddu said:
Where is Will Smith eating spaghetti?

Highly appropriate that this appears to be your 666th post.

MilkyBarKid · Dec 9, 2024

gotosoda said:
95%? Hell 90% is more than enough to be considered highy successful .

90% is fine if you have an expert reviewing the output and there’s little cost to generating new answers. But that expert costs a lot.

If you want something a non-expert can pick up and use, it needs to be a product, a tool - it needs to just work. And a tool that fails 1 in 10 times is a terrible tool. If a chatbot gives confidently wrong answers to 1 in 10 questions, and is fielding 1000s of requests, that’s hundreds of complaints the company now has to address.

There’s a reason eBay sellers tout 99% positive reviews, and companies aim for six sigma.

Edit: another issue with calling GenAI tools “tools” Is that a tool does what its user intends. GenAI is a poor tool by that standard, both by the 1 in 10 failures and in the difficulty in making fine adjustments.

imchillyb · Dec 9, 2024

$20 vs $200.

10x the price for 200x the value.

This New-AI world will further cement the planet's riches and resources into the hands of the few; the wealthy.

sixstringedthing · Dec 9, 2024

Are you seriously telling me that those clips - the "adorable" foam creature that looks like poorly animated frog spawn, the "looks like cheaply produced 80's TV" Superdog clip with background snow that looks utterly wrong because the tool doesn't understand scale/perspective - those clips are the best that OpenAI could produce to showcase this new tech?

Bad job OpenAI. Bad. Job.
I continue to be amazed at how easily impressed some people are.

As a final note, how much of a wanker do you have to be to shart out a <2min AI-generated music video and then refer to it as a "film".
Fuck all the way off.

TylerH · Dec 9, 2024

Why is the article's top image a still from someone's mushroom porn fantasy? I wasn't even going to click on the article but the associated image was so weird I had to comment on it.

soybot · Dec 9, 2024

I wonder how much "money" it costs them for each video.

CasonBang · Dec 9, 2024

GKH said:
"Mushroom nipples" was not on the list of things I expected to see today.

But it makes sense; porn-ification of all the things is far and away the leading use case for AI.

If the image wasn’t specifically cherry-picked (though it probably was), this could be a side effect of efforts to expand gender and race representation in the models. We’ve seen this recently with improbable races in historical photos, emoji genders in iOS 18.2, etc.

As the training data and reinforcement encourages a wider range, the outputs become more likely to depict combinations we might find to be unusual to our reality-based sensibilities.

I can’t imagine how much harder that makes it to prevent showing us results that are downright racist or pornographic, being generated basically via roll of the dice.

YellowPudding64 · Dec 9, 2024

peppeddu said:
Where is Will Smith eating spaghetti?

At an Italian restaurant?

GKH · Dec 9, 2024

andocom said:
I thought that at first but it is actually unrelated folds of the dress material, maybe thats what you get when AI knows there are often protrusions there but doesn't understand why.

There are two dimensions of "make sense" here; the first is about how the model works. The second (and to me more interesting) is "of the functionally infinitely many things to show, why did the meatbags choose that?"

NinjaNerd56 · Dec 9, 2024

At age 70, this has me engaged.

JoHBE · Dec 9, 2024

sixstringedthing said:
Bad job OpenAI. Bad. Job.
I continue to be amazed at how easily impressed some people are.

Thing is: it IS all mighty impressive in a tech demo kinda sense

But the extrapolation that is necessary to further lift it up to what it is imagined to be in the future, doesn't seem to have much to start from. Something entirely novel will have to be injected to get there. Which is entirely possible, I guess, but WHAT currently actually warrants any supreme confidence in that? It all just feels far too similar to the self driving cars story.

It's like they managed to train a human weightlifter to suddenly lift twice as heavy weights than anyone ever did before, and people would be sold the idea (and believe) that, soon, human construction workers will replace cranes altogether.

Ten months after first tease, OpenAI launches Sora video generation publicly

Ars Praefectus

Account Banned

Seniorius Lurkius

Ars Praetorian

Ars Praefectus

Ars Scholae Palatinae

Ars Scholae Palatinae

Smack-Fu Master, in training

Ars Scholae Palatinae

Ars Centurion

Seniorius Lurkius

Smack-Fu Master, in training

Smack-Fu Master, in training

Ars Tribunus Militum

Ars Praetorian

Ars Tribunus Militum

Ars Praetorian

Wise, Aged Ars Veteran

Ars Scholae Palatinae

Ars Tribunus Militum

Ars Centurion

Ars Praetorian

Ars Legatus Legionis

Ars Scholae Palatinae

Seniorius Lurkius

Ars Tribunus Militum

Ars Scholae Palatinae

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Ars Praetorian

Ars Scholae Palatinae

Ars Scholae Palatinae

Ars Praefectus

Smack-Fu Master, in training

Wise, Aged Ars Veteran

Smack-Fu Master, in training

Ars Scholae Palatinae

Ars Praefectus

Ars Praefectus