Google announces agent-optimized Gemini 3.5.Flash and a do-anything model called Omni

AliSard · 2026-05-19T14:19:27-0400

Speaking as a non-engineer but not an entirely uninformed idiot, AI scared me.

Agentic AI scares me more.

redleader · 2026-05-19T14:25:01-0400

Antigravity 2.0 is giving me "failed to make code backend request" on install. Anyone actually get it to run?

Edit:
View: https://www.reddit.com/r/google_antigravity/comments/1thuut3/new_update_issue/

Looks like the roll out is screwed up.

Tactical Finesse · 2026-05-19T14:31:58-0400

Google says its more efficient Gemini 3.5 Flash is the key to your agentic AI future.

Counterpoint: What "agentic AI future"?

sporkinum · 2026-05-19T14:32:28-0400

McTurkey said:
If you're concerned or scared, it's because you're not thinking about the solution: fixing our societal and governmental relationship with labor and human rights. If we're actually going to reach the Star Trek post-scarcity economy and society, that means abolishing human labor as the foundational lynchpin of the entire economy. AI isn't there yet. Neither is robotics. But the rate of improvements to both mean that the technical capacity to achieve this abolitionist goal is within our lifetimes.

It's not Star Trek they are working towards, it's The Hunger Games.

McTurkey · 2026-05-19T14:32:28-0400

redleader said:
Antigravity 2.0 is giving me "failed to make code backend request" on install. Anyone actually get it to run?

Edit:
View: https://www.reddit.com/r/google_antigravity/comments/1thuut3/new_update_issue/

Looks like the roll out is screwed up.

Same issue here. It was working until the update.

Nilt · 2026-05-19T14:34:24-0400

Who's worried about fast enough! I'm worried about 100% accuracy of results! Until and unless they get there, this shit ain't worth my time.

JustYourAverageJDP · 2026-05-19T14:34:43-0400

Speed is not my priority. Accuracy and reliability is.

I will not be using any system with Agentic AI enabled. This might mean I eventually have to do without computers.

Xyler · 2026-05-19T14:37:08-0400

McTurkey said:
The speed and quality improvement for 3.5 Flash in Google Antigravity is wild. It began rolling out over the weekend, and I noticed it on Sunday when not only did it respond near instantly, but it was solving problems that 3.1 Pro had been stuck on.

If you're concerned or scared, it's because you're not thinking about the solution: fixing our societal and governmental relationship with labor and human rights. If we're actually going to reach the Star Trek post-scarcity economy and society, that means abolishing human labor as the foundational lynchpin of the entire economy. AI isn't there yet. Neither is robotics. But the rate of improvements to both mean that the technical capacity to achieve this abolitionist goal is within our lifetimes.

We live in a consumer driven economy. There's no way they would let us basically be labourless. It just doesn't work.

"AI" can never reach sci-fi levels with current technology. It just cannot. It took a breakthrough of material science to deliver us PCs that could fit on a desk. That same breakthrough needs to happen to AI, or else it'll forever be a massive waste. And with current tech, it'll never happen. Inference on the scale of LLMs is just too costly to implement.

Don't delude yourself into thinking we're gonna have Star Trek levels of AI or society anytime soon. I would almost guarantee you we'll destroy our environment WELL before getting 20% of the way there.

Tactical Finesse · 2026-05-19T14:39:46-0400

McTurkey said:
This is a policy choice. Nobody working on AI is working towards either of these.

Do not treat dystopian outcomes as the natural or default or the expectation. Treat them as possible policy outcomes, and then VOTE and LOBBY accordingly.

The people to mind aren't the economically-irrelevant minions...it is people like Peter "I do not think democracy and freedom are compatible" Thiel

Glade9266 · 2026-05-19T14:39:56-0400

From the article:

It’s no secret that generative AI is currently a money pit, and all the major AI players are trying to find paths to greater efficiency. The problem is magnified when you start building agentic experiences that are supposed to run for longer to complete complex tasks. Gemini 3.5 Flash may be a big step toward making that viable. The new model can output nearly 300 tokens per second, but its benchmark scores are similar to larger frontier models (like 3.1 Pro) that build outputs at a quarter of that speed.

Maybe I missed something in my read of the article, or there is extra context here, but the problem, "generative AI is currently a money pit", and the solution up, "the new model can output nearly 300 tokens per second", don't seem to be actually connected.

If the costs of tokens is the problem, how is churning through them more quickly the solution? I didn't see anything in the article saying that 3.5 Flash is more efficient or costs less, just that it is faster. The way I see it, this new model release seems to be doubling down on, "we lose money on every token, but make it up in volume".

McTurkey · 2026-05-19T14:42:50-0400

Xyler said:
We live in a consumer driven economy. There's no way they would let us basically be labourless. It just doesn't work.

"AI" can never reach sci-fi levels with current technology. It just cannot. It took a breakthrough of material science to deliver us PCs that could fit on a desk. That same breakthrough needs to happen to AI, or else it'll forever be a massive waste. And with current tech, it'll never happen. Inference on the scale of LLMs is just too costly to implement.

Don't delude yourself into thinking we're gonna have Star Trek levels of AI or society anytime soon. I would almost guarantee you we'll destroy our environment WELL before getting 20% of the way there.

Who is "they"?

Do you live in a democracy, or are you living in a pure authoritarian state without agency?

Varste · 2026-05-19T14:43:49-0400

I'm not in software development so I'll leave that for the programmers to say how good AI is for the task. But for a lot of people who are trying to sell us on these AI tools, I always read into their quotes that they are useless space-fillers and that AI is basically doing their job for them. Perhaps I'm too cynical on the corporate world, where the people who love AI the most are the ones who seem to generate the most fluff-filled, content-devoid output.

Tactical Finesse · 2026-05-19T14:48:59-0400

Xyler said:
We live in a consumer driven economy. There's no way they would let us basically be labourless. It just doesn't work.

"AI" can never reach sci-fi levels with current technology. It just cannot. It took a breakthrough of material science to deliver us PCs that could fit on a desk. That same breakthrough needs to happen to AI, or else it'll forever be a massive waste. And with current tech, it'll never happen. Inference on the scale of LLMs is just too costly to implement.

Don't delude yourself into thinking we're gonna have Star Trek levels of AI or society anytime soon. I would almost guarantee you we'll destroy our environment WELL before getting 20% of the way there.

Well, no and yes. Yes, we have a consumer driven economy...BUT...who are the consumers? You know how stock ownership is severely top-weighted to the point where 90+% of all trades are billionaires?

The same exact thing has happened to consumer spending in most "western" countries. Particularly the USA. The asset-owning top-10% in the USA performs about 50% of all consumer spending now. Which is why so many economic KPIs like GDP and CPI and so on--are increasingly decoupled from Normal Person lived experience. A trend foreseen 20 years ago by the marxist communists at Citibank--and their recommendation to investors and clients was to abandon working-people and pivot to only servicing high-net-worth individuals.

https://www.sourcewatch.org/images/8/86/CITIGROUP-OCTOBER-16-2005-PLUTONOMY-MEMO.pdf

That memo...they tried to retcon from the internet...because it is a pretty terrible look to say the least.

norton_I · 2026-05-19T14:49:25-0400

Glade9266 said:
Maybe I missed something in my read of the article, or there is extra context here, but the problem, "generative AI is currently a money pit", and the solution up, "the new model can output nearly 300 tokens per second", don't seem to be actually connected.

If the costs of tokens is the problem, how is churning through them more quickly the solution? I didn't see anything in the article saying that 3.5 Flash is more efficient or costs less, just that it is faster. The way I see it, this new model release seems to be doubling down on, "we lose money on every token, but make it up in volume".

The unsupported but plausible implication is the that it means they can produce 300 tokens per second on hardware that used to produce 75 tokens/second (or whatever). So more throughput on the same hardware. To what extent that holds up depends on a lot of details that google isn't sharing.

If Google can run models that compete in performance with competitors and their old "pro" models, but cost google similar to their old flash models, that could definitely change the profitability equation.

S_T_R · 2026-05-19T14:50:50-0400

Glade9266 said:
From the article:

Maybe I missed something in my read of the article, or there is extra context here, but the problem, "generative AI is currently a money pit", and the solution up, "the new model can output nearly 300 tokens per second", don't seem to be actually connected.

If the costs of tokens is the problem, how is churning through them more quickly the solution? I didn't see anything in the article saying that 3.5 Flash is more efficient or costs less, just that it is faster. The way I see it, this new model release seems to be doubling down on, "we lose money on every token, but make it up in volume".

It's plausible to infer that individual sessions aren't being given significantly more processing power. As such, more tokens in the same time implies that Google has found shortcuts that let them process a given token more quickly and therefore is taking less computer (and thus less money) to generate.

Also, the issue with AI is two-fold: queries are getting cheaper to process. However, accuracy still isn't great, and the easiest way to up accuracy is by using more compute per query. So there's a tug of war between wringing efficiency from each token calculated, and using more tokens in each query response.

How much this tech succeeds ultimately depends on if they can get accuracy high enough to satisfy end users while finding enough efficiency gains to eek out a profit.

TylerH · 2026-05-19T14:54:21-0400

Gemini 3.5 Flash might be fast enough for gen AI to make sense

Eh, I doubt it.

It’s no secret that generative AI is currently a money pit, and all the major AI players are trying to find paths to greater efficiency.

Oh, you mean financial sense for the companies to pursue. Maybe, sure.

Anoff · 2026-05-19T14:59:17-0400

Putting aside how terrible Google has been at shoving AI everywhere it's not needed or wanted, with opt outs buried like the best pirate treasure, and how Gemini is a pretty bad, try-hard model that will completely rebuild a website when you simply ask for a font color change, things like this are why I'm much less worried about a lot of the scary environmental impacts of AI.

Electricity and water usage fears aren't unfounded, but they were always in that realm of engineering problems that we've proven incredibly adapt at overcoming - it felt inevitable that the models would become more efficient, and that the hardware would as well, eventually significantly reducing the amount of electricity and water required, and finally trickling down to reduce the ridiculous CapEx numbers AI companies were throwing out there for data centers.

AI still has a hosts of problems - impacts on employment, AI slop masquerading as legitimate news, deep fake nudes, a huge stock market bubble - but at least solutions for the environmental issues are coming soon - maybe not as soon as we'd like, but better late than never

peterford · 2026-05-19T15:02:41-0400

That was a pretty impressive display by Google.

I still don't think there's much there that will make me open my wallet yet but they gave a very strong feeling of everything continuing to speed up.

Can't help but feel retailers will hate the universal shipping cart though.

Anoff · 2026-05-19T15:03:10-0400

redleader said:
Antigravity 2.0 is giving me "failed to make code backend request" on install. Anyone actually get it to run?

Edit:
View: https://www.reddit.com/r/google_antigravity/comments/1thuut3/new_update_issue/

Looks like the roll out is screwed up.

Yea, use Claude Code or Cursor. Antigravity sucks, it's a try-hard that will completely re-do a kitchen when you ask it to fix a stuck drawer. Antigravity f*cked up more simple tasks than I care to count, it tries way to hard to 'guess' what else you might want (refactor an entire webpage) even when you make simple request (change a font color)

DrewW · 2026-05-19T15:04:46-0400

McTurkey said:
The speed and quality improvement for 3.5 Flash in Google Antigravity is wild. It began rolling out over the weekend, and I noticed it on Sunday when not only did it respond near instantly, but it was solving problems that 3.1 Pro had been stuck on.

If you're concerned or scared, it's because you're not thinking about the solution: fixing our societal and governmental relationship with labor and human rights. If we're actually going to reach the Star Trek post-scarcity economy and society, that means abolishing human labor as the foundational lynchpin of the entire economy. AI isn't there yet. Neither is robotics. But the rate of improvements to both mean that the technical capacity to achieve this abolitionist goal is within our lifetimes.

Unless a few thousand people with pitchforks are outside [name redacted]’s mansion with pitchforks, why would they share any wealth to create a Roddenbarry imagined future?

Zero bankers went to prison after the financial crisis while thousands of Americans lost their homes. Boar’s Head killed more people last year than the Manson family during their entire murder spree. I can list examples all day. It’s nonsensical to think the wealthy will suddenly decide to share for the good of all mankind because AI when they have been greedy for the entire history of mankind.

E.g. How many Marc Cuban libraries have you been to? How many Elon Musk opera houses? How many Sam Altman cancer wings are at any US hospitals?

Bill Gates* and Laurene Powell Jobs are statistical anomalies in our world - why do you expect them to become the norm?

*you know why this asterisk
Is here

Errum · 2026-05-19T15:09:04-0400

McTurkey said:
The speed and quality improvement for 3.5 Flash in Google Antigravity is wild. It began rolling out over the weekend, and I noticed it on Sunday when not only did it respond near instantly, but it was solving problems that 3.1 Pro had been stuck on.

If you're concerned or scared, it's because you're not thinking about the solution: fixing our societal and governmental relationship with labor and human rights. If we're actually going to reach the Star Trek post-scarcity economy and society, that means abolishing human labor as the foundational lynchpin of the entire economy. AI isn't there yet. Neither is robotics. But the rate of improvements to both mean that the technical capacity to achieve this abolitionist goal is within our lifetimes.

Yeah I’m more envisaging the idle fatties in WALL-E.

S_T_R · 2026-05-19T15:11:03-0400

Anoff said:
Putting aside how terrible Google has been at shoving AI everywhere it's not needed or wanted, with opt outs buried like the best pirate treasure, and how Gemini is a pretty bad, try-hard model that will completely rebuild a website when you simply ask for a font color change, things like this are why I'm much less worried about a lot of the scary environmental impacts of AI.

Electricity and water usage fears aren't unfounded, but they were always in that realm of engineering problems that we've proven incredibly adapt at overcoming - it felt inevitable that the models would become more efficient, and that the hardware would as well, eventually significantly reducing the amount of electricity and water required, and finally trickling down to reduce the ridiculous CapEx numbers AI companies were throwing out there for data centers.

Utility usage isn't a engineering problem. They're policy problems. Policy allows datacenters in (e.g.) Arizona to use evaporative cooling. Closed loop cooling systems are a solved problem, they just cost more to build and operate. So, absent a legal requirement, data centers are going to build open systems that turn scarce local water into vapor. Require closed loop cooling and you eliminate the bulk of water usage.

Similarly, my electrical bill going up due to data centers is also a policy issue. Data centers, as large bulk buyers of electricity, are allowed to negotiate lower rates. End users are basically stuck with whatever rate they're given. So as rates are bid up (since supply is finite and constrained in the short run), households get hit harder than big users. Regulators could add surcharges to high users (and pipe the money towards rebates for households) to balance this out. I'd be stuck paying more for the same usage, but it wouldn't be as bad.

We're starting to see some of these ideas get enacted at the state and local level.

AdamWill · 2026-05-19T15:15:10-0400

Glade9266 said:
From the article:

Maybe I missed something in my read of the article, or there is extra context here, but the problem, "generative AI is currently a money pit", and the solution up, "the new model can output nearly 300 tokens per second", don't seem to be actually connected.

If the costs of tokens is the problem, how is churning through them more quickly the solution? I didn't see anything in the article saying that 3.5 Flash is more efficient or costs less, just that it is faster. The way I see it, this new model release seems to be doubling down on, "we lose money on every token, but make it up in volume".

Well, the implication I think is that "on the same hardware this model can produce 4x as many tokens in the same time", which should mean each token costs 25% as much. Which would be a significant change, if the quality was similar.

We'll have to wait to see if those things turn out to be true, though.

twilightomni · 2026-05-19T15:31:48-0400

norton_I said:
The unsupported but plausible implication is the that it means they can produce 300 tokens per second on hardware that used to produce 75 tokens/second (or whatever). So more throughput on the same hardware. To what extent that holds up depends on a lot of details that google isn't sharing.

If Google can run models that compete in performance with competitors and their old "pro" models, but cost google similar to their old flash models, that could definitely change the profitability equation.

What implication? Was this hinted anywhere that this was the same hardware? (Genuine question, trying to parse headlines here)

I just would’ve assumed they rented out specialized providers like Cerberus that already provide high-speed models of 100+ token/sec today, like Opus-Fast and Codex-Flash that cost 5-6x more.

And if it was the same hardware, why does it now suddenly cost 5x more per token? [1]

1.
View: https://x.com/artificialanlys/status/2056795055512596817

sporkinum · 2026-05-19T15:32:52-0400

McTurkey said:
Who is "they"?

Do you live in a democracy, or are you living in a pure authoritarian state without agency?

The damage is being done now. 50% of the voting age people are not real bright and the money behind AI is counting on that so they can damn the torpedoes and go full speed ahead.
The only real hope is for the house of cards to crumble and for some real guardrails to get put in place.

WereCatf · 2026-05-19T15:39:15-0400

They can trot out all the rich executives they want and they can throw an entire dictionary's worth of enthusiastic terms around, but....I can't think of anything I'd need or want an AI agent for. I wouldn't give one access to my servers or network infrastructure, nor would I give one access to anything more personal, both for reasons of privacy and because no matter what the executives say, these models keep making random mistakes.

Argent Claim · 2026-05-19T15:43:19-0400

McTurkey said:
If you're concerned or scared, it's because you're not thinking about the solution: fixing our societal and governmental relationship with labor and human rights.

You're not actually thinking about solutions, either, but imagining a literal deus ex machina.

McTurkey said:
If we're actually going to reach the Star Trek post-scarcity economy and society, that means abolishing human labor as the foundational lynchpin of the entire economy.

Star Trek ran on magic and the fiat of writers, and there is simply no guarantee the latter is possible or even desirable. In fact, automation has historically created more jobs, rather than fewer.

McTurkey said:
AI isn't there yet. Neither is robotics. But the rate of improvements to both mean that the technical capacity to achieve this abolitionist goal is within our lifetimes.

Technological progress does not and cannot go on forever; it will eventually end for anything under the Sun. When and where that progress ends for whatever metric you choose to employ varies, but it will end and often far short of where you want it.

If we want real, long lasting solutions, then it will be up to people to actually take action instead of fantasizing about magic disguised as technology saving the day.

NobleNobbler · 2026-05-19T15:50:06-0400

JulianS said:
I don't even see how the author makes that statement with any degree of credibility. Back of the envelope math suggests Anthropic is probably already profitable or near it right now. Similarly, the hyperscalers are seeing positive ROI on all the spend.

It is a broader issue with anything AI on this site nowadays - it gets instantly downvoted, called useless, etc. So much pessimism about an interesting and rapidly improving tech on a tech website.

I... I mean I just don't trust this envelope you've got there

Near as I've been able to tell, they might turn a profit in a few years fingers crossed

Scott_Tu · 2026-05-19T15:52:21-0400

Someone just realized they forgot to check the MTP box. New versionn!!

gkorper · 2026-05-19T15:52:24-0400

Anoff said:
Yea, use Claude Code or Cursor. Antigravity sucks, it's a try-hard that will completely re-do a kitchen when you ask it to fix a stuck drawer. Antigravity f*cked up more simple tasks than I care to count, it tries way to hard to 'guess' what else you might want (refactor an entire webpage) even when you make simple request (change a font color)

Well then you will happy to know that, in true Google fashion, the new Antigravity is a completely different product than than the old one called Antigravity.

Sarty · 2026-05-19T15:58:05-0400

WereCatf said:
They can trot out all the rich executives they want and they can throw an entire dictionary's worth of enthusiastic terms around, but....I can't think of anything I'd need or want an AI agent for. I wouldn't give one access to my servers or network infrastructure, nor would I give one access to anything more personal, both for reasons of privacy and because no matter what the executives say, these models keep making random mistakes.

Indeed, "We made our chaosbot cheaper!" is a solution to a problem that I do not have. It could be free; it could be $20/day paid to the user. I'd still never let it near a system I even slightly cared about--treat it as radioactive as a flash drive you found on the sidewalk.

Sarty · 2026-05-19T16:04:30-0400

JulianS said:
So much pessimism about an interesting and rapidly improving tech on a tech website.

I have never understood this mindset, this expectation of mindless clapping like a brain-dead trained seal. Do you expect a car website to favorably review every new model it drives? Do you expect the sports section of your local newspaper to bleatingly praise every new player signed and every one of the coach's in-game decisions?

Ars Technica writes about technology, but "about" means a whole lot more than reflexively showering with fanboi-ism. Is Engadget still a thing? I think you might find it more over there.

studenteternal · 2026-05-19T16:13:15-0400

Anoff said:
Yea, use Claude Code or Cursor. Antigravity sucks, it's a try-hard that will completely re-do a kitchen when you ask it to fix a stuck drawer. Antigravity f*cked up more simple tasks than I care to count, it tries way to hard to 'guess' what else you might want (refactor an entire webpage) even when you make simple request (change a font color)

Or better yet, don't! There aren't any validated improvments in productivity in anything but an anacdotal way, its going to get more expensive and difficult to control costs on, no matter what model you use, and ultimatly makes you a worse thinker and engineer than learning to do what you want yourself.

The Lurker Beneath · 2026-05-19T16:33:42-0400

JulianS said:
I don't even see how the author makes that statement with any degree of credibility. Back of the envelope math suggests Anthropic is probably already profitable or near it right now. Similarly, the hyperscalers are seeing positive ROI on all the spend.

It is a broader issue with anything AI on this site nowadays - it gets instantly downvoted, called useless, etc. So much pessimism about an interesting and rapidly improving tech on a tech website.

It's optimism, in a sense - as in whistling past the graveyard.

Google announces agent-optimized Gemini 3.5.Flash and a do-anything model called Omni

Wise, Aged Ars Veteran

Ars Legatus Legionis

Wise, Aged Ars Veteran

Ars Tribunus Militum

Ars Tribunus Militum

Ars Legatus Legionis

Ars Praetorian

Ars Scholae Palatinae

Wise, Aged Ars Veteran

Smack-Fu Master, in training

Ars Tribunus Militum

Ars Praetorian

Wise, Aged Ars Veteran

Ars Praefectus

Ars Tribunus Militum

Ars Praefectus

Smack-Fu Master, in training

Ars Praefectus

Smack-Fu Master, in training

Ars Tribunus Militum

Ars Tribunus Militum

Ars Tribunus Militum

Ars Scholae Palatinae

Ars Centurion

Ars Tribunus Militum

Ars Tribunus Militum

Smack-Fu Master, in training

Smack-Fu Master, in training

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Ars Tribunus Angusticlavius

Ars Tribunus Angusticlavius

Wise, Aged Ars Veteran

Ars Tribunus Militum