Google tells employees it must double capacity every 6 months to meet AI demand

Missing Minute · Nov 22, 2025

wildsman said:
No this is the right paper - here is the relevant portion:

"Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months, substantially faster than hardware gains per Moore's Law."

The Nano/LLAMA 3 was an example I used to illustrate the general point.

You do understand my confusion seeing as it has next to nothing to do with your example, right?

Can the conclusions about the costs of pre-training be generalized to other AI costs, namely inference?

Missing Minute · Nov 22, 2025

wildsman said:
I will grant that it is a cynical, pessimistic insight that doesn't encourage good practices but I've found it true more often than not.

That's fair, though it can be a bit of a self fulfilling prophecy. It's also a lot easier to reason with someone irl than it is online. Finally, it takes a lot of skill, kindness, patience, understanding, and respect.

jdale · Nov 22, 2025

It’s unclear how much of this “demand” Google mentioned represents organic user interest in AI capabilities versus the company integrating AI features into existing services like Search, Gmail, and Workspace.

There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.

For comparison, there are about 2 billion ChatGPT queries per day. So Google is running 8 times that just for searches, where some significant proportion of users aren't even interested in getting their AI summaries at all.

jdale · Nov 22, 2025

PurpleBadger said:
As disturbing as this stuff is, I have to admit a strange fascination with that image of the Oregon data center's cooling pipes. I might be looking at my next weird desktop background...

Then you might love this park in Seattle: https://www.alamy.com/stock-photo/g...96F2EFFF64219AA2A2F090ED6B363&sortBy=relevant

View: https://www.flickr.com/photos/80651083@N00/7148261917/in/photostream/

Seferino · Nov 22, 2025

SixDegrees said:
"Google tells employees it must double capacity every 6 months to meet AI demand"

Who's doing the demanding?

Investors, presumably.

Oh, you meant which users?

Seferino · Nov 22, 2025

jdale said:
There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.

For comparison, there are about 2 billion ChatGPT queries per day. So Google is running 8 times that just for searches, where some significant proportion of users aren't even interested in getting their AI summaries at all.

On the other hand, a LLM query requires vastly more compute and dedicated hardware than a search query.

Seferino · Nov 22, 2025

Uncivil Servant said:
Now, maybe this is possible. I'm not an expert on hardware and I don't know enough about material science in general, much less this ultraspecific field. Those advances would be a BFD in so many other ways, though, that it would make our current computing use look as familiar as ENIAC.

Yeah, that's where a huge breakthrough in quantum computing and its applications to neural network training/inference would come in handy.

Because, as we all know, that's exactly how discoveries work: you need them to sustain your investments, so they pop up right on schedule, right?

Seferino · Nov 22, 2025

wildsman said:
This isn't the only vector of improvement. Efficieny of hardware is one thing but the models are also getting more efficient (intelligence/watt).

In 2022, a model with GPT 3 level performance cost $20.00 per million tokens. Today, that same level of capability (GPT5 Nano/Llama 3) costs roughly $0.10 per million tokens.

So that's 200x cheaper in 3 years.

Now let's do the math:

Hardware efficiency (TFlops per watt) is doubling every 2 years.
Algorithmic efficiency (same intelligence for less compute) is doubling every 6-9 months.

With just these two numbers, you'll get 1000x improvement by 2031/2032. This is without increasing capacity (total number of gpus etc) which is another vector to scale up.

Interesting points, thanks. This makes me less skeptical than when I started reading this article. Are you sure about the hardware efficiency, though? This sounds strongly like Moore's law, which stopped describing reality a while ago.

Regardless, GPU efficiency improvements and algorithm improvements are not the only requirements you need to increase capacity. You also need more RAM and HD capacity, for instance, and more bandwidth, both within the machine and between data centers. To the best of my understanding, there hasn't been any great improvement in any of these domains in the last 10 years. In particular, bandwidth needs tend to scale more than linearly, as things get more parallelized.

Let's place all of this in a worldwide situation that threatens supply chains and let's not forget that Google is not the only company attempting to achieve this, so there will be competition for upstream components.

I therefore remain rather skeptical.

sciccoso · Nov 22, 2025

SixDegrees said:
"Google tells employees it must double capacity every 6 months to meet AI demand"

Who's doing the demanding?

CEOs who tell all c-levels to introduce AI into the business for "I don't know, something!" (true story)

Tursiops Amicus · Nov 22, 2025

hillspuck said:
Even trusting those numbers, how could you tell that they want AI shoved into everything? I use ChatGPT and/or gemini weekly. But for very specific things. I don't want them in my calendar, my web browser, my operating system, my refrigerator. I don't even want it in areas where it might make sense like my voice controlled phone assistant, just because it is so damn bad at it.

But Google really want his AI to be used in anything.

fenncruz · Nov 22, 2025

archtop said:
Wow -- a new enhanced exponential Moore's Law for the next five years! (backed by physics? nope)

We can just throw an AI at the problem. That solves all current problems? Right? I totally asked chatgpt if we could violate the laws of nature and it said it was possible, so it must be true.

SnowGhost · Nov 22, 2025

ChatGPT isn't profitable with 800 million users.
Google wants to increase compute 1000x
So .... 800 billion users should do it.

Everyone needs to go out and start fucking everything they can find. OpenAI needs more (100x more than the total population) users.

Tell me again how this isn't a bubble.

Asbestos Muffins · Nov 22, 2025

these guys should know better, there are datacenters that were built 3 years ago that are waiting on grid connections because there isn't enough power. the industries that make all this equipment can't keep up with geometric scaling, its not possible. GE and Pratt won't be able to boost output of gas turbines by much because the aviation industry is already in a shortage of parts because of covid when a large amount of aero workers were let go, nuclear is just greenwashing none of them are serious about it, and renewables are dead in this country right now.

hallj78 · Nov 22, 2025

VaireWeaver said:
I love it. Keep blowing the bubble bigger. I have my popcorn and a front row seat, and my rain coat on for the mess. I hope the bursting bubble takes some of these asshat broligarchs with it. When one is only rich on stock ownership paper tied to a bubble, one tends to no longer be rich when that bubble bursts. Looking at you, Altman

The issue is that when it bursts, it will inevitably collapse the greater economy... just like the Dot Com, just like the Housing Crisis. And then, the American tax payer will be the one to pick up the bill and feel the pain. Those well-off enough can shield their savings by diversification, but the average dude out there with a 401K, just scraping by as it is is boned.

scrimbul · Nov 22, 2025

MacBrave said:
What about the communities all across the country who are fighting to stop these data centers from being built in their towns and counties? Just here in central Indiana:
https://www.indystar.com/story/news...data-center-indianapolis-indiana/87353622007/
https://www.indystar.com/story/news...wn-hall-indianapolis-development/87347734007/
https://www.indystar.com/story/news...rmland-after-google-data-centers/86547091007/
https://www.jconline.com/story/news...vote-to-standing-room-only-crowd/87088384007/
https://www.jconline.com/story/opin...v118460b0084xxd118465&gca-ft=179&gca-ds=sophi

If they want to stop those data centers, shorting the AI companies and flooding the news with reasonable arguments that the accomplishments of LLMs don't mean Jack Shit if they can't scale and their accuracy doesn't improve to the 96% thousandth decimal point from where they are at now... with less effort and time than legal battles to enforce zoning laws.

It sucks to say but waking up the non-tech-broligarch class to the fact that LLMs are a scam in the white collar space to get them to drop the contracts, including ChatGPT subs is cheaper, faster and more effective than legal injunctions. AKA working to pop the market bubble faster by playing the market financial stock market game better than the broligarchs to no one's benefit.

And that sucks.

Xangis · Nov 22, 2025

Hypatia said:
For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.

I will definitely be on the lookout for fire-sale toys for the homelab. There's a tidal wave on the horizon.

Melon of Troy · Nov 22, 2025

l3onc1t0_ said:
Yea.. this is really really really not a bubble, right guys?

Rapid expansion is the healthiest thing for bubbles, no?

DarthSlack · Nov 22, 2025

SnowGhost said:
ChatGPT isn't profitable with 800 million users.
Google wants to increase compute 1000x
So .... 800 billion users should do it.

Everyone needs to go out and start fucking everything they can find. OpenAI needs more (100x more than the total population) users.

Tell me again how this isn't a bubble.

Bingo. The WaPo has a wildly optimistic bubble/no bubble article but one of the interesting bits is that they looked at the revenue side of the AI business.

It's not a pretty picture.

Basically, the AI industry need to generate an additional $650 billion in revenue every year just to get a 10% return on their current investments. OpenAI is estimated to be at about $20 billion. Alphabet/Google's total worldwide revenue in 2024 was just under $350 billion.

Now how many people think that the Silicon Valley elite are going to be happy with just a 10% return?

Melon of Troy · Nov 22, 2025

Jeff S said:
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .

It's amazing that so few "investors" care about the fundamentals of business...

instant classic · Nov 22, 2025

Jeff S said:
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .

Grocery and housing prices are trending that way. They'll find a way.

Ooooompf · Nov 22, 2025

Unless Google is planning to fundamentally shift their entire business model, their investments in AI will cannibalise their existing search-based adserve revenue. What isn't going to increase 1000x is the amount of users you can serve ads to, or their collective buying power.

The Lurker Beneath · Nov 22, 2025

Hypatia said:
For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.

Maybe if the bubble pops, they'll eventually get re-purposed for some kick-ass MMOs!

The Lurker Beneath · Nov 22, 2025

Cloudgazer said:
It didn't even wait for AGI! We're paperclip optimizing ourselves into extinction just trying to build better stochastic parrots.

Heh, maybe it will mean the God Machine doesn't worry so much about our alignment...

wildsman · Nov 22, 2025

Missing Minute said:
You do understand my confusion seeing as it has next to nothing to do with your example, right?

What do you mean? My examples exemplified the study's findings.

Missing Minute said:
Can the conclusions about the costs of pre-training be generalized to other AI costs, namely inference?

I can do the pretraining of a gpt 3 level intelligence model (which would have cost millions for openai) for around $2k - from scratch.

SixDegrees · Nov 22, 2025

Ooooompf said:
Unless Google is planning to fundamentally shift their entire business model, their investments in AI will cannibalise their existing search-based adserve revenue. What isn't going to increase 1000x is the amount of users you can serve ads to, or their collective buying power.

This is the part I don't understand at all. Used to be that paying advertisers got the top of Google's search results page all to themselves, followed by Google's generalized results. Now, the "AI Summary" appears in place of that top spot, and one of Google's main advertiser advantages - paying to have your product shoved in consumer's faces - is lost. The advertisers are the ones paying Google's bills, they're cutting advertisers off at the knees with the AI Summary, so...how does this work?

Wheels Of Confusion · Nov 22, 2025

ej24 · Nov 22, 2025

TheMolesRevenge said:
Aside from the senior management who think it will make them more money, who actually wants AI shoved into everything?

They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.

Wheels Of Confusion · Nov 22, 2025

ej24 said:
They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.

They're already sort of doing that by default for YT content creators, creating all the incentives that gradually turn people's eyes to slop content. It's just less viewer-facing.

odikweos · Nov 22, 2025

hillspuck said:
Oh, for sure. Not just arms race fear but CEOs afraid of being fired by the board fear. Investors are gamblers at heart, always looking for the big score. Between all the people telling them it's a sure thing and the person saying it's all smoke an mirrors, they will always choose the grift.

Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.

thinkreal · Nov 22, 2025

Hypatia said:
For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.

F!yeah can you imagine the Bitcoin one of those centres could mine?!
Especially if it comes with its own refurbished coal-fired power plant.

SixDegrees · Nov 22, 2025

odikweos said:
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.

I think it's basically because they've hit the wall, and they know it. They're not going to design or innovate or invent their way around the simple fact that they've taken LLMs as far as they can go, and they will never be any better than what they are right now, because their current performance is a maximization of their fundamental underlying design. Management is trying to bluster their way past this dead end, trying to cover that their incessant demand for more and more money and resources is a shell game without a pea.

thinkreal · Nov 22, 2025

xoid said:
(Off topic I know, but)
You have peaked my interest, and I may have to try that with some leftover turkey. A turkey, peanut butter, and cranberry sauce sandwich sounds kind of good.

I'm with you on that lunch, I it might make turkey taste ok. So all those millions of people just didn't KNOW they wanted it, which is what Sam Altman would say too

oomu · Nov 22, 2025

And the climate ?

And the need to fund public health ?

And the war against Russia ? (Or with Russia and so one day against UE. After all you are amurikans, here)

hillspuck · Nov 22, 2025

odikweos said:
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.

I think you misunderstood. I said "not just". My point was in addition to yours. CEOs of different companies have different motivations than other companies. I wasn't talking about OpenAI or Anthropic. CEOs of companies that don't actually create their own AIs are the ones being driven by investors hell bent on getting those big returns when the company they are invested in uses AI to somehow magically replace its workforce or give them new capabilities to increase their growth/sales.

And the AI companies need them to be afraid of being left in the dust so they can sell their snake oil to them. Companies like Johnson & Johnson, Boeing, News Corp, etc. Big companies like that, but also much smaller companies as well.

AjaxTelamon · Nov 22, 2025

So RAM is going to be really really expensive for a little while here, and then used RAM is going to be really really cheap for a good long while (as well as servers, server racks, firewalls, etc etc).

OrsonX · Nov 22, 2025

Jeff S said:
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .

Easy! You now have to watch 16 adverts before you YouTube video plays....

Google tells employees it must double capacity every 6 months to meet AI demand

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Ars Legatus Legionis

Ars Legatus Legionis

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Wise, Aged Ars Veteran

Smack-Fu Master, in training

Seniorius Lurkius

Ars Tribunus Militum

Ars Praefectus

Ars Scholae Palatinae

Seniorius Lurkius

Ars Praefectus

Ars Centurion

Ars Centurion

Ars Legatus Legionis

Ars Centurion

Wise, Aged Ars Veteran

Smack-Fu Master, in training

Ars Tribunus Militum

Ars Tribunus Militum

Ars Tribunus Militum

Ars Legatus Legionis

Ars Legatus Legionis

Ars Tribunus Militum

Ars Legatus Legionis

Ars Praefectus

Ars Scholae Palatinae

Ars Legatus Legionis

Ars Scholae Palatinae

Ars Praetorian

Ars Scholae Palatinae

Smack-Fu Master, in training

Ars Praetorian