Google tells employees it must double capacity every 6 months to meet AI demand

Missing Minute

Wise, Aged Ars Veteran
1,386
No this is the right paper - here is the relevant portion:

"Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months, substantially faster than hardware gains per Moore's Law."


The Nano/LLAMA 3 was an example I used to illustrate the general point.
You do understand my confusion seeing as it has next to nothing to do with your example, right?

Can the conclusions about the costs of pre-training be generalized to other AI costs, namely inference?
 
Upvote
12 (13 / -1)

Missing Minute

Wise, Aged Ars Veteran
1,386
I will grant that it is a cynical, pessimistic insight that doesn't encourage good practices but I've found it true more often than not.
That's fair, though it can be a bit of a self fulfilling prophecy. It's also a lot easier to reason with someone irl than it is online. Finally, it takes a lot of skill, kindness, patience, understanding, and respect.
 
Last edited:
Upvote
1 (2 / -1)

jdale

Ars Legatus Legionis
18,402
Subscriptor
It’s unclear how much of this “demand” Google mentioned represents organic user interest in AI capabilities versus the company integrating AI features into existing services like Search, Gmail, and Workspace.
There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.

For comparison, there are about 2 billion ChatGPT queries per day. So Google is running 8 times that just for searches, where some significant proportion of users aren't even interested in getting their AI summaries at all.
 
Upvote
24 (24 / 0)

jdale

Ars Legatus Legionis
18,402
Subscriptor
Upvote
4 (4 / 0)

Seferino

Wise, Aged Ars Veteran
129
There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.

For comparison, there are about 2 billion ChatGPT queries per day. So Google is running 8 times that just for searches, where some significant proportion of users aren't even interested in getting their AI summaries at all.
On the other hand, a LLM query requires vastly more compute and dedicated hardware than a search query.
 
Upvote
7 (8 / -1)

Seferino

Wise, Aged Ars Veteran
129
Now, maybe this is possible. I'm not an expert on hardware and I don't know enough about material science in general, much less this ultraspecific field. Those advances would be a BFD in so many other ways, though, that it would make our current computing use look as familiar as ENIAC.
Yeah, that's where a huge breakthrough in quantum computing and its applications to neural network training/inference would come in handy.

Because, as we all know, that's exactly how discoveries work: you need them to sustain your investments, so they pop up right on schedule, right?
 
Upvote
19 (19 / 0)

Seferino

Wise, Aged Ars Veteran
129
This isn't the only vector of improvement. Efficieny of hardware is one thing but the models are also getting more efficient (intelligence/watt).

In 2022, a model with GPT 3 level performance cost $20.00 per million tokens. Today, that same level of capability (GPT5 Nano/Llama 3) costs roughly $0.10 per million tokens.

So that's 200x cheaper in 3 years.

Now let's do the math:

Hardware efficiency (TFlops per watt) is doubling every 2 years.
Algorithmic efficiency (same intelligence for less compute) is doubling every 6-9 months.

With just these two numbers, you'll get 1000x improvement by 2031/2032. This is without increasing capacity (total number of gpus etc) which is another vector to scale up.

Interesting points, thanks. This makes me less skeptical than when I started reading this article. Are you sure about the hardware efficiency, though? This sounds strongly like Moore's law, which stopped describing reality a while ago.

Regardless, GPU efficiency improvements and algorithm improvements are not the only requirements you need to increase capacity. You also need more RAM and HD capacity, for instance, and more bandwidth, both within the machine and between data centers. To the best of my understanding, there hasn't been any great improvement in any of these domains in the last 10 years. In particular, bandwidth needs tend to scale more than linearly, as things get more parallelized.

Let's place all of this in a worldwide situation that threatens supply chains and let's not forget that Google is not the only company attempting to achieve this, so there will be competition for upstream components.

I therefore remain rather skeptical.
 
Upvote
12 (16 / -4)
Even trusting those numbers, how could you tell that they want AI shoved into everything? I use ChatGPT and/or gemini weekly. But for very specific things. I don't want them in my calendar, my web browser, my operating system, my refrigerator. I don't even want it in areas where it might make sense like my voice controlled phone assistant, just because it is so damn bad at it.
But Google really want his AI to be used in anything.
 
Upvote
3 (3 / 0)

fenncruz

Ars Tribunus Militum
1,791
Wow -- a new enhanced exponential Moore's Law for the next five years! (backed by physics? nope)
We can just throw an AI at the problem. That solves all current problems? Right? I totally asked chatgpt if we could violate the laws of nature and it said it was possible, so it must be true.
 
Last edited:
Upvote
4 (4 / 0)
these guys should know better, there are datacenters that were built 3 years ago that are waiting on grid connections because there isn't enough power. the industries that make all this equipment can't keep up with geometric scaling, its not possible. GE and Pratt won't be able to boost output of gas turbines by much because the aviation industry is already in a shortage of parts because of covid when a large amount of aero workers were let go, nuclear is just greenwashing none of them are serious about it, and renewables are dead in this country right now.
 
Upvote
19 (19 / 0)
I love it. Keep blowing the bubble bigger. I have my popcorn and a front row seat, and my rain coat on for the mess. I hope the bursting bubble takes some of these asshat broligarchs with it. When one is only rich on stock ownership paper tied to a bubble, one tends to no longer be rich when that bubble bursts. Looking at you, Altman

The issue is that when it bursts, it will inevitably collapse the greater economy... just like the Dot Com, just like the Housing Crisis. And then, the American tax payer will be the one to pick up the bill and feel the pain. Those well-off enough can shield their savings by diversification, but the average dude out there with a 401K, just scraping by as it is is boned.
 
Upvote
9 (10 / -1)
If they want to stop those data centers, shorting the AI companies and flooding the news with reasonable arguments that the accomplishments of LLMs don't mean Jack Shit if they can't scale and their accuracy doesn't improve to the 96% thousandth decimal point from where they are at now... with less effort and time than legal battles to enforce zoning laws.

It sucks to say but waking up the non-tech-broligarch class to the fact that LLMs are a scam in the white collar space to get them to drop the contracts, including ChatGPT subs is cheaper, faster and more effective than legal injunctions. AKA working to pop the market bubble faster by playing the market financial stock market game better than the broligarchs to no one's benefit.

And that sucks.
 
Upvote
10 (10 / 0)

DarthSlack

Ars Legatus Legionis
23,455
Subscriptor++
ChatGPT isn't profitable with 800 million users.
Google wants to increase compute 1000x
So .... 800 billion users should do it.


Everyone needs to go out and start fucking everything they can find. OpenAI needs more (100x more than the total population) users.


Tell me again how this isn't a bubble.

Bingo. The WaPo has a wildly optimistic bubble/no bubble article but one of the interesting bits is that they looked at the revenue side of the AI business.

It's not a pretty picture.

Basically, the AI industry need to generate an additional $650 billion in revenue every year just to get a 10% return on their current investments. OpenAI is estimated to be at about $20 billion. Alphabet/Google's total worldwide revenue in 2024 was just under $350 billion.

Now how many people think that the Silicon Valley elite are going to be happy with just a 10% return?
 
Upvote
26 (26 / 0)
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .
It's amazing that so few "investors" care about the fundamentals of business...
 
Upvote
17 (17 / 0)
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .
Grocery and housing prices are trending that way. They'll find a way.
 
Upvote
8 (8 / 0)

wildsman

Ars Tribunus Militum
1,800
You do understand my confusion seeing as it has next to nothing to do with your example, right?
What do you mean? My examples exemplified the study's findings.
Can the conclusions about the costs of pre-training be generalized to other AI costs, namely inference?
I can do the pretraining of a gpt 3 level intelligence model (which would have cost millions for openai) for around $2k - from scratch.
 
Upvote
-18 (3 / -21)

SixDegrees

Ars Legatus Legionis
48,614
Subscriptor
Unless Google is planning to fundamentally shift their entire business model, their investments in AI will cannibalise their existing search-based adserve revenue. What isn't going to increase 1000x is the amount of users you can serve ads to, or their collective buying power.
This is the part I don't understand at all. Used to be that paying advertisers got the top of Google's search results page all to themselves, followed by Google's generalized results. Now, the "AI Summary" appears in place of that top spot, and one of Google's main advertiser advantages - paying to have your product shoved in consumer's faces - is lost. The advertisers are the ones paying Google's bills, they're cutting advertisers off at the knees with the AI Summary, so...how does this work?
 
Upvote
23 (24 / -1)
Post content hidden for low score. Show…

Wheels Of Confusion

Ars Legatus Legionis
75,913
Subscriptor
unvuogj.jpg
 
Upvote
26 (26 / 0)

ej24

Ars Tribunus Militum
1,800
Aside from the senior management who think it will make them more money, who actually wants AI shoved into everything?
They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.
 
Upvote
16 (17 / -1)

Wheels Of Confusion

Ars Legatus Legionis
75,913
Subscriptor
They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.
They're already sort of doing that by default for YT content creators, creating all the incentives that gradually turn people's eyes to slop content. It's just less viewer-facing.
 
Upvote
13 (13 / 0)
Oh, for sure. Not just arms race fear but CEOs afraid of being fired by the board fear. Investors are gamblers at heart, always looking for the big score. Between all the people telling them it's a sure thing and the person saying it's all smoke an mirrors, they will always choose the grift.
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.
 
Upvote
-7 (2 / -9)

thinkreal

Ars Scholae Palatinae
698
For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.
F!yeah can you imagine the Bitcoin one of those centres could mine?!
Especially if it comes with its own refurbished coal-fired power plant.
😜
 
Upvote
0 (3 / -3)

SixDegrees

Ars Legatus Legionis
48,614
Subscriptor
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.
I think it's basically because they've hit the wall, and they know it. They're not going to design or innovate or invent their way around the simple fact that they've taken LLMs as far as they can go, and they will never be any better than what they are right now, because their current performance is a maximization of their fundamental underlying design. Management is trying to bluster their way past this dead end, trying to cover that their incessant demand for more and more money and resources is a shell game without a pea.
 
Upvote
14 (16 / -2)

thinkreal

Ars Scholae Palatinae
698
(Off topic I know, but)
You have peaked my interest, and I may have to try that with some leftover turkey. A turkey, peanut butter, and cranberry sauce sandwich sounds kind of good.
I'm with you on that lunch, I it might make turkey taste ok. So all those millions of people just didn't KNOW they wanted it, which is what Sam Altman would say too 😜
 
Upvote
0 (0 / 0)

hillspuck

Ars Scholae Palatinae
2,179
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.

But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.
I think you misunderstood. I said "not just". My point was in addition to yours. CEOs of different companies have different motivations than other companies. I wasn't talking about OpenAI or Anthropic. CEOs of companies that don't actually create their own AIs are the ones being driven by investors hell bent on getting those big returns when the company they are invested in uses AI to somehow magically replace its workforce or give them new capabilities to increase their growth/sales.

And the AI companies need them to be afraid of being left in the dust so they can sell their snake oil to them. Companies like Johnson & Johnson, Boeing, News Corp, etc. Big companies like that, but also much smaller companies as well.
 
Upvote
7 (7 / 0)
Post content hidden for low score. Show…
Post content hidden for low score. Show…
Post content hidden for low score. Show…
Can someone explain to me where a 1000X increase in money/revenue will come from?

It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .
Easy! You now have to watch 16 adverts before you YouTube video plays....
 
Upvote
13 (13 / 0)