Google's AI infrastructure chief tells staff it needs thousandfold capacity increase in 5 years.
See full article...
See full article...
You do understand my confusion seeing as it has next to nothing to do with your example, right?No this is the right paper - here is the relevant portion:
"Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months, substantially faster than hardware gains per Moore's Law."
The Nano/LLAMA 3 was an example I used to illustrate the general point.
That's fair, though it can be a bit of a self fulfilling prophecy. It's also a lot easier to reason with someone irl than it is online. Finally, it takes a lot of skill, kindness, patience, understanding, and respect.I will grant that it is a cynical, pessimistic insight that doesn't encourage good practices but I've found it true more often than not.
There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.It’s unclear how much of this “demand” Google mentioned represents organic user interest in AI capabilities versus the company integrating AI features into existing services like Search, Gmail, and Workspace.
Then you might love this park in Seattle: https://www.alamy.com/stock-photo/g...96F2EFFF64219AA2A2F090ED6B363&sortBy=relevantAs disturbing as this stuff is, I have to admit a strange fascination with that image of the Oregon data center's cooling pipes. I might be looking at my next weird desktop background...
Investors, presumably."Google tells employees it must double capacity every 6 months to meet AI demand"
Who's doing the demanding?
On the other hand, a LLM query requires vastly more compute and dedicated hardware than a search query.There are maybe 13-16 billion searches conducted on Google per day. Just think how much less compute they would need if you had to deliberately click an AI button after each search instead of running it automatically.
For comparison, there are about 2 billion ChatGPT queries per day. So Google is running 8 times that just for searches, where some significant proportion of users aren't even interested in getting their AI summaries at all.
Yeah, that's where a huge breakthrough in quantum computing and its applications to neural network training/inference would come in handy.Now, maybe this is possible. I'm not an expert on hardware and I don't know enough about material science in general, much less this ultraspecific field. Those advances would be a BFD in so many other ways, though, that it would make our current computing use look as familiar as ENIAC.
This isn't the only vector of improvement. Efficieny of hardware is one thing but the models are also getting more efficient (intelligence/watt).
In 2022, a model with GPT 3 level performance cost $20.00 per million tokens. Today, that same level of capability (GPT5 Nano/Llama 3) costs roughly $0.10 per million tokens.
So that's 200x cheaper in 3 years.
Now let's do the math:
Hardware efficiency (TFlops per watt) is doubling every 2 years.
Algorithmic efficiency (same intelligence for less compute) is doubling every 6-9 months.
With just these two numbers, you'll get 1000x improvement by 2031/2032. This is without increasing capacity (total number of gpus etc) which is another vector to scale up.
CEOs who tell all c-levels to introduce AI into the business for "I don't know, something!" (true story)"Google tells employees it must double capacity every 6 months to meet AI demand"
Who's doing the demanding?
But Google really want his AI to be used in anything.Even trusting those numbers, how could you tell that they want AI shoved into everything? I use ChatGPT and/or gemini weekly. But for very specific things. I don't want them in my calendar, my web browser, my operating system, my refrigerator. I don't even want it in areas where it might make sense like my voice controlled phone assistant, just because it is so damn bad at it.
We can just throw an AI at the problem. That solves all current problems? Right? I totally asked chatgpt if we could violate the laws of nature and it said it was possible, so it must be true.Wow -- a new enhanced exponential Moore's Law for the next five years! (backed by physics? nope)
I love it. Keep blowing the bubble bigger. I have my popcorn and a front row seat, and my rain coat on for the mess. I hope the bursting bubble takes some of these asshat broligarchs with it. When one is only rich on stock ownership paper tied to a bubble, one tends to no longer be rich when that bubble bursts. Looking at you, Altman
If they want to stop those data centers, shorting the AI companies and flooding the news with reasonable arguments that the accomplishments of LLMs don't mean Jack Shit if they can't scale and their accuracy doesn't improve to the 96% thousandth decimal point from where they are at now... with less effort and time than legal battles to enforce zoning laws.What about the communities all across the country who are fighting to stop these data centers from being built in their towns and counties? Just here in central Indiana:
https://www.indystar.com/story/news...data-center-indianapolis-indiana/87353622007/
https://www.indystar.com/story/news...wn-hall-indianapolis-development/87347734007/
https://www.indystar.com/story/news...rmland-after-google-data-centers/86547091007/
https://www.jconline.com/story/news...vote-to-standing-room-only-crowd/87088384007/
https://www.jconline.com/story/opin...v118460b0084xxd118465&gca-ft=179&gca-ds=sophi
I will definitely be on the lookout for fire-sale toys for the homelab. There's a tidal wave on the horizon.For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.
Rapid expansion is the healthiest thing for bubbles, no?Yea.. this is really really really not a bubble, right guys?
ChatGPT isn't profitable with 800 million users.
Google wants to increase compute 1000x
So .... 800 billion users should do it.
Everyone needs to go out and start fucking everything they can find. OpenAI needs more (100x more than the total population) users.
Tell me again how this isn't a bubble.
It's amazing that so few "investors" care about the fundamentals of business...Can someone explain to me where a 1000X increase in money/revenue will come from?
It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .
Grocery and housing prices are trending that way. They'll find a way.Can someone explain to me where a 1000X increase in money/revenue will come from?
It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .
For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.
It didn't even wait for AGI! We're paperclip optimizing ourselves into extinction just trying to build better stochastic parrots.
What do you mean? My examples exemplified the study's findings.You do understand my confusion seeing as it has next to nothing to do with your example, right?
I can do the pretraining of a gpt 3 level intelligence model (which would have cost millions for openai) for around $2k - from scratch.Can the conclusions about the costs of pre-training be generalized to other AI costs, namely inference?
This is the part I don't understand at all. Used to be that paying advertisers got the top of Google's search results page all to themselves, followed by Google's generalized results. Now, the "AI Summary" appears in place of that top spot, and one of Google's main advertiser advantages - paying to have your product shoved in consumer's faces - is lost. The advertisers are the ones paying Google's bills, they're cutting advertisers off at the knees with the AI Summary, so...how does this work?Unless Google is planning to fundamentally shift their entire business model, their investments in AI will cannibalise their existing search-based adserve revenue. What isn't going to increase 1000x is the amount of users you can serve ads to, or their collective buying power.
They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.Aside from the senior management who think it will make them more money, who actually wants AI shoved into everything?
They're already sort of doing that by default for YT content creators, creating all the incentives that gradually turn people's eyes to slop content. It's just less viewer-facing.They'll slowly find malicious ways to force us. Gmail no longer automatically sorts out "promotions", "social" and other categories unless you agree to full AI scanning of your email, drive and other Google services. What used to be a handy auto-sort filter feature is now locked behind an AI use agreement. They'll keep doing this until their services are miserable to use without agreeing to AI everywhere.
Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.Oh, for sure. Not just arms race fear but CEOs afraid of being fired by the board fear. Investors are gamblers at heart, always looking for the big score. Between all the people telling them it's a sure thing and the person saying it's all smoke an mirrors, they will always choose the grift.
F!yeah can you imagine the Bitcoin one of those centres could mine?!For those interested in future investment opportunities, you might keep an eye on data centers. They’ll be going on sale eventually.
I think it's basically because they've hit the wall, and they know it. They're not going to design or innovate or invent their way around the simple fact that they've taken LLMs as far as they can go, and they will never be any better than what they are right now, because their current performance is a maximization of their fundamental underlying design. Management is trying to bluster their way past this dead end, trying to cover that their incessant demand for more and more money and resources is a shell game without a pea.Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.
But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.
I'm with you on that lunch, I it might make turkey taste ok. So all those millions of people just didn't KNOW they wanted it, which is what Sam Altman would say too(Off topic I know, but)
You have peaked my interest, and I may have to try that with some leftover turkey. A turkey, peanut butter, and cranberry sauce sandwich sounds kind of good.
I think you misunderstood. I said "not just". My point was in addition to yours. CEOs of different companies have different motivations than other companies. I wasn't talking about OpenAI or Anthropic. CEOs of companies that don't actually create their own AIs are the ones being driven by investors hell bent on getting those big returns when the company they are invested in uses AI to somehow magically replace its workforce or give them new capabilities to increase their growth/sales.Your point of view explains OpenAI and Anthropic as well as the host of lesser companies. They are important players in the game to be sure.
But explaining the actions of Google and Microsoft (and Google is the focus of this article) takes more than that, because they and their shareholders are well past the gambling enthusiasm phase. They are blue chip stocks. The attitude is different and insisting on applying the wrong model for their choices will leave you wrong almost all the time. NVIDIA is also not a gambling company. At this point the big players in the game include leviathans that are acting like the upstarts because.. why? I believe it's fear.
Easy! You now have to watch 16 adverts before you YouTube video plays....Can someone explain to me where a 1000X increase in money/revenue will come from?
It's really hard to believe they can get 1000X more revenue off of advertisers than they already do. Because that would suggest the consumer public needs to spend 1000X more. . .