Some report burning through their whole monthly "AI credit" allotment in a single day.
See full article...
See full article...
Or... instead of jumping through increasingly convoluted hoops to make the supposedly super sophisticated tool kinda sorta work you could - and bear with me here - just write some code?We went through something similar at work a few months ago with Cursor. My boss is all in on AI coding. And frankly its becoming indispensable annoyingly quickly. But you cant be naive about usage.
He got the whole team using it and we burned through our monthly allotment in a week. Whoops. Then I learned that keeping an agent around for multiple requests increases the context and hence the cost. And frontier models while better are ten times as expensive.
Im now careful to use cheaper models and spawn new agents frequently to control context. I only use Cursor (usually on auto occasionally on a recent Claude model) for coding tasks. Anything else that doesnt need the context of my whole codebase I use Chat GPT which is a flat monthly fee.
Its slightly annoying but works well for me without breaking the bank. A coworker is more sophisticated. She has three tiers of models. She has a smart model generate the overall plan. After review she has a midtier model break that down into simple tasks that she assigns to fairly stupid agents. She's able to get top tier results without top tier pricing by ensuring she only allows the appropriate agent to do each job. I think thats the future. We need better automation and instrumentation around agent use so we can tune our desired mix of cost vs quality.
My gut feeling is that the free local tools are about to suddenly go bye-bye. There’s no way that the industry can afford to keep a) absorbing training costs and b) undercutting their own business models now that the bubble is finally popping.
I believe models developed in EU is mostly based on public eu funding meaning free models. So maybe some models will not be free to run locally but I believe some will stay free.My gut feeling is that the free local tools are about to suddenly go bye-bye. There’s no way that the industry can afford to keep a) absorbing training costs and b) undercutting their own business models now that the bubble is finally popping.
I agree with your general principle about technological improvement, but 2005 is an odd date to choose. In fact, I think someone in 2005 would be disappointed by today's computing hardware.
In 1985, Apple fans were likely using an Apple IIe with 64 KB of RAM, a 1MHz processor, and an 80-column text display. A 20MB hard disk was an expensive upgrade.
Twenty years later, in 2005, the iMac G5 had a 2GHz processor, 512 MB RAM, and a 250 GB HDD. That is a 8192x increase in RAM, a 2000x increase in clock speed, and a 12500x increase in disk capacity. This was achieved by a steady and constant improvement in computing power between 1985 and 2005.
Just extrapolating out the trend, someone in 2005 might expect that by 2025, mainstream computers would feature a 4000GHz processor with 4TB RAM and a 3000TB of storage.
In reality, 2005 is around the time that computing performance improvements began to grind to a halt.
Still vandalism. No need to double-collapse your takes, geeze.I left the original text there, what's the problem?
This was the entire premise of "The Singularity" that popular AI and Transhumanist boosters were pushing on the public for decades. The whole thing is "Once we have human-level artificial intelligence, we'll have it design better-than-human artificial intelligence, and then that will make even smarter bots, and upwards asymptotically, solving all the world's problems with genius artificial minds along the way..."I wonder if at some level the model was intended to be "make an AGI and then get the AGI to figure out how to make itself cheaper".
I'm an engineering director at a biggish tech company that you have probably heard of (I started out as a developer ~20y ago, so don't hate me), and -- like pretty much every other company in the industry -- we're under immense pressure from our board / major shareholders (and, through them, the C-level) to adopt agentic AI to the maximum possible extent. Nobody is saying it out loud here yet, but this is very obviously going to lead to what we euphemistically call "savings", i.e. significant reductions in headcount. The tools (we're mainly using Claude Code) are undeniably useful, but the downsides scare me.
Firstly, we're aiming for 100% of our code to be written by AI (the new mantra is "code is free"), with our more senior engineers acting as reviewers, not authors. This pretty much dooms our junior engineers. We're going to be kicking out the bottom two or three rungs of the career ladder for software developers. The obvious problem here is that senior engineers only exist because they were once junior engineers -- if the talent pipeline is destroyed, there won't be any senior engineers in future.
Secondly, as has been well-discussed in this thread, the costs of the tools are certain to increase (a lot); nobody seems to have any idea of what the upper bound on costs is likely to be, and everybody assumes that the economics will still somehow make sense. There is a lot of suspension of disbelief / magical thinking going on among the boards/C-level across the industry.
Thirdly, there is an implicit assumption that a company's profitability is directly related to the amount of code it can generate -- 10x more code must mean 10x more revenue, right? Right? This seems laughably naive -- many/most of the other processes involved in getting a product to market (marketing, legal/contracting, the sales process, etc) are far harder to accelerate with AI than raw code generation. Even if we could use AI to bring products to market 10x faster, the market isn't suddenly going to grow to accommodate all these exciting new products.
Very selfishly, I'm glad that I'm closer to the end of my career than to the start. I hope we'll find a way through this without leaving a huge number of people on the scrapheap, but I'm not optimistic.
That's not the scary part. The scary part is that their leadership clearly doesn't match your description despite it being their literal job.You know what's REALLY alarming? Your 20 years of experience are totally unnecessary to be able to raise those "concerns". They are blatantly obvious for anyone with half a brain who thinks through it for 5 minutes.
If you assume it was a scam from the beginning (bear with me)God forbid people realize the cost of the thing they've been using. This was, of course, inevitable.
All that VC / hyperscaler money that had been getting burnt on customer acquisition and experimenting was going to demand ROI at some point.. and that point is now.
2026: The year the AI bill came due.
It turns out you can't run a massive cash burn hopes-and-dreams machine forever without consequences. It's no small coincidence that that OpenAI / Anthropic are rushing to IPO before the costs catch up with everyone waking up to the real costs of the LLMs.
Are you equating charging a market price for services as enshitification?Is enshittification a good thing if it triggers a desloppification?
I’m sure the CCP is spending a fortune to make it trivially easy for lazy, ignorant people to generate reams of code they don’t understand and will never audit purely out of the goodness of their hearts.If China is smart, they will do their utmost best to keep providing open weights that approach frontline models as closely as possible. The ROI for them is all worth it x1000.
This bubble has plenty of pricks involved already.bubbles need a prick or two to make them pop … this is gonna be good!
Isn't Musk reputed to have a problem in that department?This bubble has plenty of pricks involved already.
… surely you forgot the /sHow can they go bye-bye? They are posted online, able to be downloaded and used by anyone, and once something is posted on the internet, free, it's not going away.
I disagree, there is a need, I no longer stand by those words, I need to make sure that people reading them know this with absolute certainty before they do.Still vandalism. No need to double-collapse your takes, geeze.
I hate when my prompts are "compacted" during long chats, specially during the planning phase. The first thing to get dropped as irrelevant is somehow, always, something critical or important to the design. Or some nuanced view on something that'll have severe impact downstream. Nope. Much easier to spend extra time rewriting the original prompt to incorporate elements from the first round of results and start anew.Inefficient tokenmaxxing design of the context window where it scans your entire chat every time you prompt it instead of simplifying to a smaller summary and using that context instead is driving up costs for the consumer.
There is still major engineering to optimize the LLM stack, and they want us burning even more with autonomous agents? Where is the money coming from for this?
Its bad enough when I'm simply wasting my time with a bad outcome in a vibe coded SQL query that doesnt run or gives me the wrong result and I have to QA for a while. If that blew through my budget for the month I'm not going to be satisfied to keep paying for this slop.
I hate when my prompts are "compacted" during long chats, specially during the planning phase. The first thing to get dropped as irrelevant is somehow, always, something critical or important to the design. Or some nuanced view on something that'll have severe impact downstream. Nope. Much easier to spend extra time rewriting the original prompt to incorporate elements from the first round of results and start anew.
I certainly cannot imagine a Microsoft product having a bug that causes the amount of money paid from you to Microsoft inside the product to be ten times what it would cost if paid via other channels. Microsoft is not capable of that level of {cunning; incompetence; malevolence; stupidity; fiendishness}. You must be imagining it.The new usage-based billing frustrations are being massively compounded by some rather egregious bugs on Microsoft's end. In VSCode, the "open in agents" feature auto-switches to specific models, even if you manually set it to something else. Additionally, I'm currently unable to even select a non-Claude model in the "chat" mode.
This is compounded by some apparent bugs in the "open in agents" (which appears to be Microsoft's attempt at a Claude Code) that cause token usage to absolute explode. Making queries in Copilot is currently consuming roughly 10x the number of tokens as the same request directly in Claude Code (both within VSCode). Runtime for Copilot also seems to have increased substantially. This could all just be an "open in agents" bug (and might have been there since its introduction - we just haven't noticed due to the billing being the way it was). I haven't really used the chat sidebar for a while, so it might have similar token usage to Claude Code.
Copilot is completely useless right now, but Claude Code using the same token pricing mechanism (ostensibly) works very well for me.
That's nothing really new though. Ed Zitron calls them business idiots (though I think he got that term from somewhere else). The business world is filled with them. Especially at larger companies where size and momentum can paper over a lot of incompetence.That's not the scary part. The scary part is that their leadership clearly doesn't match your description despite it being their literal job.
No I didn't. How can something that's posted for free, online, available to download for anyone, be going away? You do realize that local models are different then the free cloud-based models, right?… surely you forgot the /s
I think OpenAI's and Anthropic's IPOs will tell us more. For now, we need to believe the numbers they publish. But then they have to disclose them for real.I believe very strongly the bubble will pop when prices are raised and the quality doesn't increase. We're really close to that limit now. I just hope people will realize it popped after November.
Because I’m nearly sixty and have seen countless free stuff online ‘disappear’No I didn't. How can something that's posted for free, online, available to download for anyone, be going away? You do realize that local models are different then the free cloud-based models, right?
I recommend downloading GPT4All or Jan and just messing around with some models from Hugging Face. There's always new models out there, and there are literally thousands that can run on a modern mid-range consumer notebook (i.e. MacBook Pro M5)Ars, it'd be cool to see a good in-depth article that reviews and runs through the small language models people can run at home on their own hardware.![]()
A botched gender-confirming surgery left it deformed, which is why he couldn't get people to touch it even if he offered to buy them a horse. Also explains why his children are test-tube babies with names from their vials (X-AE88 or whatever).Isn't Musk reputed to have a problem in that department?
No we don't. See https://www.flyingpenguin.com/wheres-ed-anthropic-told-court-5-billion-but-public-19-billion/For now, we need to believe the numbers they publish
By no one sharing them online anymore?No I didn't. How can something that's posted for free, online, available to download for anyone, be going away?
Have you ever been to Hugging Face? You can find the original OSS-20b model at https://huggingface.co/openai/gpt-oss-20b . Because it's free and open source, people take the model, and create their own versions of it.Because I’m nearly sixty and have seen countless free stuff online ‘disappear’
Hah I'm still running a 2012 PC too. It's on its 2nd CPU and 3rd HDD. It's no longer my primary, but it's fine as a living room PC.Nitpicking maybe, but I'd put it a bit later, when multi-core 64 bit processors became really mainstream, maybe around 2008ish. Of course there's also the argument that SSDs becoming standard at virtually all levels was a real performance game changer, which would be around 2015ish. I've got 2012 era hardware which has been given a whole new lease of life by swapping the HDDs for SSDs. It really depends if you're focusing on specifics like CPU core speed or overall system performance.
Oh, I bet they will, especially for smaller places and some start-ups. Maybe migration to cheaper models will ensue, but the true market competitive nature of agentic programming services looks like it's about to get underway for real, now.And my employer is pushing for us to use AI to do more coding. I wonder if they’ll change their tune when the full bill comes due?