Gemma 4 12B uses a new encoding scheme and token prediction to punch above its weight.
See full article...
See full article...
You'd need a somewhat modern graphics card as well, and a 2014 laptop won't cut it (probably).Nah, I'll pass. My laptop is from 2014 and 16GB is all the RAM in it.
And despite the headline specifying "laptop", I still don't wanna dedicate half of my desktop's RAM to it either. It's not as if I can just go out and buy two more sticks to fill the empty slots without remortgaging the house...
Until recently, Apple was selling laptops that had a baseline of 16GB of RAM. I picked up an M5 MacBook Pro for sale that had 24GB of RAM.Pretty average consumer laptops have 16GB RAM? I can't imagine what the average Ars reader must have. o_o
I would have tried it for this comment but I haven't 18GB space free because I'm recording tutorials! NOT because I'm THAT bad at managing drive space! Really! It's true!
I wonder how the OpenAI and Anthropic IPOs will go once Wall Street learns about the onward march of local/open-weight LLMs?Google says Gemma 4 12B is unique in that it can run on many consumer laptops without sacrificing quality. As long as you’ve got a computer with 16GB of system RAM or VRAM, the 12-billion-parameter model will work.
Same as Microsoft never seriously going after Windows pirates - you want to be the default choice for the technology and to make money from companies that hire people already familiar with your tools. And of course you lock the really nice features behind the enterprise subscriptions.So, just what does Google get for releasing these models that run locally? The effort, the development time, the maintenance for an LLM that runs on someone's system, sends no data to Google, no fingerprinting, no means to target advertisements even closer, generates no revenue?
I don't know about that. They see AI becoming a commodity, running whatever model anywhere, and they're staying pretty neutral about what runs on their stuff. People build on them, they make money. Maybe they make a little more on their own stuff (e.g., Gemini), but possibly they make more long-term not trying to corner the market.I mean, that's a great question. Google is a huge company, and I think they are making the same mistake as OpenAI made. They are releasing their models under the Apache 3.0 licensing standards because they have a bunch of academia working for them, and they are used to sharing information. Google is so large that I don't think the right hand knows what the left hand is doing.
Developer mindshare - it doesn't want devs to use Qwen or LLama to run models locally - it wants its own models to dominate in that space as well. It also helps them recruit researchers and attract talent.So, just what does Google get for releasing these models that run locally? The effort, the development time, the maintenance for an LLM that runs on someone's system, sends no data to Google, no fingerprinting, no means to target advertisements even closer, generates no revenue?
Microsoft also doesn't go after cross-region which is really nice. You can buy legitimate OEM windows license stickers off alibaba for $10 each.Same as Microsoft never seriously going after Windows pirates - you want to be the default choice for the technology and to make money from companies that hire people already familiar with your tools. And of course you lock the really nice features behind the enterprise subscriptions.
My guess would be to get the association with "mobility" firmly planted in people's heads for these models. Today it's the laptop, tomorrow it's the smartwatch, Android phone, and other wearables.Okay, I'm putting on my stupid face and asking, why specifically "laptop"?
Seems to me it SHOULD run on a desktop, too, since most desktops have better hardware than most laptops, yes? What am I missing here?
For the other models you need an array of used 3090s or unified memory. This comes out to $1,000-$2,000 of additional expense you would otherwise not incur. "Free" models aren't free to most consumers due to additional hardware requirements, but this one is.My guess would be to get the association with "mobility" firmly planted in people's heads for these models. Today it's the laptop, tomorrow it's the smartwatch, Android phone, and other wearables.
But as I understand it, these are locals models, not remote models. How would Google make money off the local models?I don't know about that. They see AI becoming a commodity, running whatever model anywhere, and they're staying pretty neutral about what runs on their stuff. People build on them, they make money. Maybe they make a little more on their own stuff (e.g., Gemini), but possibly they make more long-term not trying to corner the market.
Probably pretty convoluted, but in theory increasing use of openrouter-style systems would increase demand for their TPUs, which leads to increased revenue from hardware sales.But as I understand it, these are locals models, not remote models. How would Google make money off the local models?
Loading and unloading models is pretty quick, unless you're using it constantly you'll be fine. Certainly better than burning money on cloud LLMs.Nah, I'll pass. My laptop is from 2014 and 16GB is all the RAM in it.
And despite the headline specifying "laptop", I still don't wanna dedicate half of my desktop's RAM to it either. It's not as if I can just go out and buy two more sticks to fill the empty slots without remortgaging the house...
Amazon got screwed on ridiculous cloud computing expense for Alexa devices just a few years ago.My guess would be to get the association with "mobility" firmly planted in people's heads for these models. Today it's the laptop, tomorrow it's the smartwatch, Android phone, and other wearables.
Same thing that investing in Chrome bought them. They want to control the space so that a decade later they can extract money when people are too deeply invested to leave.So, just what does Google get for releasing these models that run locally? The effort, the development time, the maintenance for an LLM that runs on someone's system, sends no data to Google, no fingerprinting, no means to target advertisements even closer, generates no revenue?
There are plenty of people that are running models essentially locally on virtualized platforms. This keeps the processing local for sensitive workloads where even contractual promises that an AI company won't use it for training aren't enough. At my company, especially with usage-based pricing becoming so popular among AI companies, they're looking at it as a way to control costs for certain RAG and MCP tasks, and to potentially to limit costs around agent use.But as I understand it, these are locals models, not remote models. How would Google make money off the local models?
It was Firefox that unseated the IE monopoly. Chrome had nothing to do with that.Same thing that investing in Chrome bought them. They want to control the space so that a decade later they can extract money when people are too deeply invested to leave.
With Chrome and the web they built a browser that unseated IE's monopoly and now it's the #1 browser in the world. It's the lens by which most people interact with the web. Now they are abusing that dominance to support their actual business which is selling user data, ostensibly for targeted advertisements but realistically for any willing buyer.
ETA: Tbc, we're in the honeymoon phase so I'm not saying there's any risk to using this model locally.
Edge is chromium, so Chromium effectively has a monopoly again.It was Firefox that unseated the IE monopoly. Chrome had nothing to do with that.
Without Firefox there may have never been a Chrome. Google used their monopoly power to gain what is dangerously close to being a monopoly again.
I didn't see anything in the article that stated that.You'd need a somewhat modern graphics card as well, and a 2014 laptop won't cut it (probably).
Do they still do that thing where they throw in a crappy mouse because you can’t sell an OEM license unless it comes with hardware?Microsoft also doesn't go after cross-region which is really nice. You can buy legitimate OEM windows license stickers off alibaba for $10 each.
The Benchmarks here indeed put it between E4B and 26B-A4B: https://huggingface.co/google/gemma-4-12B#benchmark-resultsIf this model could fill the gap between the E4B model and the larger 26B MOE, that would be awesome (E4B was interesting for me, but wasn’t that useful, whereas 26B handles 95% of my AI use right now).
Nope I get sheets of physical stickers.Do they still do that thing where they throw in a crappy mouse because you can’t sell an OEM license unless it comes with hardware?
Good thing I don't do that either!Loading and unloading models is pretty quick, unless you're using it constantly you'll be fine. Certainly better than burning money on cloud LLMs.
Which other models are you referring to specifically?For the other models you need an array of used 3090s or unified memory. This comes out to $1,000-$2,000 of additional expense you would otherwise not incur. "Free" models aren't free to most consumers due to additional hardware requirements, but this one is.
If you don't use LLMs why are you complaining about the inability to use this LLM on your hardware?Good thing I don't do that either!
I sometimes wonder what life is like in the universe where Microsoft chose Gecko instead of Blink. Because this universe has been on a pretty bad trajectory since Edgium was released in 2019.Edge is chromium, so Chromium effectively has a monopoly again.