Apple and Amazon are two of the biggest players in home voice assistants (hardware and software). Both have struggled for years to develop a (arguable) compelling voice product. Each is taking a very different approach.You want to compare them to the worst "AI" assistant on the market, why??
Well, Apple are definitely being smarter burning less money on a money-losing industry, but at least Alexa has a bigger cash-fireFunny how Apple’s caution seems to be borne out by Amazon’s experience.
Personally, I’d rather have tiny, incremental, predictable improvements in on-device agents with abilities to do real-world things.
There’s always ChatGPT if you want to…chat.
Yes, I'm glad Ars at least told it straight in the article but if the entire media could start saying "blatantly false BS" instead of "hallucinations" that would be great. Would help this dumb bubble pop quicker hopefully too.I still find it funny they say "hallucinate" instead of "spews random bullshit".
Also, things like "hallucinations have been near zero" are meaningless. If there is a 0.3% chance that it spews bullshit, but does 10 million tasks a day, that means its gonna be making alot of mistakes every day.
THIS needs to be hammered into everyone's head again and again. The fundamentals on which it works, don't contain any "mistakes" that can be "fixed" or "improved" to specifically address that aspect. You gotta start from scratch, and whatever successes you had over the last 5 years, are possibly and probably not very predictive.Saying "we want to eliminate hallucinations from LLMs" is a meaningless statement. All LLMs do is "hallucinate". When whatever they make up has some overlap with reality we then attribute intelligence to them, and when it doesn't, we call it a "hallucination", instead of what it really is, which is working as designed.
“The reliability is the issue—getting it to be working close to 100 percent of the time,” the employee added.
For some time Siri has been unable to play regular old playlists - especially in CarPlay mode. Now it has 'improved' its behavior by playing near hits from Apple Music - which I don't subscribe to. Doesn't do this all of the time, sometimes it just plays any ol random song.I, for one, just want to know how much it's going to cost. I use my Alexa for time, date, weather, cooking measurements, reminders, and timers, and that's pretty much it. Alexa refuses to play my saved music collection, tiering to their paid services, which was something it used to do, but now been blocked and monetized, seriously leading me to seek FOSS solutions, but I haven't found one yet. All this Alexa AI stuff means is a new subscription, and i'm not paying.
It's funny watching LLMs get smarter and smarter while still being unable to do the only thing anyone wants.
I feel like we're going to wind up accidentally building God trying to get a dumb speaker to tell us who won the football game.
So you are saying that everything's a hammer (referrring to Abraham Maslow's quote)?THIS needs to be hammered into everyone's head again and again. The fundamentals on which it works, don't contain any "mistakes" that can be "fixed" or "improved" to specifically address that aspect. You gotta start from scratch, and whatever successes you had over the last 5 years, are possibly and probably not very predictive.
Prasad, the former chief architect of Alexa, said last month’s release of the company’s in-house Amazon Nova models—led by his AGI team—was in part motivated by the specific needs for optimum speed, cost, and reliability, in order to help AI applications such as Alexa “get to that last mile, which is really hard.”

We're almost ready, all we have to do is solve the #1 problem facing the industry, a problem that arises directly from the very foundations of the technological approach we are using.This includes solving the problem of “hallucinations” or fabricated answers, its response speed or “latency,” and reliability. “Hallucinations have to be close to zero,” said Prasad. “It’s still an open problem in the industry, but we are working extremely hard on it.”
I know there is negative sentiment of AI in Ars community, but I for one hope Alexa has AI natural language processing capability. At the moment Alexa is dumb to a point of unusable.
I cannot ask it to add an event to my calendar.
I cannot ask it to turn off my air conditioner unless I say a very specific and unnatural phrase such as turn the downstairs thermostat to off . It doesn't understand even words like HVAC.
I can't tell it say like automatically turn off my Echo show screen at 10pm at night. There is simply no understanding of my instructions.
It can be worse than that. Don't assume everyone's English is educated native quality. My German wife speaks very good English and has a large vocabulary, but still pronounces many words with a foreign accent. She is also prone to translating German literally, resulting in stuff like the old joke, "Throw Mama down the stairs her hat."
Saying "we want to eliminate hallucinations from LLMs" is a meaningless statement. All LLMs do is "hallucinate". When whatever they make up has some overlap with reality we then attribute intelligence to them, and when it doesn't, we call it a "hallucination", instead of what it really is, which is working as designed.
The problem both Google and Amazon have with these devices is that they want them to do things that the consumer is not interested in. I don't want a "proactive" assistant. I want a device that will wake up when I call it and do the basic shit that I want: Turn the lights on or off, turn the TV on or off, play music. Otherwise, just stay out of the way.
If regular Alexa couldn't generate any meaningful subscription revenue or sales commissions, does the team seriously expect GenAI Alexa (which is presumably far more expensive per query) to be different? It seems like Amazon is doubling down on a failed monetization strategy.
We got an Echo Dot for $5 on a punt circa 2018 because it was bundled with something or another. My pronunciation is about as straightforward and uninflected as southern England can produce and my history in publishing means that I chide myself if I even so much as split an infinitive. That's despite having spent now almost half my adult life in the US; people even ask when my natural-born-American child moved here because he's done such an excellent job of learning my British accent.The cynic in me expects these voice assistants to recognize Mid-Atlantic, BBC RP, NorCal/Left Coast, and some Indian English dialects perfectly, and categorize Appalachian and Tidewater as "non-English" simply because they've drifted less from 17th century English than the others.
tfw you realize even motherboards are chips which are networked together by soder, all systems are networked systems, none of them, so far, have been guaranteed to be accurate 100% of the time, whether it be pc's, healthcare, or space systems. not one.I expect the technology I buy to work 100% of the time. Not close to it. All the time*. I would be very unhappy if my spreadsheet software only gave me the right answers 99.999% of the time, if my e-reader showed me only 99.999% of the words in a book I'd bought.
More to the point, how do you even define "working right" for an LLM-based chatbot when it's whole existence is just making up plausible sounding replies?
(*Yes, I understand there are issues like networked services, where it's never going to be guaranteed 100%, but that's obviously not what they're referring to here)
Exactly. And with respect to a quote from the article--"The complexity comes from Alexa users expecting quick responses as well as extremely high levels of accuracy"--given that Alexa often struggles to complete those simple tasks, I have not ever expected a high level of accuracy, and do not see me ever relying on any automated "agent" to take unilateral action on my behalf.I think they have seriously overestimated how much people want that.
I have Amazon Echos in my home....the majority of use is home automation activation, weather, and timers. My wife and I also use it to communicate between our two home offices, which are on different floors.