Steven Rosenbaum explains how inaccurate quotes got into his book <em>The Future of Truth</em>.
See full article...
See full article...
Sure, the author maintains the final responsibility for whatever's in the book. He's the one that requested the AI help that led to the "synthetic quotes" being published, and didn't catch the error in what he got from the AI.AI put “synthetic quotes” in his book.
No. The author did that. The particular mechanism (I refuse to call LLMs "tools") that he outsourced his thinking to matters less than the fact that he failed to practice his craft correctly.
That's what really gets me about this. So many people are already saying they "need" this tool, and how important it is. Since when? This tech only JUST emerged on the scene! There's NO WAY that many people are ALREADY at a "can't live without it" stage, and NO ONE should be in a "that's just how it's always been" stage. All I can imagine is it really is just THAT addictive to have a constant work companion who tells you how amazing you are.I'm sorry, while i don't know anything about this guy on a reputational level, at this point I trust ZE-RO non-fiction authors who have the stupid / cahones to use AI in their research, either to synthesize concepts for purposes of minor rephrasing and inclusion, or for finding out "what _____ said about _____ " and then using those quotes without directly verifying.
This clown absolutely deserves egg on his face. We've got along just fine all these centuries without authors needing a brainless chatbot to "do the work" for them or "reduce the boring work," so there's no excuse. It's not like researching online without using chatbots is difficult or heavily time consuming vs the old ways of using libraries (books, microfiche, etc). Do the fucking work / don't take lazy shortcuts with AI and this won't happen. Pretty simple.
When I first got access to the Internet in the '90s (and then broadband a few years later), it quickly achieved "can't live without it" status for me. So I do think it's possible for new tech to feel almost instantly indispensable.That's what really gets me about this. So many people are already saying they "need" this tool, and how important it is. Since when? This tech only JUST emerged on the scene! There's NO WAY that many people are ALREADY at a "can't live without it" stage, and NO ONE should be in a "that's just how it's always been" stage.
The companies providing the tool share in the blame, because they are advertising and selling the tool as something it's not. I know I keep banging this drum, but MS recently added that "For entertainment purposes only" disclaimer. As I've said over and over, that's not normal for useful tools. It IS normal for "psychic friend networks".Sure, the author maintains the final responsibility for whatever's in the book. He's the one that requested the AI help that led to the "synthetic quotes" being published, and didn't catch the error in what he got from the AI.
All that said "His use of AI research tools led to him missing "synthetic quotes" that got inserted into his book" is a little long for a headline...
I think he has anthropomorphized his LLM writing buddies. He speaks about them as if they are partners who betrayed him....
More generally - I really don't get Mr Rosenbaum's attitude towards AI tools. The whole interview is a litany of ways they got things wrong and were a pain to deal with, but somehow they're still magical, and super powerful and he can't quit using them.
They don't even know what a "fact" is. They are statistical word pickers, incapable of knowing what they are saying. They can, 90% of the time, accurately put together words that relate to one another in ways that seem like human speech. The other 10%, the mask falls off and you see the cogs and sprockets underneath.Because verification turns into an incredibly hard ask. Carefully check every fact, every quote? Because some percentage of them will have slipped in there but they'll all look entirely plausible.
Maybe one day they'll work out to have them know the limits of their certain knowledge, then they might be useful.
Precisely.Hey, I've spent many, many, many hours doing mindless copy-and-paste drudgery (not to mention manual input because the source software had no means of putting information in a copy buffer), and it does suck, yes. But a lot of this stuff can be automated properly. If you want to be more efficient, learn how to (really) use a computer. Learn how to automate menial tasks with simple algorithms you can actually understand which will give predictable results. Bugs can still happen, of course, but you can get to something you have confidence in that you won't have to swear at, because you did it yourself only faster.
This is not a live-and-learn sort of thing. He should be mortified and aghast and should never want to touch ChatGPT for work ever again, I would think.
The Turing Test ends up mistaking exactly how easily fooled humans are in assuming something is actually intelligent. Including themselves.I think he has anthropomorphized his LLM writing buddies. He speaks about them as if they are partners who betrayed him.
I swear that passing the Turing Test - hyper-simplified to "make it sound exactly like a human" - is proving to be the worst aspiration for "AI". LLMs sound so much like an omni-helpful chat buddy that people forget (or never understood in the first place) what's really going on on the other side. I wonder if any research has looked into how people's perceptions of LLMs differ if their output was not conversational, but more computer-ish. (I don't know how to describe "computer-ish" sufficiently to support said research, but that's why we have professional researchers!)
"Hey Copilot, make your output more computer-ish. Thanks, buddy."
Why would anyone read anything this idiot has ever written?Speaking to Ars in the wake of the controversy, Rosenbaum says he “learned a lesson” and is “going to be much more suspicious” and “reticent to trust” AI outputs going forward.
I think he's a fucking lazy "author" who's more concerned about profit than his craft.I think he has anthropomorphized his LLM writing buddies. He speaks about them as if they are partners who betrayed him.
I swear that passing the Turing Test - hyper-simplified to "make it sound exactly like a human" - is proving to be the worst aspiration for "AI". LLMs sound so much like an omni-helpful chat buddy that people forget (or never understood in the first place) what's really going on on the other side. I wonder if any research has looked into how people's perceptions of LLMs differ if their output was not conversational, but more computer-ish. (I don't know how to describe "computer-ish" sufficiently to support said research, but that's why we have professional researchers!)
"Hey Copilot, make your output more computer-ish. Thanks, buddy."
So many people are already saying they "need" this tool, and how important it is. Since when? This tech only JUST emerged on the scene!
Probably it's difficult to grasp precisely because this technology is deceptive. It's fake-it-till-you-make-it in software form, and the digital charlatans are just as effective as the organic ones at reeling certain types of people in.I don't know how "neat, but not ready for primetime" is such a hard concept to grasp for some people.
To be clear, I am not convinced the tech will EVER be "ready for primetime". The underlying way in which it functions, as a statistical word picker with no concept of meaning, prevents it from ever reaching such a lofty goal. This isn't the general AI you're looking for. While it can be CALLED AI, in the same way Pacman ghost pathing can be called AI, I'm opting to stop using that word because to the general public, it's a loaded term, and companies like Microsoft and "Open"AI know it.I'm quite fond of playing around with generative AI, it's a lot of fun. But I would never use it in my work at this stage because the tech simply isn't there yet, and won't be for some time.
I don't know how "neat, but not ready for primetime" is such a hard concept to grasp for some people. These tools are advancing incredibly quickly, but still not nearly as quickly as they seem to think.
"Sharpening" and "cutting to pieces" are different things, one might observe."Rosenbaum made it clear that using AI was the relatively safe “bicycle” option in this analogy. I responded that the supercharged efficiency and catastrophic risk inherent in using AI made it feel a bit more like the motorcycle. Rosenbaum said “that might be fair” and thanked me for “sharpening” his analogy."
This alone tells me that Rosenbaum doesn't understand AI at all. Or analogies...
To be clear, I am not convinced the tech will EVER be "ready for primetime". The underlying way in which it functions, as a statistical word picker with no concept of meaning, prevents it from ever reaching such a lofty goal. This isn't the general AI you're looking for.
I was gonna say that but I feared the AI bros would downvote me to oblivion lol.Better idea... just don't use AI.
I think this is the crux of the matter. I absolutely fear these things. I don't fear them because they're smarter than me. They're not; they don't even map onto a smart-dumb spectrum. But I know that I am human and that I am subject to human failings. I am confident that I would grow to like and trust them, my plastic/silicon parasocial friends, no matter how strong I think my guardrails are.I’m reading this thinking “this man’s brain is cooked.” The tool is making stuff up, and making him look silly and stupid for putting them in a book, and he doesn’t want to go back to the way things were, when he was less likely to have made-up quotations in his work?
As designed, any use of these models beyond tasks suitable for the "research and report back" level of delegation really is only suitable for amusement purposes.To be clear, I am not convinced the tech will EVER be "ready for primetime". The underlying way in which it functions, as a statistical word picker with no concept of meaning, prevents it from ever reaching such a lofty goal. This isn't the general AI you're looking for. While it can be CALLED AI, in the same way Pacman ghost pathing can be called AI, I'm opting to stop using that word because to the general public, it's a loaded term, and companies like Microsoft and "Open"AI know it.
That's the job. If you don't want it, then find a different one. Or maybe use a more reliable process.Because verification turns into an incredibly hard ask. Carefully check every fact, every quote?
This phrasing suggests that AI¹ has any power to be controlled at all.It’s also magical in another way: Like J.R.R. Tolkien’s One Ring, AI convinces many of those who use it that they can control its power properly. But can they?
Fact-Checking Duck says:All that said "His use of AI research tools led to him missing "synthetic quotes" that got inserted into his book" is a little long for a headline...
99 characters and spaces.His use of AI research tools led to him missing "synthetic quotes" that got inserted into his book.
No, the verification workflow is exactly the same: CHECK EVERYTHING BEFORE PUBLISHING, FFS. "Measure twice; cut once" NOT "Move fast and break things."Kyle Orland said:Rosenbaum, for his part, agreed that “publishers are going to need new verification workflows designed specifically for AI-era research.
^scattered emphasis mineAt the time, he found AI answers “spectacularly useful” about 8 out of 10 times, with the remainder being confabulations that were “just not true.”
Despite these errors, he kept using the tools in his life and work. When we talked on Tuesday, Rosenbaum said he had recently asked an AI tool to extract his “no changes, verbatim” speaker’s notes out of a slide deck so he could use them for an upcoming presentation. He was about to print those extracted notes when he realized that the LLM had actually rewritten his words despite his “very clear instructions for the robot.”
“And I say to it, ‘Did you rewrite the words?’ And it says, ‘Well, I just made the language a little stronger.’ Well, pardon me, but like, fuck you!” he said.
Even in the face of these kinds of profanity-inducing errors, though, Rosenbaum still believes that AI tools are too efficient not to use.
“The deck was 100 pages,” Rosenbaum said. “To cut and paste page by page, the text from each page would have been an hour’s worth of work, of mindless cutting and pasting. ChatGPT did it in about four seconds.”
To which the obvious retort might be: Yes, it was fast.
But it was also wrong.
The very least I would expect is for the author to instruct the LLM to provide a link for every sourced piece of information. Then I expect the author to follow each link to verify the source. That's the minimum.Because verification turns into an incredibly hard ask. Carefully check every fact, every quote? Because some percentage of them will have slipped in there but they'll all look entirely plausible.
Maybe one day they'll work out to have them know the limits of their certain knowledge, then they might be useful.
As the author of 35 novels in about 50 years of writing, you, sir, are no writer if you let a machine create your words for you.“As a writer, AI is often a delightful writing companion,” Rosenbaum told me. “When I say ‘writing companion,’ I don’t use that lightly. It’s strangely creative and crafty and unusual in all these ways… and then it betrays you in ways that are just really quite horrible.”
Ironically I agree with Qwertilot that you can't double-check every single thing an LLM does. It doesn't scale, because the whole point is that it can do X faster than a person can do X. This, however, just leads to the inescapable conclusion that you can't use LLMs for anything, because their output is practically unverifiable, and you can't start from the assumption that errors are mere accidents, because they are not people.That's the job. If you don't want it, then find a different one. Or maybe use a more reliable process.
Comment (or string) of the week.That's why you talk "at" an AI. Not with. Don't fucking anthropomorphize that thing.
Recently, I prompted ChatGPT 5 mini, "Without querying any outside or online resources, retell the story of the hit film [REDACTED]."I can't fathom running across so many examples of these tools fucking things up and still being like "yeah but it's faaast." It's just.... man, I don't even know. Like the "extract this quote verbatim" and the tool STILL doesn't do it, that really should be some kind of sign.
Rather amazingly, Rosenbaum is not interested in going back to the AI-free research process he used to write previous books.
And that's not the right use of "reticent". eyerollSpeaking to Ars in the wake of the controversy, Rosenbaum says he “learned a lesson” and is “going to be much more suspicious” and “reticent to trust” AI outputs going forward.
I'm not even sure it's good in that extremely limited sense. Since it doesn't understand your intent, it's ability to "translate" an order is suspect, and it's already caused some catastrophic failures at enterprise level by doing the absolute WORST thing.Oh, I'm right with you there. I think LLM's show promise as a narrow UI layer, translating natural language into computer instructions which would then be passed off to some entirely different sort of architecture for processing. If you use them for the processing itself You're Going To Have A Bad Time, which a lot of people are learning the hard way.