Editor’s Note: Retraction of article containing fabricated quotations

Status
Not open for further replies.
Maybe I’m naive, but I feel a joint byline is like jointly signing a mortgage. Both are willingly responsible, and if things go sideways, it’s on both of parties. We give a lot of responsibility to journalists in exchange for higher expectations.
You know how sometimes one person on a mortgage ends up suing the other person on a mortgage? That's an outcome of trust being violated. Taking joint responsibility for the mortgage means if the other person bails on you, yes you are still responsible for making payments but they can and should be held culpable for the financial and reputational damage they have done to you by violating your trust.

All a shared byline means is "I did some work on this and so did they." The byline doesn't say it, but if one person pitches the story and provides background information to the other person then they are more responsible. It's like if two people jointly sign the mortgage paperwork but the one with the higher credit score also co-signs as a guarantor.
 
Upvote
19 (20 / -1)

Wheels Of Confusion

Ars Legatus Legionis
75,398
Subscriptor
Ironically, this latest article by Benj, the comments related to it and the eventual resolution will teach us more about the actual current state of AIs in publishing then the last 100 articles that fawned over AI (look, Conan next to the boob tube - neat!).

This episode should be a learning experience. End result should be improvement.
I wanted to highlight this part of your post because I agree with it 100%. We can't overlook this as an opportunity. Not just for a single Ars writer, not just for Ars Technica itself, but it needs to be examined and lessons drawn for all of journalism at the very least.
Generative AI is corrosive to good journalism, full stop. WE NEED TO EXAMINE TIMES WHERE IT GOES WRONG, INCLUDING (ESPECIALLY) THIS ONE.

The analogy to the One Ring someone made earlier is apt. Middle Earth was not just full of wannabe gods who thought the One Ring was their chance to rule everything. Everyone else was tempted to use the ring for good purposes. But ultimately it's not a thing that can do good; it can only ever corrupt and ruin. It wasn't just a flaw baked into the ring itself, but also a flaw in the hearts of all the people around it. They saw a tool and imagined how they could use it to achieve things, even with good intentions, while inevitably the ring would betray them. We even have real-life examples of people becoming deranged and obsessed with their LLMs the way Gollum was obsessed by the active influence of the ring. To the point of suicide, even! This isn't some theoretical faff.

Ultimately the only answer was not to have it and not to use it. That's the only responsible thing to do with the tool; unmake it, remove it from the world so that its influence is gone.
In practice we may never put the genie entirely back into the bottle, but we can make sure it's known that using LLMs and other generative AI is seen as completely invalidating and disreputable.


Which is the author's job, and which they failed to do.
Based on personal experience, I can imagine a "best case" scenario like this:
  • work while sick (bad idea)
  • can't focus eyes; try out new verbatim quote tool (not a policy violation in itself)
  • tool doesn't work; rabbit hole--why not? Ask Chat GPT.
  • fever related confusion on which window is which; pull what I thought were parts of actual blog, not ChatGPT. (Know better than to use ChatGPT for actual writing, but confused.)
No clue whether this is even closer to true, but because I can see it happening (and it's consistent with my mini-mental-model of the writer), I need to wait for the postmortem from the people that know. Ars has enough trust in the bank with me that I believe that will come.
He literally said, “… I decided to try an experimental Claude Code-based AI to help me extract relevant verbatim source material.”
He doesn’t seem to understand that LLMs are so unreliable that expecting them to provide “verbatim” quotes is incredibly stupid. He may as well have been asking for fabricated quotes, because that’s what LLMs do.
Obviously there might be cases where the usage of AI tools can be very useful for a journalist (same as it can be used as a tool for developers) as long as you always verify the results.
I also call nonsense on “being sick made me seek out a new tool and use it uncritically.” People rarely seek out new and inventive complications to their daily workflow when they’re feverish. They just do the same things they always do, but with more mistakes (mistakes such as getting caught inventing quotes).
Benj's literal job at Ars has been to familiarize himself with the blossoming AI trend and to investigate the technologies, companies, and products flooding the Web and meatspace alike.
Him having and attempting to use a new citation tool is 100% something that's explicable as part of his job covering the AI beat. I expect he was early to find the citation-maker and was going to investigate it for a write-up, the same way he reviews new versions of those image gens. Hell we have Lee Hutchinson writing an article about his experience vibe-coding. Actually using the tools is what's necessary to separate out original content from churnalism mills that merely rehash press releases or round up Tweets and call it reporting. How many times have Arsians complained that some gadget roundup didn't include Ars writers testing the widget in question? We have to be consistent in expecting hands-on experience with the products and services Ars covers or they're no better than those innumerable slop sites that just aggregate Amazon reviews and call it a "guide."

Things started going wrong, in his explanation, when the tool failed to run.

I can understand bleary-eyed mistakes along the lines of copying and pasting the wrong version of a document, or not catching the parts of a piece of text that were supposed to be taken out in the next editing pass before submitting it. Multitasking means having a lot of things to juggle; do it enough and eventually something hits the ground. Working while sick makes that nearly inevitable. Things like that have definitely happened to me; lesser versions happen to me almost daily just leaving all these comments at Ars.


I understand the desire to not continue distributing bad information. However, when the retraction posts simply to a page that is essentially deleted, it's not at all useful for the reader to understand what is happening.
Specifically:
  • Memory-holing the original article
  • Effectively deleting said article’s comments
  • Not referencing the original article in the editor’s statement (i.e., in this article)
  • The locking of comments in the Ars forum post about this…incident
I expect a higher standard of: transparency; responsiveness; editorial oversight (in the article production process); and, in general, integrity from Ars.
The comment locks I expect and think are good practice. In the absence of information, speculation turns to toxic rumor-mongering and assuming the worst.

Only so long as there is still an actual investigation and process playing out, though. If it stops here, it's just Mushroom Management: feed them shit and keep them in the dark. And I've seen Ars do that before, after breaching readers' trust.

It's poisonous to good faith; secrecy means you don't trust your readership with the truth. You don't want them to make informed decisions. You either believe there was wrong-doing so severe that it would be deleterious if public knowledge, or else you believe your readership is too flighty and reactionary to make the "the right call" with the information.

And what I have less tolerance for is memory holing the community's role in uncovering this breach of journalistic integrity. If Ars is really and truly proud of the community it's built, the fact that commenters brought Scott's accusation to management's attention needs to be widely known. The reactions of the community in that thread were remarkably level-headed and rational and this deserves to be recognized by Ars itself.

Throwing the original discovery and our participation into digital oblivion is throwing away some of that regard for the community. It reduces the response from that of a proud community shepherd to that of a mere corporate golem, reflexively covering its own ass when distasteful things happen to shield itself from accountability. And it shows that everything readers feel is important enough to share with the site, including with the writers and editors, can be cast aside for convenience.

So I'm willing to give Ars some space only so long as I can believe they're working to do the right thing, and that the job isn't done yet. We'll have to see if this incident plays out differently. If this "notice" ends up being the final word, then I'll have to make hard choices.
 
Last edited:
Upvote
78 (84 / -6)

Snark218

Ars Legatus Legionis
36,436
Subscriptor
Okay, I've been staying out of it but your last paragraph made me think about how that describes exactly how I managed teams too. But where you lose me is that there's a big difference between doing a dumb and blatantly subverting stated policies and procedures.

Forget all the bad car analogies, let's look at this like it was code. Effectively what happened is a developer asked an LLM to write a new function for them on their local machine and then they pushed it directly to Production without running it through Dev, Staging, or QA review. That kind of thing is more than an oopsie.

I say that as someone who once crashed a big ecommerce site by pushing out a small change that required rebuilding the cache of every single URL during the peak load for the year. It was my error and I owned it, but I didn't make that change during a code freeze period without first making my case for why it was necessary and getting approval from everyone else responsible for site stability. It was my fuck up even though in the end it turned out the documentation we all relied on was incorrect and that's why the site crashed. I was forgiven but if I'd just shoved that out all on my own without following procedures, looping in others, and getting the change approved then I damn well should have been fired.


(For those keeping score, yes I actually have been both a professional writer and a manager of front-end web dev. And other things too. "Specialization is for insects" pretty much defines my career.)
I think you're drawing the critical distinction here. I manage people and I have overseen contractors. An error of omission is an "oh fuck, I ran out of time to submit that deliverable/didn't get that email/forgot to call into that meeting because I was in the zone." I have a lot of tolerance for errors like that, because if there's too much work or too few people or not enough time or all of those together, yeah, shit falls through cracks or has to get triaged. Mistakes happen. Work on improving it, but I'm not going to rip anybody a new asshole unless it becomes a trend.

An error of commission, on the other hand, would be "I didn't notify the regulators of a reportable incident because I thought we could clean it up quick and avoid the hassle," or "I falsified four years worth of inspections because fuck it, that's why" or "I manipulated the data because otherwise it contradicted my pet theory." And that's a lot less forgivable, because it's a conscious, active choice to take an action to subvert or ignore a policy, reg, or requirement that one knew about and then disregarded. That requires a different sort of response.
 
Upvote
57 (57 / 0)

Snark218

Ars Legatus Legionis
36,436
Subscriptor
So I'm willing to give Ars some space only so long as I can believe they're working to do the right thing, and that the job isn't done yet. We'll have to see if this incident plays out differently. If this "notice" ends up being the final word, then I'll have to make hard choices.
Not that your whole quote wasn't relevant and important, but I cut it down to say, yeah. This. Same. This cannot be the end of the conversation. They don't need to shitcan Benj to make a point, either, but I do expect a "this is why this failure happened, this is what we're doing to avoid it, we apologize."
 
Upvote
52 (52 / 0)

Jim Salter

Ars Legatus Legionis
17,133
Subscriptor++
Out of curiosity, do you see a difference between an excuse and an explanation? Considering that Benji in his posts did accept his responsibility for the situation?
Not being snarky, answering honestly: an excuse is something that makes an action okay, once you understand the excuse. An explanation gives you the why, but that why isn't sufficient to excuse the bad action taken.

For example, "I was drunk when I wrecked my car" is an explanation, but it's not an excuse.

"I had COVID brain fog when I used ChatGPT as a primary source" is, similarly, an explanation--but not an acceptable excuse. Particularly not from a reporter who we're supposed to be trusting to objectively analyze the technology in the first place. That analysis clearly wasn't objective enough, and here we are.

I do have sympathy for Benj feeling enormous pressure to get that piece out, and do it that day. I've got personal experience of that pressure, and it's very real. That's also the Writers' Guild's job to address, and it's still not sufficient mitigation to excuse pumping ChatGPT slop into an article.

Again, I wish Benj well, I don't think this is or even necessarily should be a career ending mistake. But it's definitely not the kind of mistake you get no serious consequences from. And on Ars' side of the equation, there has to be a realization of what message the readers and subscribers AND authors take from this only getting a slap on the wrist.

There's no getting out of this without sending a message. Another commenter earlier pointedly said they would unsubscribe if Benj doesn't keep his job... Which should just make clear that fence sitting isn't going to work. Ars needs to decide what message it's going to send, and then send that message clearly.

And I sincerely hope that clear, unambiguous message is "this is absolutely not acceptable behavior, best of luck at your next job." I'm perfectly fine with the old "we'd like your resignation letter by $date" dodge.

I say none of this out of a spirit of vindictiveness. I say it because this site and this community is important to me.

It's been important to me for more than half my life now, and this is a watershed moment that will unavoidably shape what Ars is. And I do not want it to be the kind of place where you're wondering just how much slop went into an article, and you have to wonder that, because you've seen management tolerate it even when it's this dead-to-rights obvious.
 
Upvote
99 (104 / -5)

graylshaped

Ars Legatus Legionis
67,695
Subscriptor++
Fair point ;) but do online news publications really delete articles after publication, like the NYT, WashPost ?
There sites are an alternate way to access their traditional newspapers, though. Ars is not in that camp. I'm not saying one way or the other is better. I think the EU's right to be forgotten law is just dumb. both practically and ethically. I

can make an argument both ways here. I can argue for the journalistic record to be maintained intact as a record; I can also argue that when a company is aware it published something false a reasonable cure is to remove the falsehood.
 
Upvote
-2 (6 / -8)

graylshaped

Ars Legatus Legionis
67,695
Subscriptor++
Not being snarky, answering honestly: an excuse is something that makes an action okay, once you understand the excuse. An explanation gives you the why, but that why isn't sufficient to excuse the bad action taken.

For example, "I was drunk when I wrecked my car" is an explanation, but it's not an excuse.

"I had COVID brain fog when I used ChatGPT as a primary source" is, similarly, an explanation--but not an acceptable excuse. Particularly not from a reporter who we're supposed to be trusting to objectively analyze the technology in the first place. That analysis clearly wasn't objective enough, and here we are.

I do have sympathy for Benj feeling enormous pressure to get that piece out, and do it that day. I've got personal experience of that pressure, and it's very real. That's also the Writers' Guild's job to address, and it's still not sufficient mitigation to excuse pumping ChatGPT slop into an article.

Again, I wish Benj well, I don't think this is or even necessarily should be a career ending mistake. But it's definitely not the kind of mistake you get no serious consequences from. And on Ars' side of the equation, there has to be a realization of what message the readers and subscribers AND authors take from this only getting a slap on the wrist.

There's no getting out of this without sending a message. Another commenter earlier pointedly said they would unsubscribe if Benj doesn't keep his job... Which should just make clear that fence sitting isn't going to work. Ars needs to decide what message it's going to send, and then send that message clearly.

And I sincerely hope that clear, unambiguous message is "this is absolutely not acceptable behavior, best of luck at your next job." I'm perfectly fine with the old "we'd like your resignation letter by $date" dodge.

I say none of this out of a spirit of vindictiveness. I say it because this site and this community is important to me.

It's been important to me for more than half my life now, and this is a watershed moment that will unavoidably shape what Ars is. And I do not want it to be the kind of place where you're wondering just how much slop went into an article, and you have to wonder that, because you've seen management tolerate it even when it's this dead-to-rights obvious.
May I suggest a qualifier? An excuse is something that makes something seem okay, but may or may not represent reasonable justification for the choice made.
 
Upvote
8 (8 / 0)
Do Ars even have editors like that for their long term/senior writers? Serious question.
I don't know the answer. But speaking as a professional editor, I can say that verifying sources is something we really don't want to do or have time to do. We really need to depend on our writers to provide us with good sources, correctly quoted and properly cited.

Of course, "verifying sources" can be more, or less, rigorous; and sometimes rigorous verification of sources, especially in book manuscripts citing other books, isn't realistically possible given time and budget constraints.

However, if sources are to be verified by an editor, then the kind of editor normally tasked with such verification would be a copyeditor. (And most copyeditors I know are already overworked.)

Edit: spelling
 
Last edited:
Upvote
59 (60 / -1)
He literally said, “… I decided to try an experimental Claude Code-based AI to help me extract relevant verbatim source material.”

He doesn’t seem to understand that LLMs are so unreliable that expecting them to provide “verbatim” quotes is incredibly stupid. He may as well have been asking for fabricated quotes, because that’s what LLMs do.

And this coming from a supposed “senior” AI reporter. He has a recorded interview where he talks about using AI chat bots to help him write (or rather, assemble) articles. Nobody should trust anything he assembles from the slop trough of AI output.
It wouldn't surprise me in the least if that experimental tool turns out to be an internal tool being pushed by Condé Nast.
 
Upvote
-17 (8 / -25)

DarthSlack

Ars Legatus Legionis
23,059
Subscriptor++
Forget all the bad car analogies, let's look at this like it was code. Effectively what happened is a developer asked an LLM to write a new function for them on their local machine and then they pushed it directly to Production without running it through Dev, Staging, or QA review. That kind of thing is more than an oopsie.

The problem with this analogy is that apparently the system in place allowed the code to go straight to production without anyone in Dev, Staging or QA review it. Yeah, the bad code is a problem, but so is the system that allowed it to production without more than a cursory glance.

I don't know about you, but if my QA lead found me running a process that allowed straight to production, I'd be smothered in honey and staked out on an anthill.
 
Upvote
33 (33 / 0)

IPunchCholla

Ars Scholae Palatinae
867
I just want to add my voice asking for an explanation of how editing works at Ars. While it's not been a ton of times and not too recently, every time I have been verbally interviewed by a news outlet, an editor has called to verify that the quotes were accurate. That seems like a good policy to ensure this type of thing regardless of LLM involvement does not happen. I just assumed that something similar was happening at Ars. It would be nice to know what the current process is, and what steps are being considered to rectify this in the future.
 
Upvote
36 (36 / 0)

Snark218

Ars Legatus Legionis
36,436
Subscriptor
Not being snarky, answering honestly: an excuse is something that makes an action okay, once you understand the excuse. An explanation gives you the why, but that why isn't sufficient to excuse the bad action taken.

For example, "I was drunk when I wrecked my car" is an explanation, but it's not an excuse.

"I had COVID brain fog when I used ChatGPT as a primary source" is, similarly, an explanation--but not an acceptable excuse. Particularly not from a reporter who we're supposed to be trusting to objectively analyze the technology in the first place. That analysis clearly wasn't objective enough, and here we are.

I do have sympathy for Benj feeling enormous pressure to get that piece out, and do it that day. I've got personal experience of that pressure, and it's very real. That's also the Writers' Guild's job to address, and it's still not sufficient mitigation to excuse pumping ChatGPT slop into an article.
I made a similar mistake once. I sent a deliverable directly to a client without internal review, and there were problems, including using the wrong version of a letterhead. The explanation was, I was under a tight deadline and rushed through and didn't think about it and assumed it was ready to roll, because missing the deadline was more important than the internal review (it was not). The excuse was, I was a brand new dad and was on a week of about two hours of sleep a night and I was, literally, not in my right mind. I sent written apologies to the client and immediately went on a week of unpaid leave to get rested and support my wife so we could get to a place where the kid wasn't waking up 8-10 times a night screaming like a circular saw cutting sheet metal. Thank fuck it was a sufficient excuse.

I would accept "sick as a dog and irrationally target-locked on a story going live by COB" as an excuse. But there needs to be some kind of control to prevent that kind of thinking from dictating the quality and reliability of reportage here.
Again, I wish Benj well, I don't think this is or even necessarily should be a career ending mistake. But it's definitely not the kind of mistake you get no serious consequences from. And on Ars' side of the equation, there has to be a realization of what message the readers and subscribers AND authors take from this only getting a slap on the wrist.

There's no getting out of this without sending a message. Another commenter earlier pointedly said they would unsubscribe if Benj doesn't keep his job... Which should just make clear that fence sitting isn't going to work. Ars needs to decide what message it's going to send, and then send that message clearly.

And I sincerely hope that clear, unambiguous message is "this is absolutely not acceptable behavior, best of luck at your next job." I'm perfectly fine with the old "we'd like your resignation letter by $date" dodge.
I don't even necessarily request that. I would not be disappointed if that was the outcome, mind you. It would be reasonable and appropriate. But even if he stays, and the message is something specific about stricter editorial oversight and more reach back support for writers if they can't make a deadline, I'm okay with that. Personally, that is. But it can't just be a retraction and a "sorry, our bad."
I say none of this out of a spirit of vindictiveness. I say it because this site and this community is important to me.

It's been important to me for more than half my life now, and this is a watershed moment that will unavoidably shape what Ars is. And I do not want it to be the kind of place where you're wondering just how much slop went into an article, and you have to wonder that, because you've seen management tolerate it even when it's this dead-to-rights obvious.
I cannot possibly say any of this better than you just did, and I cosign it. Because, same. This site and this community is important to me. But it will become less so, much less so, if this is implicitly tolerable to its editors.
 
Last edited:
Upvote
49 (49 / 0)
May I suggest a qualifier? An excuse is something that makes something seem okay, but may or may not represent reasonable justification for the choice made.
I might be spliitting semantic hairs but I think there's a big difference between making an excuse and having an excuse. We're in a linguistic period where the word "excuse" is more often used in the former sense of trying to deflect blame, but we do still use it in the latter sense of agreeing there are mitigating circumstances.
 
Last edited:
Upvote
22 (22 / 0)

Jim Salter

Ars Legatus Legionis
17,133
Subscriptor++
That would be bad. I still don't think that's a fair analogy.

To stay with the car analogy: he made a huge scratch on the front door.
Fine, you want to stay with the car analogies?

Well, there's a cubic yard of lumpy Bondo under the $200 paint job on that door. You don't even need a magnet to figure out how bad that "scratch" really was, and if you've bought a few used cars in the past, you're extremely wary of frame damage camouflaged by that extremely superficial cosmetic "fix."

This isn't my first time at a dealership.
 
Upvote
19 (22 / -3)
I don't know the answer. But speaking as a professional editor, I can say that verifying sources is something we really don't want to do or have time to do. We really need to depend on our writers to provide us with good sources, correctly quoted and properly cited.

Of course, "verifying sources" can be more, or less, rigorous; and sometimes rigorous verification of sources, especially in book manuscripts citing other books, isn't realistically possible given time and budget constraints.

However, if sources are to be verified by an editor, then the kind of editor normally tasked with such verification would be a copyeditor. (And most copyeditors I know are already overworked.)

Edit: spelling
I think a lot of people don't understand how much of an outlier The New Yorker's editorial process is. I consider myself lucky that I ever had a job where what I wrote was looked over by a copy editor. A fact checker would have been a total fantasy.
 
Upvote
49 (49 / 0)

graylshaped

Ars Legatus Legionis
67,695
Subscriptor++
I might be submitting semantic hairs but I think there's a big difference between making an excuse and having an excuse. We're in a linguistic period where the word "excuse" is more often used in the former sense of trying to deflect blame, but we do still use it in the latter sense of agreeing there are mitigating circumstances.
Yes, I think that is an meaningful difference, and the line I was trying to suggest exists.
 
Upvote
11 (11 / 0)
The problem with this analogy is that apparently the system in place allowed the code to go straight to production without anyone in Dev, Staging or QA review it. Yeah, the bad code is a problem, but so is the system that allowed it to production without more than a cursory glance.

I don't know about you, but if my QA lead found me running a process that allowed straight to production, I'd be smothered in honey and staked out on an anthill.
Every place where I had the power to push to prod I could have easily done it without going through any intermediary steps. Not the best practice, but this was consistent from small non-profits to one of the largest market cap companies in the world.
 
Upvote
19 (19 / 0)

belrick

Wise, Aged Ars Veteran
124
Two different writers were listed on the byline. Did one use AI without the other's knowledge? Seems like a big lapse in judgement happened somewhere.
I would like the retraction/apology to go further because of this exact reason. Ars publishes with bylines for a reason. Ars presumably knows exactly how this happened, but we still don't. If one or both the authors made the judgement error, then an apology from them to the audience (I would hope they've already apologized to the subject) would go a long way to helping us trust Ars still.
 
Upvote
8 (8 / 0)

Jim Salter

Ars Legatus Legionis
17,133
Subscriptor++
I hope all the people dropping their subscriptions over this one incident (which is still playing out) are living lives of absolute, infallible perfection.
This is an interesting take, when delivered by a Lurkus with no subscriptor tag of their own.
 
Upvote
11 (33 / -22)

pauleyc

Ars Praetorian
437
Subscriptor
The people defending Ars and Benji are why journalism is in the toilet.

Lenny - can I call you Lenny? - did you consider it's maybe not as much defending Ars or Benji but rather keeping a level head until the editorial board/HR/whoever is responsible for this completes their investigation? And Ars has actually time to review and act accordingly?

Because right now there is in this thread less of a furious community but something closer to an enraged pitchfork+torches mob demanding blood. And the criticism is veering sharply into bullying while nobody knows the actual full picture yet. And bullying has the nasty tendency of getting out of hand and can sometimes lead to irreversible actions.

So no, you don't get to make such proclamations from your high horsie. Not while others are asking to maybe keep cooler heads and actually wait until Ars shares more about this clusterfuck of a situation.
 
Upvote
18 (38 / -20)
(I put this comment inline on the post, relying here for visibility since I think it's helpful info)

You can mouse over the ban icon for a tool tip giving you more info, though that doesn't work on mobile. But the way to tell a difference between temp and permanent is temp = grey slash circle, permanent = red

I will be moderating low effort trolling, and if it's from low post accounts who only showed up to do it I'm not going to be lenient. Being critical is fine, filling the comments with noise isn't.
Pretty sure on mobile on my Android Chrome I can long press and get the tool tip. Just a heads up. IDK if maybe it's different on safari
 
Upvote
5 (5 / 0)

ramblevine

Smack-Fu Master, in training
79
This response is totally inadequate.

Benj Edwards fabricated quotes. That is a catastrophic, inexcusable journalistic failure.

Anything short of firing Benj Edwards and transparently establishing new editorial processes to prevent exactly this kind of failure from happening again is inadequate.

Like all people who get caught using LLMs to do their work, Edwards claims that this was the first time, that it's an anomaly. This is an obvious lie. Ars, of all outlets, should be intimately familiar with this pattern.

Fabricating quotations is the single most colossal error it is possible to make in journalism. Memory holing the article, obliquely claiming this appears to be an isolated incident, and keeping the known fabicrator on staff is a clear announcement that Ars Technica is not a journalistic outlet, has no respect for its readers, and is not trustworthy.

Others have speculated there may be an ongoing investigation, and the matter is not concluded. That doesn't appear to be true from the statement here or from Benj Edwards' personal announcement. I am prepared to be surprised, but as matters stand I will not be reading anything from Ars Technica, and I will make clear to those in my limited sphere of influence that Ars Technica is not trustworthy.
 
Upvote
6 (31 / -25)
Post content hidden for low score. Show…

graylshaped

Ars Legatus Legionis
67,695
Subscriptor++
Every place where I had the power to push to prod I could have easily done it without going through any intermediary steps. Not the best practice, but this was consistent from small non-profits to one of the largest market cap companies in the world.
But you knew better.

Um... tell us you knew better?
 
Upvote
12 (13 / -1)

Eurynom0s

Ars Tribunus Angusticlavius
7,885
Subscriptor
I agree with your general sentiment around keeping incorrect text up, but I think it gets thorny with fabricated quotes. Future AIs will inevitably slurp that up, ignore the context that they were fabricated, and then confidently assert that they were actual quotes.

Well as we saw the LLMs will make stuff up regardless, so better to have the source live with RETRACTED SEE BELOW FOR DETAILS prominently at the top of the original article so that people who click through to see the source at least see that, instead of say the unfixed version on the Internet Archive.
 
Upvote
14 (14 / 0)

Aurich

Director of Many Things
40,904
Ars Staff
Pretty sure on mobile on my Android Chrome I can long press and get the tool tip. Just a heads up. IDK if maybe it's different on safari
Hmm, doesn't seem to work on Safari. I just tried it on Jim's eject icon, just tries to select it instead of the tool tip appearing.

Just a note: it's 100% fine to be critical of Ars and this story. I'm staying out of this discussion but I'm not trying to influence it by moderation. But I'm not going to overlook the rules here, and changing people's quotes is a basic one that I'm not gonna let slide. Just for the transparency for why Jim was ejected for 12 hours.

Please don't do that. It's in the posting guidelines for a reason. Doing it to be funny or cute isn't an excuse.
 
Upvote
40 (40 / 0)

Vladimir Ilyich Ulyanov

Smack-Fu Master, in training
17
You can even call me Al, @pauleyc

Fair point about letting investigations complete, but he's admitted to fabricating the quote. That's not speculation, and in my opinion, that's a fireable offense.

He says that he was sick. He says that it was AI that hallucinated the quote. I'm not calling him a liar, but why it happened is immaterial.
It's his name on the byline, and the article has exposed Arstechnica to a lawsuit because Benji, insanely, has admitted to fabricating a quote publicly. That's before we get into the damage to the site's reputation or its claim to journalistic integrity.

We can wait for Ars to complete their review while still recognizing that the admission itself is significant. Those two things aren't mutually exclusive.
 
Upvote
-17 (6 / -23)
Post content hidden for low score. Show…

xoa

Ars Legatus Legionis
12,364
Subscriptor++
The comment locks I expect and think are good practice. In the absence of information, speculation turns to toxic rumor-mongering and assuming the worst.
While I agree fully with the vast majority of your great post, I strongly, strongly disagree with this part. First, because a good community can and generally does stick to higher standards, and as we see in this thread has been pretty measured. Simply assuming everyone will behave badly is frankly directly insulting. Some of the most controversial issues of the day have gotten (and are getting) discussed on the Ars forums, issues where the stakes and emotions go far far higher than this. And second to that point, if anyone individually goes over the line that's what moderation is for. Right? Moderators can just use the mod yellow box and say "Hi everyone, remember this is ongoing, please keep things cool" and eject the repeatedly misbehaving. And that's what happens on the Soap Box.

Simply insta-locking all threads without anything objectionable going on ahead of time both feels like a cover up/wagon circling, and on top of that doesn't even accomplish the objective. Ars Technica isn't some tiny little blog since long ago, it's gotten fairly notable. And in turn there was rampant discussion threads on Hacker News and no doubt other forums around the internet. Which didn't generally go the way you're suggesting either, I might add, but even if they had wouldn't that more support keeping it here? Ars isn't an island, shutting people down merely pushes them elsewhere.
So I'm willing to give Ars some space only so long as I can believe they're working to do the right thing, and that the job isn't done yet. We'll have to see if this incident plays out differently. If this "notice" ends up being the final word, then I'll have to make hard choices.
Yes, but I think we can say that objectively speaking, regardless of personal individual aspects around the author himself, there was a clear systemic failure here on the part of Ars itself too. For those parts it doesn't actually matter at all whether someone was malicious/grossly negligent or made a perfectly reasonable mistake or whatever. A few particular systemic things I'd like to see addressed:
  1. How are employee incentives/continuing ed/ethics in journalism etc handled and maintained, if at all? Are there any pressures to publish regardless of skipping steps (there might not be, but good thing to check)?
  2. Does Ars actually have any sort of formal independent normal journalism basic fact checking system in place at all? Ie, each submitted article must have sources for each fact and an editor or whomever goes through one by one and follows up to double check them all. If it's there and not universally used, why not? If it was used how did this fall through?
  3. Is there a formal checklist for handling retractions, including community interface? If not, that should be addressed as well. If so, what is it and how was it followed?
Best case would this being a moment to mature as an organization and be stronger going forward.
 
Upvote
22 (24 / -2)

DarthSlack

Ars Legatus Legionis
23,059
Subscriptor++
The thing I think a lot of people are missing is that in order to understand these AI tools, what they can do, and their limitations, they have to be used. And ideally used with real-world use cases.

I think it's entirely expected that an AI reporter would be using these AI tools as much as possible. Obviously something went wrong this time, but you're not likely to be a good AI reporter if you never use the tools you cover.


That's not the line anyone is pointing at. If Edwards was reporting on the tool, it would have been a very different story. But that's not what happened, he used AI tools to help research and write the story. Anyone with any familiarity with using AI knows that's playing with a lit stick of dynamite. LLMs hallucinate, that's what they do and a senior AI writer should know that.
 
Upvote
52 (53 / -1)
The thing I think a lot of people are missing is that in order to understand these AI tools, what they can do, and their limitations, they have to be used. And ideally used with real-world use cases.

I think it's entirely expected that an AI reporter would be using these AI tools as much as possible. Obviously something went wrong this time, but you're not likely to be a good AI reporter if you never use the tools you cover.
What went wrong was twofold:

1. He "tested" AI by using it in prod without actually checking the results. That's not a real test, or alternatively it's a test of "can I get fired for this".
2. He was too sick to work but checked in bad work so he could get paid anyway.

Your point about real-world use cases is woefully incomplete. You don't use new tooling, you don't rely on new tooling, without testing it properly. This was not the way to do it. It was somewhere on the laziness-fraud spectrum and Ars leadership has to decide exactly where.
 
Upvote
49 (51 / -2)
Post content hidden for low score. Show…
Status
Not open for further replies.