If it's interesting context or if anybody cares regarding "active comments section", that comment section ran for about two pages (definitely sure) in maybe ninety minutes (less sure).5. Shit hit the fan, etc. As I understand it, there was an active comments section following Shambaugh's engagement.
That's fucking hilarious - in an article about inaccurate AI slop, you're suggesting replacing a human author with more AI slop.
SMH
Edwards believed that the tool he was using was actually going to perform as advertised.
That belief was reasonable (I haven't looked into it, I can imagine an architecture that would function in a way that would reliably allow it to only output verbatim quotes).
This was the first use of a tool like this to produce an article submitted to ars by Edwards for publication.
The article contained falsified quotes that were not defamatory or substantially harmful (I haven't read the quotes, reminder, this is the most charitable interpretation)
Edwards was so sick he was not capable of making good decisions.
Edwards is a bit of an AI aficionado and his judgment was influenced by this.
Many in the community believe the appropriate response should be termination and failure to do this would permanently damage the reputation and culture of ars (both internal and external).
This has happened before on Ars and every time it happens it is incredibly disappointing. FFS Ars wont let you delete or edit comments after a grace period, but the much more important front page reporting can get memory holed or edited to be a totally different article, with no way to read the original article.https://publicationethics.org/guidance/guideline/retraction-guidelines
In order to be best practice, the original text should still be readily available, clearly marked up with what was wrong with it and corrections to it, along with an explanation of how this happened and why it shouldn't continue happening.
Benj did not write (or imply) the tool was used for the first time. He merely mentioned it was “experimental”.
After that tool’s failure he then “pasted the text into ChatGPT”, not saying anything about whether that’s his usual workflow that he has done a hundred times or something he only did once.
We have reviewed recent work and have not identified additional issues. At this time, this appears to be an isolated incident.
I don't think "deleted"; the mods/admins have expressed an unwillingness to do that except for first-post trolls. Hidden/moved seems a better bet, especially given the "You do not have permission to view this page" error when you try to access the thread; when something is deleted from the forum and you try to access it, you typically get a Moonshark, not a permission error.Ars locked the comments section, and has either hidden or deleted all comments on the article, so I can't verify what happened there. (Or moved the thread, etc. I don't know how this gets handled.)
That may have been a result of some closer over site of her work. Do you not remember the two articles she wrote early on (regarding NFT's if I remember correctly) that got torn to shreds in the comments similar to this one?I find that Ashley Belanger's articles generally have a more critical angle WRT AI, and provide a balance to the more pro-AI viewpoints. Overall, I think they try to provide balance, but in a tech news site anyone is going to struggle trying to maintain that.
I wouldn't say no to Jim returning but I'd prefer he focus on ZFS for my own selfish reasons.Well, are you planning a comeback ?
Asking for.......about most reader on here
Agree, have reworded that section.I don't think "deleted"; the mods/admins have expressed an unwillingness to do that except for first-post trolls. Hidden/moved seems a better bet, especially given the "You do not have permission to view this page" error when you try to access the thread; when something is deleted from the forum and you try to access it, you typically get a Moonshark, not a permission error.
I don't think 5a happened before 6. Listing it first implies Ars tried to shut down the conversation while leaving the article up, and I don't know that it actually happened that way. Of course once the original article was pulled, it makes sense that the comments would also be hidden, that has happened before.I've had to read too many posts to try to piece everything together but this is where things seem to be at:
1. There was a minor social media storm last week about how an OSS maintainer had rejected a PR submitted by an 'autonomous AI agent', and how that agent had then written a blog post smearing the maintainer and accusing him of 'gatekeeping'.
2. The maintainer, Scott Shambaugh, wrote a blog post on the subject.
3. Ars published an article on Friday at 14:40 EST (byline Benj Edwards and Kyle Orland), titled "After a routine code rejection, an AI agent published a hit piece on someone by name". It included several quotations, attributed to Shambaugh's via his blog. The original article is available on the Internet Archive.
4. Scott Shambaugh showed up in the comments and said that several of the quotations appeared to be fabricated. (I'm being generous with "appeared to be fabricated" here.)
5. Shit hit the fan, etc. As I understand it, there was an active comments section following Shambaugh's engagement.
5a. Ars has hidden/moved/deleted the original comments thread, so I can't verify what happened there. There is now a new comments thread, locked, with one comment that links to this Editor's Note.
6. Ars retracted the original article on Friday at 16:22 EST (1hr 42m after publishing) and have replaced it with a retraction notice. They have also published an Editor's Note - this page - on the subject on Sunday at 13:09 EST.
7. Shambaugh has since published a follow-up blog post with some screenshots.
8. Benj Edwards has since posted (Sunday 16:03 EST) a mea culpa on Bluesky. (I would prefer people to read this rather than rely on me attempting a summary.)
Have I missed anything?
Edited to better word 5a, and add an extra timestamp in 6.
The sad reality is that the "AI" bullshit generator industry is undermining any kind of media. On YouTube when I step out of my bookmarked and largely trusted channels I feel inundated with completely made up or at least significantly slop-contaminated videos to a degree that the platform is becoming unusable for discovering anything new.Ars Technica itself has suffered a step down in credibility, not just these authors. I now have to read every article here with a more skeptical eye, and that’s more work for me, as well as being disappointing. It’s going to take public announcements of improvements in editorial standards and practices that I never worried about before to recover that trust.
The comments were locked somewhere around 3:30 Central (quoting from my own memory here), and the article remained up for, eh, a few tens of minutes thereafter. But the duration was short enough that it could well have been the time it takes between clicking one button and clicking another.I don't think 5a happened before 6. Listing it first implies Ars tried to shut down the conversation while leaving the article up, and I don't know that it actually happened that way. Of course once the original article was pulled, it makes sense that the comments would also be hidden, that has happened before.
That's just how LLMs work. They produce text that is a high-likelihood possible follow-up to the text that is in the prompt. There's no "thinking" about meaning. If you prompt an LLM for an article, and articles typically have quotes, then the LLM will produce an article with quotes, because that's the kind of text that it is emulating. The likelihood that the quotes will be real is not particularly high because that's irrelevant to how LLMs work.Can someone eli5 why the designers of llm services decided to let the models generate "direct quotes" instead of having them use copy-pasta, as humans would?
What are you even talking about? Sure that this paragraph wasn't auto-generated without any comprehension?Boggles my mind how people can look down at the 'machine slop' but then also want to help the machines with their 'slop' by changing their behaviors to accommodate them. Stop evaluating what are the appropriate behaviors based on how much it helps out the machine with their 'slop', that's just nonsense. The machine's job is to manage the slop, not yours.
That's what a thoroughly vetted autopsy would and should be for!Furthermore, by removing the original article and not having to strike out the offending material, they also conveniently get to avoid acknowledging specifically what parts of the article were AI generated, which it likely went beyond just the quotes. This to me is most suggestive of their incentives, they do not want to leave the article up because they don't want to provide specifics about what they need to correct or identify as AI generated.
Or that they shouldn't simply mail things in.Hopefully Benj and Kyle have learned that AI sources are simply not to be trusted.
Yes, it was two pages (63 comments). I actually have the first page saved, and would post a link to an archive of it if I didn't think Ars might get cranky about it and ban me.If it's interesting context or if anybody cares regarding "active comments section", that comment section ran for about two pages (definitely sure) in maybe ninety minutes (less sure).
Have edited (again) to try to make it a bit clearer. I don't think the exact order of that event is particularly material, but have tried my best anyway. There were probably several actions wrt comment threads, and again I don't think the order is exactly important:The comments were locked somewhere around 3:30 Central (quoting from my own memory here), and the article remained up for, eh, a few tens of minutes thereafter. But the duration was short enough that it could well have been the time it takes between clicking one button and clicking another.
I had actually read Scott Shambaugh's blog post around 3 a.m.Friday because one member posted it in the comments of the OpenAI article. When I saw the ars article about it, I naturally wanted to learn more and when I saw Scott's comment, I went and scanned his blog post and confirmed his blog post hadn't changed, (because some speculated he may have changed the post, etc).I've been having a difficult time reconciling why some people are highly upset at Ars for deleting the article, locking comments, etc.
Then it hit me, it doesn't bother me because I just happened to be online and actively participating in the article's comments when Scott showed up and it went down. I knew what the article said, what Scott said, verified he was telling the truth, and was among those saying we deserved an explanation. So for my personal view, I got the explanation I needed of "we're looking into it", saw the article got taken down, and saw the responses to that in the comments as well as on two new threads in the forum and I was good with that.
Now that I put myself in someone else's shoes that missed Friday completely and just saw Ken's mini article today, I can absolutely understand why others are as upset as they are about it.
Normally, the editor would be the fact checker. For the "fake text", it was a fake quote, and something an editor would not be able to check. Ars burned their sources.
I hadn't considered this, reduced credibility with sources could be one of the biggest negative consequences for ars should Edwards continue to be employed. And it might not manifest in particularly obvious ways, fewer high quality tips, more guarded responses to requests for comment, more refusals to comment, less collaboration in general.their sources
Right here, three pages ago.The honorable thing would be for Benj to resign without waiting to be fired. Ars should publish the resignation letter as part of the post-mortem process. I can't believe it took 12 pages of comments before someone says this.
I realize there are real ramifications for unemployment benefits and health insurance when you resign instead of being fired, but it is still the right thing to do.
As for the article itself, clearly you don't want the AI hallucinations on the web because they will spread. That said, the rest of the article is fundamentally sound. It should be put back up with a heading note explaining what happened, and most importantly, with the AI garbage removed. Mark the deletions. Maybe put in legit quotes in their place. That should fix the issues.
Put the comments back. Ars placing a comment at the place where the original comments ended.
And yes, a full and public post-mortem.
I also wonder how much influence the parent company Conde Nast is exerting here.
I apologize for any typos because I am seeing serious input lag while typing this.
I fired my share of people over the years. I also did NOT fire my share of people over the years for what at face value would have appeared to be career-ending conduct. It's a process. Gather the facts. Sleep on it. Consult. Then, whichever way you go, own it.That's good to hear. But frankly, this is still the kind of "isolated incident" that should be considered an immediate firing offense. This was not a peccadillo, this was an utter abnegation of journalistic work, let alone standards and integrity.
If posting slop to the front page isn't a firing offense, I have to start questioning what the job is in the first place.
I'm on the fence about zero tolerance for using AI in this line of work. First, it's completely unenforceable for remote work. Second, we don't know what the future holds for this tech and its perception. Third, I think the best policy is to have those that submit work be accountable for what they submit, regardless of how it is produced.I have been a regular reader of Ars for over a decade. While Benj's linked apology is definitely worthwhile (a good apology makes a huge difference vs. a defensive-asshole stance), I believe any journalistic or creative enterprise should take a zero-tolerance stance towards AI tools. My initial instinct is that the writer making the error should be fired.
But I just said a good apology makes a difference, and I believe that. I would like to see a full postmortem published next week, with Benj's apology included in full, and a statement that he (and all Ars staff) will refrain from using AI tools in their work going forward, full stop. With appropriate apologies and lesson-learning, I think I am fine with a one-strike rule. Otherwise, Ars is losing a reader.
We were led to believe those quotes were taken directly from the blog post, so yes they could have checked by reading the source.Normally, the editor would be the fact checker. For the "fake text", it was a fake quote, and something an editor would not be able to check. Ars burned their sources.
The forest you missed while educating us on the terminology is that there is a whole lot of young smoke surrounding he who is currently the zero-th in line.While the posts here over-index on those that comment on Sundays, they do show the wide breadth of humanity: maturity, appreciation of due process, susceptibility to reductive thinking, the desire to gossip, and in a few cases abject stupidity.
It further reinforces my current theory: a good chance we are fucked as a species. Funny enough, back with Star Trek, the danger was the nukes. Turns out that's not going to be the root cause.
Can we at least settle this?
- The President is "zero-th" in line. They are currently holding the office
- The VP is first in line, in case something happens
- The Speaker is second in line. Even the phrasing with the word "from" helps to visualize this