I saw it on Mastodon but only from other accounts linking to the article. I follow the Ars account feed and it never showed up there.I saw it in my own Mastodon feed from the arstechnica flipboard account. Didn't see it in the mastodon.social feed. I'm honestly not sure if Ars even runs the "arstechnica" profile on flipboard, but it looks official, so maybe that's what you remember popping up in your feed?
I hope so, anyway. Although either way... The retraction really ought to be going out on social media feeds just like any other article does.
I can relate. I once was fired by Microsoft, but laid off by the contracting company I was working for. They knew the firing was just more stupid MSFT politics so they made sure I was eligible for unemployment until I got the next gig. Try explaining that on a resume.I have (thankfully?) been terminated twice under such vague and mysterious circumstances that I genuinely could not explain my termination.
"Using AI" and "not verifying the accuracy of the output of the AI that is appearing under your byline" are not equivalent. Personally I've chosen not to engage in the former because, as someone else said above, the first hit is free. Once you've crossed that line keeping your footing solid while standing on the slope on the other side takes a lot more discipline than most humans have.For everyone brandishing the pitchforks I suggest you read this Columbia Journalism Review.
Some journalists that are using AI:
(snip)
The only thing that's hypothetical about that is using an LLM to do it. I could write this in either JavaScript or PHP using RegEx to match patterns with a very high degree of accuracy and my degrees are in Communications, not Computer Science.Hypothetically, you could use an LLM to create a software tool that downloads the content of a webpage, identifies the quote marks, and copies the text between them. That would not be subject to hallucination.
You just made me realize how much DuckDuckGo's AI identifying the sources it uses has been becoming part of how I search. At first I ignored it, but now I fairly often glance at it and then click the link it gives to the sources it used. It saves me the trouble of scrolling through all the crappy AI-generated clickbait sites in the actual search results to find those sources.So the only use I've had for it along these lines is using it to find primary sources because Google has become so degraded as a search engine as to be useless. Would rather use a search engine but most either rely on Google's useless index or have their own useless index.
The double byline on the story makes me think Benj probably was online while sick, which is not something I would say is inherently wrong, and saw the legitimately interesting emerging story. So then he asked a colleague to help write it.IMO, Ars needs to take a deep look at itself. BIG "if" here, because I don't know what the actual scenario is, but it's plausible that the below is similar to what might have happened:
If Benj felt so pressured to push out an article quickly, despite being sick, then the fault is not just on Benj for writing it, nor just the editors who didn't review it, but also on Ars' management/ownership. Only in the US (among supposed first-world countries) would such a scenario be even remotely normalized, and even then, it would plainly be the wrong way to do things. A culture where an employee doesn't feel they can call off when sick means that the employees are not secure, they're not safe, and they feel they have to choose their job over their health. It doesn't matter if you have FMLA days left or sick days accrued if you feel that you'll be punished in some way for using those "benefits," or that you have to work through sickness to "keep up" at work.
Now, there are many other things that could have happened. Hell, Benj could even be making up the illness after the fact. But there is something concerning in this entire chain of events that suggests something is deeply wrong.
I probably didn't make myself clear, but what I was trying to say is that I don't accept the AI summary at face value but click the link it provides to the source(s) it cribbed from. I don't know if Google provides those because I rarely use Google anymore.Be careful with using AI summaries for scientific topics. Within my subject area, the AI summary frequently presents some entirely nothing paper from a vanishingly obscure journal instead of the most important / best studies. These summaries also make up facts by combining sentence fragments from their “sources” in a stochastically plausible manner.
Which is not to say that SEO didn’t already do something pretty similar back in the Age of Search. I’m guessing the search companies weighted the models for their own, terrible, SEO algorithms.
If anything you’re better off asking chatGPT. There’s a (slightly) better chance that an actual human might have reviewed the outputs at some point.
Okay, I've been staying out of it but your last paragraph made me think about how that describes exactly how I managed teams too. But where you lose me is that there's a big difference between doing a dumb and blatantly subverting stated policies and procedures.That would be bad. I still don't think that's a fair analogy.
To stay with the car analogy: he made a huge scratch on the front door. It's ugly and totally unprofessional. But he then apologized profusely, took full responsibility, and told me he will do whatever he can to fix it.
So, to be clear: nobody got permanently hurt, and he wants to fix it.
As I find it rare for people to own their mistakes, and as there is no pattern of this being a systematic problem, I am leaning heavily towards forgiveness.
You can call me a shitty leader if you like, but I treat my team the same way. The result: they're HIGHLY productive, NEVER afraid to own up to their mistakes, and we have a healthy culture where discussing f*ckups and fixes is never toxic or dangerous. That goes for me to: I am secure enough in my leadership position to share my own f*ckups with them on equal terms. In leadership terms, this is called "psychological safety," and it's the means to make high-performers stick with their job for a very, very long time.
You know how sometimes one person on a mortgage ends up suing the other person on a mortgage? That's an outcome of trust being violated. Taking joint responsibility for the mortgage means if the other person bails on you, yes you are still responsible for making payments but they can and should be held culpable for the financial and reputational damage they have done to you by violating your trust.Maybe I’m naive, but I feel a joint byline is like jointly signing a mortgage. Both are willingly responsible, and if things go sideways, it’s on both of parties. We give a lot of responsibility to journalists in exchange for higher expectations.
I might be spliitting semantic hairs but I think there's a big difference between making an excuse and having an excuse. We're in a linguistic period where the word "excuse" is more often used in the former sense of trying to deflect blame, but we do still use it in the latter sense of agreeing there are mitigating circumstances.May I suggest a qualifier? An excuse is something that makes something seem okay, but may or may not represent reasonable justification for the choice made.
I think a lot of people don't understand how much of an outlier The New Yorker's editorial process is. I consider myself lucky that I ever had a job where what I wrote was looked over by a copy editor. A fact checker would have been a total fantasy.I don't know the answer. But speaking as a professional editor, I can say that verifying sources is something we really don't want to do or have time to do. We really need to depend on our writers to provide us with good sources, correctly quoted and properly cited.
Of course, "verifying sources" can be more, or less, rigorous; and sometimes rigorous verification of sources, especially in book manuscripts citing other books, isn't realistically possible given time and budget constraints.
However, if sources are to be verified by an editor, then the kind of editor normally tasked with such verification would be a copyeditor. (And most copyeditors I know are already overworked.)
Edit: spelling
Every place where I had the power to push to prod I could have easily done it without going through any intermediary steps. Not the best practice, but this was consistent from small non-profits to one of the largest market cap companies in the world.The problem with this analogy is that apparently the system in place allowed the code to go straight to production without anyone in Dev, Staging or QA review it. Yeah, the bad code is a problem, but so is the system that allowed it to production without more than a cursory glance.
I don't know about you, but if my QA lead found me running a process that allowed straight to production, I'd be smothered in honey and staked out on an anthill.
I, uh, did know better. Although I am pretty sure at one job we may have jointly agreed to push a critical, but low-risk, fix straight to prod. That would have been 20ish years ago and things get fuzzy.But you knew better.
Um... tell us you knew better?
I'm kind of feeling the other way personally. I lurked for a long time in part because it was clear if I commented I'd occasionally get my ass handed to me on a plate, and since de-lurking that has definitely happened.Perhaps the lurker has been debating whether this bloodthirsty community is one he wants to pay to join?
For clarity moving forward, can you confirm this was because the quote was replaced by commentary and not because it was simply modified? When responding to a point made in a 500 word post, snipping out 470 words to leave just the 30 words needed to contextualize the response is okay?Two tips people may or may not know:
1) If you click on the little meeple icon on people's posts it will show you everything they posted in a thread
View attachment 128410
2) When we eject people there now a notice on their post, and it remains even after the eject expires. It will explain the reason, the length etc (this is why the tool tip thing on mobile isn't really an issue for ejects, there's always a clearly visible message on the actual post itself with any browser)
For convenience here is that post, and the explanation I left:
View attachment 128409
If you wouldn't mind deleting the first paragraph in what you quoted from me, which I had not actually intended to post, I'd appreciate it. I don't know if it's just a Firefox thing but the text box on Ars forums can hold on to stuff that I've manually removed sometimes and that's what happened here.I came here to post something similar. I've seen other threads where folks will snip, even summarize the responses of another with [clearly edited] brackets. I don't actually see a problem with this, and have definitely seen it not punished before.
I would suggest treading carefully as a mod. I don't think that suddenly applying the "letter of the law" is the best move here. Spirit is probably a better call given the totality of the circumstances.
Unrelated, but I'd also agree with some of the others in here that Ars has, over a decade+, slipped into very low value "reporting." You know, the very stuff that AI gets like 80%+ of the value from at a fraction of the cost. I'd personally be a lot happier if Ars went back to a model of "artisanal" reporting, with only highly in-depth, well-researched articles published. The organization would be a lot less likely to have this happen again, without the overhead of editors suddenly needing to double/triple their time reviewing an article before publication.
Just some free, unsolicited advice from a roughly 20 year reader. Hope you get your money's worth!![]()
Fair enough. Ironically I just arguably violated the rule even though I had thought better of posting what I wrote but my browser didn't respect me deleting it from the text box. It happens sometimes, probably due to how the multi-quote script works.The rule is to not modify people's words to things they didn't say. What's in the quote box should be words they typed.
It's fine to take a section of someone's long post and reply to just the relevant part, provided you don't do it some malicious way. Selectively quoting just part of a sentence to change the meaning to be cute or whatever. Basic common sense applies in other words.
Otherwise that's totally fine.
The rule exists for a bunch of reasons, but there are two primary things at play here:
1) If you are replying to a post people should be able to see what you're replying to. It's basic courtesy if nothing else
2) Even when the modification is supposed to be funny and not malicious we can't play the game of deciding when it's okay or not. It's a blanket rule, and once we start adding exceptions it just turns into a game of whack-a-mole
While I don't think this was directed at me specifically, I always find it interesting when people on the internet tell me that the fact I have a different opinion from them is solely because I'm emotionally upset, angry, or whatever when I'm actually feeling quite calm and composed. It may actually be the closest I'll ever come to knowing what my life would have been like if I'd been born into this society as a woman.Nearly the entirety of this thread has been emotional. Anger at how Ars has handled the retraction. Expressions of feeling "betrayed" by being fed misinformation. And most disturbingly, in my opinion, calling for the summary dismissal of an employee without a complete and thorough investigation.
This thread has become a horde of villagers with pitchforks standing outside of Frankenstein's castle, with a small contingent of other villagers at the gate saying "well, let's just wait a moment and evaluate this."
What I was reacting to was your characterization of "nearly this entire thread" as emotional. While I've certainly seen some over the top appeals to emotion I wouldn't say that they dominate the discussion. I've read, I believe, every comment and if you aren't sure you've read my earlier ones that suggests to me that maybe any statement about the emotional tone of the entire thread might not be as objective and dispassionate as you think.No, it was not directed at you specifically. I'm not sure I've read your previous comment(s). My only point was that this is a very emotionally charged thread. And certain allowance might need to be made to accommodate that.
I'd just like folks to step back for a moment and consider some different perspectives.
It's not about me. I feel you've discounted the actual level of discourse of nearly everyone else.If you feel I in any way discounted your comments, I apologize. It was not intended.
It's an interesting thought, isn't it? Now I'm remembering a semi-serious/semi-joking paper I wrote in 2008 for an algorithmically generated evaluation system for determining the trustworthiness of social media accounts. I'm not a CS person but I knew there wasn't a good way to do it then and I mostly was trying to show original thinking for the class I was taking. Now I'm like, "Hmm. Something like that could probably be done with AI (or 'AI') to evaluate, categorize, and assign trustworthiness scores to sites in a web directory."Every time I search for technical information, only to find a mess of auto-generated, referral-link infested slop promoted via SEO, I keep thinking of how a modern web directory could help - even if it only covered a small proportion of sites.
Ultimately, when looking for information, the best option is to go straight to a website that is likely to have the information to begin with. Astronomy gear? Skip the horrendously-incorrect slop sites full of random referral links and go straight to Cloudy Nights or Astrobin. Computer gear? Skip the search engine and look to see if Ars or TechPowerup has a review. Search is rapidly becoming worthless, and we need to fall back on trusting individual sites.
But how to know which sites even cover a topic to begin with? It used to be that you just went to Yahoo Directory or DMOZ to find websites on a particular topic. These days, you need to stumble across them. Returning to a directory-based approach could help with discovery.
But there would need to be some way to keep the slop factories out. If the directory was fully open and they could just spam their multitude of identikit sites in to every category, that would kill the directory as quickly as they killed web search.
Even better would be some kind of carefully-curated list ranked by reputation (as assessed by experts in the field). But getting experts to even agree in the first place would be a challenge.
There's another definition of "fabricate" which is entirely devoid of intent. And he used a piece of software which fabricated quotes in that sense of the verb.Does intent matter in this situation? There are lots of accusations of "forging," "fabricating," "lying," and so on throughout these comments, and every single one of those requires intent to deceive. To whit, from the macOS dictionary:
I don't believe it's likely that Mr. Edwards intended to quote the subject of his article incorrectly, therefore I don't think those adjectives accurately convey the situation.
So back to intent - does it matter? My initial reaction is that it should matter, but I'm open to other thoughts on the matter.
Okay, so the new twist in the story of the human taking credit for a bot's bad work is that the whole thing was kicked off by another human pretending to be a bot upset that its bad work was called out? Have I got that right?Here's a hilarious version from the person doing the digging: https://social.treehouse.systems/@ariadne/116065540537300456
Exactly my point.I don't think this way of looking at it improves Benj's position, what if AI meant Actually Indians? I passed my work to them and slapped by name on the result, I just didn't verify their work properly. But you're the whole source of the problem to begin with, there's no way AI slop ends up in the article without you handing the reins to AI, even when there was an explicit policy not to. Stealthily outsourcing your work should be at least as big a no-no as making shit up yourself.
The expected transgressive thrill of going into the girls' bathroom was such a letdown when I did it in third grade that I really can't understand the total failure of imagination involved in thinking it's the best answer to the question "If I was me but I looked like a woman, what would I do that I can't do now?" There definitely has to be more to the psychology going on there.I let out a pretty depressing chuckle anytime I hear the bathroom argument. The only trans person I know personally is a trans man. He has a shaved head and a big, burly beard. If he went into "the right" restroom, these people would have a fit.
It varies not just by state but by company. I've worked at places that allotted a certain number of vacation days and a certain number of sick days. But by the late '90s the definite trend was to lump them together into one bank of Personal Time Off days. Obviously this incentivizes not taking sick days so that you'll have that time available as vacation days. My wife is currently in a situation where because basically any day that she answers an email counts as a work day (which is very much not true at many companies) she's taking every other Friday off for a few months so that she doesn't hit the limit of PTO she can accrue.i actually have a question about this. earlier someone mentioned not wanting to use "PTO" to cover sick days. i thought this was just a terminology difference, but is it actually the case that in the US, you have to use vacation days to get paid sick leave? or is this something that varies by state? (in which case how does it work in NY, where i think Ars is based?)
If you ask a search engine to help you track down a quote, you know that it's going to show you what it finds. If you ask AI software to act as a search engine and help you track down a quote, you don't know for sure that it's going to show you what it finds or if it's going to try to please you by making up something that sounds like it could be something it finds.It cannot “help track down quotes” without creating content. Selecting the quotes you want to use IS AUTHORSHIP. If I cite a case and want a quote from it, selecting which quotation best benefits my argumen is part of my specialized skillset. That quote is part of the brief I turn in or what I say at oral argument. M It is, while a quote, also a fundamental piece of MY content. I stake my credibility to what I turn in, including the portions I choose to quote.
Thank you for introducing this topic. I look forward to the discussion it will engender.Now that the conversation has shifted to who's using which bathrooms and who's posting too much, it's probably time to close this thread.
At least with the ones I use if I ask them to track down a quote they are going to present me with a list of links to pages that already exist and which match to some degree the criteria I asked them to search for. Now, those pages may be infested with AI garbage but if I'm only looking at search results and not the clearly identified text generated by the search engine's own integrated AI, the problem you cite isn't an issue.Can't really trust search engines anymore as they're getting increasingly tied with AI...
The only time I even think about Bing is when I remember reading when it was launched that they intended it to become a verb the same way "Google" had. I immediately saw the problem with that is that for any native English speaker the natural past tense of "Google" is "Googled" but the past tense of "Bing" is "Bung." Picking that name was another classic Ballmerian bungle.Confusingly, Copilot has now gained a Search mode which does focus on giving you the citations. At the moment that is separate to Bing Search and, unfortunately, it is vastly better than keyword search in the case where you would otherwise have semantic difficulties, like searching for stuff about the band "The The."
But it's quite possible that once Microsoft have kicked that around a bit they'll put it behind Bing by default, and then you will indeed be using that engine whenever you just use Bing.
(The only thing that can save us here is that no-one uses Bing.)
The snarky reply that popped into my head was "Ones written by Molly White," but honestly that's not far from the truth. I value distanced skepticism over cozy access, and I am far more interested in the broader societal effects of any technology than the touted benefits of it in specific uses.I'm just curious, what type of AI related articles would Ars members want to read?
While I haven't said anything at all about libel, it isn't a "ridiculous" concern at all. If a plaintiff can convincingly demonstrate they suffered reputational harm as a result of untrue published information, they win a libel suit. It doesn't matter in the slightest if the falsehood was written by a human or generated by software, it's still the publisher that bears the risk. The only justification I can see for claiming it's ridiculous is if someone felt AI was incapable of error.In terms of Mr. Edwards putting Ars Technica at risk of libel suits (or whatever I am reading in every third post) ... that is ridiculous at least in terms of USA laws.
No, there's more nuance to it than that.Under U.S. law, a statement has to be shown to have been knowingly false at the time it was made. If the person posting the AI slop believed it to be true at the time of publication, that's all that's needed as a defense.
I am a parent and I refrained from saying it earlier but you are making exactly the same error people who say all gay and transgender people are pedophiles make. I didn’t reply before because I was giving you the benefit of the doubt but perhaps I shouldn’t have.You are either not a parent, or someone who should be watched carefullly.
To be clear, I have never engaged in sex tourism and never intend to but I think you are potentially libeling the Netherlands, Germany, and possibly other countries I don’t know about with different laws than the majority of US states. Hell, you might be libeling Nevada and that’s an accomplishment.There are countries VERY well known for facilitating child sex tourism.
Most of them are the same countries known for sex tourism at all.
I don't have a problem with someone consensually using the services of a sex worker, but the phrase "sex tourism" frequently means abusing "marriage services" even when it doesn't mean abusing children. It's not really a label anyone should want hung on them, even if they're an out and proud hedonist.
It's a ridiculously loaded thing to talk about on the internet, as the reply from the poster who first brought it up and is now insinuating that my objection to "if A, then B" assumptions about people suggests I have something to hide shows. But that said the first place that pops into my mind when I hear the term is Amsterdam. And just to reiterate what I said before about myself no, I've never been to Amsterdam. I lead a pretty dull life and that's okay with me.When I hear the phrase "sex tourism" used, it's more frequently applied to Thailand, the Philippines, and Russia.
I'd love to go someday.I have. No hanky panky, though, just delivering a talk, visiting a tea shop and a bunch of museums. The Netherlands is pretty lovely!