Stack Overflow users sabotage their posts after OpenAI deal

Your general point is well made. On this specific point, some excellent songs succeed because of the contrast of the bright and wonderful melody, and then you listen to the lyric and it's in complete contrast.
Compare and contrast.

Exhibit A:
View: https://www.youtube.com/watch?v=djV11Xbc914


Exhibit B:
View: https://www.youtube.com/watch?v=-xKM3mGt2pE


For the record, I prefer the second - maybe it's my age and life experience, but it sings to me in a way that the first doesn't.
 
Upvote
1 (2 / -1)
While Stack Overflow owns user posts, the site uses a Creative Commons 4.0 license that requires attribution. We'll see if the ChatGPT integrations, which have not rolled out yet, will honor that license to the satisfaction of disgruntled Stack Overflow users. For now, the battle continues.
You can bet the ToS and licensing model will be changing very very shortly.
 
Upvote
1 (2 / -1)

mg224

Ars Scholae Palatinae
1,352
Subscriptor
Legally permissible copying is heavily related to the amount of the work copied. See "fair use." There are incredibly few answers on Stack Overflow that include a substantial enough proportion of a program's code to violate that program's copyright. Even in aggregate, it would be impossibly difficult to piece together what code belongs to which program, although perhaps the legal system will re-evaluate that if AI makes it considerably easier to reassemble said pieces.

Do you have a legal reference for that opinion? copyright is usually asserted per file of source code, containing maybe one class’s implementation (let’s not cite google/oracle here), in my experience.

There is also a difference in ‘putting it on SO’ and ‘copying off SO and putting in another project (possibly closed source, or for profit’.

Devs get one no-blame free-pass do over from me when they unthinkingly cut and paste from SO - the second offence is not so easily over looked.
 
Upvote
-1 (1 / -2)

jbode

Ars Legatus Legionis
11,625
Subscriptor
Whats the qualitative difference? So a programmer pocketing money is ok, unless he is working on AI?
Jesus, really?

It's the difference between helping someone like me do their job better vs. helping someone put people like me out of work altogether.

How is that not clear? How is that not obvious?
 
Upvote
6 (13 / -7)

Hagen Stein

Ars Scholae Palatinae
680
Subscriptor
I think this is just a case of shutting the barn door after the horse has already bolted. Not scientific, but at work, we tested out a few generic code generation questions via ChatGPT. Then we googled the offered solutions and we found either directly or INDIRECTLY the answer usually led back to stack exchange one way or another.

In that regard, phind.com is at least "honest" in the sense that it links the relevant articles/websites from which it composed the answers on the right side of its chat window.
 
Upvote
1 (1 / 0)
I think it's a testament to how civilized we've become as a society when a company can do something legal yet so infuriating without fear of physical violence from their users. Like if I somehow did that to someone I knew personally I'd be scared of him burning my house down.
People have forgotten that workers used to drag owners and managers out of their houses and beat/kill them in front of their families....maybe we need to remember history.....
 
Upvote
5 (5 / 0)

Hagen Stein

Ars Scholae Palatinae
680
Subscriptor
All that's missing is a Zuckerberg-esque quote from the early days of SO with a founder calling the users dumb fucks for freely giving so much info.

Joel Spolsky is the one founder I'm aware of. For a couple of years I read his blog, which I found informative. You'll find this article from 2021 there, announcing that SO was sold to an investment company:

Today we’re pleased to announce that Stack Overflow is joining Prosus. Prosus is an investment and holding company, which means that the most important part of this announcement is that Stack Overflow will continue to operate independently, with the exact same team in place that has been operating it, according to the exact same plan and the exact same business practices. Don’t expect to see major changes or awkward “synergies”. The business of Stack Overflow will continue to focus on Reach and Relevance, and Stack Overflow for Teams. The entire company is staying in place: we just have different owners now.

So the current development seems to follow the long "tradition" of tech companies that went down the drain when the original "techies" founders leave the company and MBAs take over.
 
Upvote
14 (14 / 0)

Enigma990

Ars Scholae Palatinae
967
Subscriptor
Has anyone considered the long term effects of AI training on AI generated content?

Seems like there will be a general enshitification of data that will drive the quality lower.

Once that's happened in a (human) generation or two, is anyone going to remember how things work?
All the AI will result in knowledge being stuffed behind paywalls. Want to lookup an oddball setup and query for WorkDay? Now you got to get an account, bound to a credit card with x number of lookups per day to thwart AI Scraping.

Instead of open, collaborative knowledge paywalls will go up.
 
Upvote
4 (4 / 0)

bp_968

Wise, Aged Ars Veteran
199
Oh how lovely. The AI enshitification of the web continues. Way to destroy online communities you greedy profit driven arse holes.
We need to go back to web forums. Do we really need to use some giant centralized company's product? We didn't in the 90s. News groups used to be amazing and mostly decentralized.

Remember, nothings free. If your not paying for a subscription, or donating bandwidth/drive space (p2p) then most likely you are (or will be) the product.
 
Upvote
8 (9 / -1)
The simplest form of protest would be to stop using Stack Overflow. If people keep contributing to it as a whole, then I would view it as acceptance to the OpenAI deal.
Unfortunately that’s not how most people think. They talk a lot but rarely turn their words into action.
 
Upvote
6 (6 / 0)
There's actually a nuanced version of this viewpoint.

There are people who love AI, but doesn't like the way certain AI companies are "strip-mining" the training data without respect for copyrights and/or otherwise not respecting for content creators.

I have been reading on both the pro-AI and anti-AI viewpoints and have come to realize that some of this is very nuanced.

One of the many examples are content creators whose content has been incorporated into current AI's and able to regurgitate almost everything that person wrote, even in niche industries, to the point where they are aghast.

Another of the many examples are the resentments by people who have been laid off by AI, or are threatened to be. Basically the Jacquard Loom Riots of 1801🔗 factor. If you read up the history, it is pretty clearly what some of these people are doing. Coders -- users of StackOverflow -- are especially vulnerable.

I don't like the loss of StackOverflow knowledge by sabotaging past posts, but when viewed through the lens of the Jacquard Loom Riots, it begins to make sense and sadly understandable, even if not necessarily endorseable.
That's probably a good way of looking at it. How well the weavers fared after 1801 is also worth considering.
 
Upvote
6 (6 / 0)

bp_968

Wise, Aged Ars Veteran
199
So many techies think that art is the final product. The film is the thing, or the song, or the book, or the image. And they believe that the final product can be arrived at without the process, and hence you get stuff like Dall-E or Suno, or this. “Just write a prompt, skip the process, and you’ll have your finished piece.” And they’ll call it art. It’s the ultimate get-done-quick-scheme.

The problem is, of course, that the process IS the art. If you remove the process, you basically create facsimiles of what’s been done before. If the facsimilies are wrong, you replicate the wrongs. LLMs can's tell which is which. And, you don’t get a The Godfather; you get #999 of a Marvel film containing lines and thoughts that’s been in all the previous Marvel films. There’s nothing new in it because the AI can’t think. Can’t invent. Can’t create. Can’t draw parallels or contrasts. Can’t enhance, emphasize, or diminish. It can only regurgitate. It can only be literal, but can’t do similes, aphorisms or metaphors, or god forbid original wordplay.
(Snip)
AI does none of that.

AI doesn't do that yet. The human brain is a pattern machine, and a soup of chemicals. It's not a singular magical object that's impossible to duplicate outside of divine hands.

AI will be disruptive, just like most other major technological leaps. I'm not sure how old the average posters are on here but do any of you remember the breakpoint of the internet era? That point in time where it started destroying businesses and business ideas that couldn't survive in an internet connected world?

Either AI will be able to replace coders or it won't. It will be able to replace artists, or it won't. If it can replace either of those professions, even a moderate amount, it will totally upend the way the economy is designed to function.

The refrain with automation was always that it would free humanity from manual labor, and I still believe that. The problem is if you free humans from manual labor and free them from a big chunk of mental labor then they don't really have much to fall back on anymore. So you either need to re-calibrate the economy so that the labor that robots and AI are doing directly fund the people they replaced, or, you need far less people.

I'm a bit nihilistic and so don't have a lot of faith in humanity to make an adjustment like that and it work in most people's favor. In fact, history doesn't support having much faith in that either. And current events show us that there are plenty of regimes (and their subjects) around the world that are not concerned with the general welfare of humanity either.

Let's maybe think about that before we hand over the reins of censorship and free speech, or get in too much of a hurry to toss either of those first two constitutional admendments under the bus (for those of us in the US anyway).
 
Upvote
0 (6 / -6)

bp_968

Wise, Aged Ars Veteran
199
That's probably a good way of looking at it. How well the weavers fared after 1801 is also worth considering.
Something else worth looking at is how everyone fared after all of that. The factory system harmed the hand weavers for sure. But drastically improved everyone else's lives over time. The world would look vastly different today (and much more similar to the 1800s) if none of that had happened.

Of course that doesn't really help the loom workers of the time who lost their income, possibly their homes or even their lives.

Sigh. Right back to nihilism.
 
Upvote
8 (8 / 0)
Some how I don't see people like Jon Skeet or Eric Lippert using a subtle insult tone.
They absolutely don't, but many, MANY others do. It's not accurate to suggest that a condescending, patronizing affect is not prevalent. It's so endemic that it is a meme.

Me: Why would this function return a one-off result? I've tried ... <LOTS OF EXAMPLES>

SOGPT: As any first-year college student knows ...

or

SOGPT: As the API spec declares, in the first paragraph ...

or

SOGPT: Not trying <XXX> indicates laziness ...

That said, the answers are often useful and correct, which is why I generally ignore the slaps. It's just rather puerile, and quite annoying, as, speaking only for myself, I'm an experienced engineer, with decades of shipping software behind me, and being treated like a lazy freshman is pretty annoying.
 
Upvote
-5 (2 / -7)

motytrah

Ars Tribunus Militum
2,942
Subscriptor++
I'm honestly surprised they didn't do this last year during the Reddit API protests when the writing was on the wall, to where they could get the damage to stick. As of right now, Stack Overflow is big enough to find ways to revert damaged writing faster than people can contemplate what and how to damage or remove user contributed content.
I would think they'll have a influx of people from "California" making CCPA requests.
 
Upvote
4 (4 / 0)

Ozy

Ars Tribunus Angusticlavius
7,449
Jesus, really?

It's the difference between helping someone like me do their job better vs. helping someone put people like me out of work altogether.

How is that not clear? How is that not obvious?
Because, at the very same time, people are saying that AI produces shit output, hallucinates, and will get worse and worse over time.

How can THAT put people like you out of work?
 
Upvote
2 (5 / -3)
Unfortunately that’s not how most people think. They talk a lot but rarely turn their words into action.
I haven't asked/answered a question on SO for ages. I may, occasionally, do so, but I haven't found the will, in a while.

Most of my SO karma is from the questions that I asked, as opposed to the answers I gave, so it doesn't really bother me.
 
Upvote
2 (2 / 0)

OllieJones

Ars Praetorian
536
Subscriptor
Hmmm. There's another piece to this story ya might have missed, Benj. The headline is this:

"Private equity-owned Stack Exchange Inc ditches sustainable business model, switches to harvesting assets."

Yesterday I got a popup alert on StackOverflow announcing something called StackOverflow Jobs by Indeed. WHAAAAT? The whole idea of StackOverflow was to create a highly effective tech recruiting platform by scoring publicly shared expertise. That was the WHOLE DEAL. It generated a huge trove of really good written material, not to mention a nice payday for Jeff Atwood, Joel Spolsky, and their team.

(Save the snark, OK? StackOverflow is exactly the kind of human-generated material LLMs must ingest to do what they do. It's even got crowdsourced quality ratings.)

They've dumped the recruiting biz, and sold the assets to Open AI? You think they care whether their contributors are upset? Naaah. Laffing all the way to the investment bank.
 
Upvote
-3 (3 / -6)
While the companies tout the collaboration's benefits, many Stack Overflow users have expressed their displeasure with the deal. This is especially true considering that until very recently, Stack Overflow seemed to take a negative stance toward generative AI in general, banning answers written using ChatGPT.

I don't see any inconsistency here. Stack Overflow was against hosting content produced by ChatGPT because they were planning to sell all their human-generated content to ChatGPT. They didn't have a negative stance towards AI in general, they had a negative stance towards AI contaminating their valuable training data before they were able to sell it.
 
Upvote
9 (9 / 0)
When I used Stack Overflow, the most useful stuff was the description and additional comments. It wasn’t so much “here’s the answer”, much as looking at the why and the ensuing discussion. I might accept one answer, but the comments others had would influence how I resolved the issue.

What worries me is that the AI will miss the subtleties in proclaiming the one and true answer. We’ll end up with users simply copying and pasting, not understanding why. These users won’t be able to contribute back to SO. Within a decade SO will be about as useful as Yahoo Answers.

This doesn’t surprise me. This is Gracenote all over again.

When CDs came out, the music was there, but not any information of who or what. In the early 1990, the open source CD Database (CDDB) used checksums to identify music tracks and the CD. Volunteers would put entries into a giant database. Programs could use that database whenever someone ripped a CD. Put in a CD into your Mac and iTunes would identify it.

Not even a decade later, CDDB went from open source project to private company and renamed Gracenote. All of the free work volunteers put in was now owned. It was worth over a quarter billion dollars.

There were protests and lawsuits, but in the end, the data was just too valuable to be given away for free. All of this took place way before AI existed.

As we discovered way back in the 16th Century, if something is valuable, people will transverse entire oceans, desserts, and jungles to proclaim ownership over it.
 
Upvote
9 (9 / 0)
Devs get one no-blame free-pass do over from me when they unthinkingly cut and paste from SO - the second offence is not so easily over looked.
If the code works, and does what its intended to do how do you even police this? Unless you have memorized every SO answer, how do you even know its from SO? Are you googling every single line of code to make sure it doesnt show up?
 
Upvote
2 (2 / 0)
AI sucks and this move by SO is stupid, sure, but this suggestion is absolutely a very shitty asshole thing to do as well, so I guess if you don't mind becoming the monster you seek to destroy, go ahead.

All you're doing is causing immediate problems for people who curate and contribute to Stack Overflow and causing future problems for people who rely on answers to get work done.

But yeah, stick it to the company you freely gave your content to to do what they wish with for doing something legal with that content! I guess. Just do us a favor and actually read the terms you accept on the next site you join.

Do tell us how "actually read the terms" would have helped someone who joined up, say, 5 years ago?

Even if the necessary banalities about rights assignment to put SO in a position to make data commercially available were in place; they meant something considerably different back before chatbots took a substantial leap into the "will be a prolific and unreliable confabulator until it replaces you" zone.
 
Upvote
3 (5 / -2)

The Lurker Beneath

Ars Tribunus Militum
6,637
Subscriptor
My only logical solution to this trend is total stop of contributing anything anywhere, deleting all sorts of accounts, and starting to treat internet as read only (if even that...). I have no say in which site is gobbled up next, so I must act preemptively.

The Internet of old was a fun place, this modern internet is crap.

Thanks for reading this comment, which turned out to be my last contribution :)

To be fair, your policy seems to be almost a decade old.
 
Upvote
2 (2 / 0)
Because, at the very same time, people are saying that AI produces shit output, hallucinates, and will get worse and worse over time.

How can THAT put people like you out of work?
Because the people that do the hiring and firing think that AI produces good output, does not hallucinate, and believe it will get better and better over time, despite the evidence.
 
Upvote
3 (4 / -1)

poochyena

Ars Scholae Palatinae
4,917
Subscriptor++
Neat, so they are struggling to fill positions in a field of emergent tech that has very few qualified people because it's new. They need 500 (probably a liberal estimate) AI devs to put out their product that is anticipated to automate 40% of work away, globally. That seems like an equivalent exchange. Definitely sustainable.
automated farm equipment automated away ~90% of jobs. I guess you are big mad about tractors and combines too, right?
Jesus, really?

It's the difference between helping someone like me do their job better vs. helping someone put people like me out of work altogether.

How is that not clear? How is that not obvious?
Those 2 things are the exact same thing. Most tech fields are competitive, and most tech is meant to replace jobs. Most jobs people held 200 years ago were entirely replaced by new tech.
 
Upvote
-2 (4 / -6)
These stories are confusing to me. Not to say they're not 100% true, but man I've never found SO code good enough to just copy and paste in ....

I always tinker with it. I'm guessing my SO google fu was not very good.
I have, but only on TeX.se where some of the major package maintainers (egreg, for example) would provide a lot of detailed help.
 
Upvote
1 (1 / 0)