Cursor AI support bot invents fake policy and triggers user uproar

tigerhawkvok

Ars Scholae Palatinae
1,124
Subscriptor
Vibe coders using an LLM driven workflow given wrong information by an LLM chatbot that they didn't error-check.

Completely on-brand for vibe coders. (Full disclosure I use Github Copilot for low-stakes refactors, line autocompletes, and trivial API lookups. But I error check statically damnit.)
 
Upvote
273 (278 / -5)
Post content hidden for low score. Show…

Mechjaz

Ars Praefectus
3,401
Subscriptor++
Vibe coders using an LLM driven workflow given wrong information by an LLM chatbot that they didn't error-check.

Completely on-brand for vibe coders. (Full disclosure I use Github Copilot for low-stakes refactors, line autocompletes, and trivial API lookups. But I error check statically damnit.)
LinkedIn is flooded with terrible takes about how we like, won't need version control anymore, mannnn, like GitHub you know? Because our code will just like be the prompts, and the prompts are your code, mannn.

I ran out of characters in the comment field well before I ran out of ways to point out how dumb that was.
 
Upvote
238 (238 / 0)

Fatesrider

Ars Legatus Legionis
25,457
Subscriptor
"Hey! We have no such policy," wrote a Cursor representative in a Reddit reply three hours later. "You're of course free to use Cursor on multiple machines. Unfortunately, this is an incorrect response from a front-line AI support bot."
Narrator: Though the reason for the cancellations had changed, the cancellations continued because coders want humans to help them deal with issues caused by AI-generated vibe coding subs.

So, question: If you can continue to use Cursor on multiple machines, why didn't that particular coder experience the continuity of doing that across multiple machines?

Sounds like they still have a problem, and no current fix for it.
 
Upvote
71 (79 / -8)

Yaiba

Seniorius Lurkius
38
Subscriptor++
While I appreciate Benj's diligent work on the AI beat, I remain disappointed at the uncritical acceptance and parroting of the anthropomorphizing language the industry uses to build its hype train, i.e. "confabulation", "hallucination" etc. Why not "error", "random output", "bad data", or if we want to be technical, "bullshit"?
 
Upvote
328 (347 / -19)

HMSTechnica

Wise, Aged Ars Veteran
161
My engineering director at work recently asked me how I can use AI to help me speed up my development, like using it to do cross-platform development between android and iOS and web, without knowing the other platforms or languages. We already don't have any QA team, and now they're prodding more and more vibe coding. I didn't think much of him before this meeting (too much hearing himself talk, fatuous questions), but now any respect I had is gone.
 
Upvote
195 (196 / -1)

Joshua Pearson

Seniorius Lurkius
4
Subscriptor++
An aspect that seems to be overlooked isn't just that the users believed Sam to be a real person, but that the first level of technical support is presenting falsehoods. I don't care if I am getting information from a person or from an AI. I care that I am getting valid information. The addition of the phrase "We use AI-assisted responses as the first filter for email support" does not eliminate the actual problem.
 
Upvote
250 (251 / -1)

bretayn

Ars Centurion
226
Subscriptor
Narrator: Though the reason for the cancellations had changed, the cancellations continued because coders want humans to help them deal with issues caused by AI-generated vibe coding subs.

So, question: If you can continue to use Cursor on multiple machines, why didn't that particular coder experience the continuity of doing that across multiple machines?

Sounds like they still have a problem, and no current fix for it.

FTA - there was a glitch in their session handling. They fixed… the glitch. The rest will just work itself out.
 
Upvote
52 (53 / -1)

oluseyi

Ars Tribunus Militum
1,513
I never was much of a coder, but I've been vibe coding like crazy. It produces insanity a lot, but if you're careful and focused you can get some real gold out of it. It helps that most people are pretty fucking stupid to begin with and most things out there are complete trash, so the bar is low.
And what do you think the model you're vibe coding with is trained on…?
 
Upvote
162 (164 / -2)
Vibe coders using an LLM driven workflow given wrong information by an LLM chatbot that they didn't error-check.

Completely on-brand for vibe coders. (Full disclosure I use Github Copilot for low-stakes refactors, line autocompletes, and trivial API lookups. But I error check statically damnit.)
So question on that. How would they of checked that the support email response was valid and from a person?
 
Upvote
33 (39 / -6)
My engineering director at work recently asked me how I can use AI to help me speed up my development, like using it to do cross-platform development between android and iOS and web, without knowing the other platforms or languages. We already don't have any QA team, and now they're prodding more and more vibe coding. I didn't think much of him before this meeting (too much hearing himself talk, fatuous questions), but now any respect I had is gone.
Does he have an MBA?
 
Upvote
92 (93 / -1)
So. The real problem here is with all Ai usage. It's a blackbox that end users just get screwed by. The company didn't disclose the response as Ai upfront. Users had no way to reach out to a real person.


The reddit side of locking the thread and deleting the post also is simply wrong and should not of happened.
 
Upvote
119 (123 / -4)

TheBrain0110

Ars Centurion
256
Subscriptor++
The mistake of it is bad.

The mods on the reddit sub deleting it is even worse. Cursor employees are mods in the sub. There has been no announcement if it is was a cursor employee or regular mod that deleted it. There is no mention of it on the pinned topics or in any other location.

I also don’t see that the post would have broken any rules of the sub or any reason it would have been deleted in the first place.

If cursor wants to keep trust they really need to explain why the post was deleted as well as make clear that in the future that will not happen if they can get get mods to agree or if the owners of the sub who I don’t think are cursor employees disagree have their own employees leave as mods and make clear the sub is not officially supported by Cursor.
 
Upvote
185 (185 / 0)

oluseyi

Ars Tribunus Militum
1,513
…should not of….
How would they of…
Hey, I just wanted to let you know that it’s “should have” and “would have.” The increasingly common error “should of” stems from the spoken version of the contraction, “should’ve”, “could’ve”, “would’ve.”

Cheers! 🤓
 
Upvote
169 (177 / -8)
I'm reminded of the old Chappelle's Show PopCopy sketch:
"If a customer brings in a computer disk, look at it and tell them it's the wrong format. If they use apple, tell them we're PC. If they use PC tell them we're apple. And If they have both, tell them we use linux. And if they got that, tell them the computers are down."

Because an ai agent will lie to customers with absolute confidence. It's not malicious, but it also doesn't care. It's not capable of caring.

From a business standpoint, companies rolling out ai agents should consider this as a risk: what if a human doing this job chose to lie, with absolute confidence, to the customer? What damage would occur? For a human you have countermeasures, you can train them or hire better humans, treat them better and pay them more. Or in the worst case you could hold them liable if they damage the company.

If my ai agent decides to say, break the law or damage the company there is no scapegoat. I "hired" an agent who will lie like a sociopath and also doesn't care.
 
Upvote
110 (111 / -1)

Connstruct

Ars Centurion
206
Subscriptor++
I never was much of a coder, but I've been vibe coding like crazy. It produces insanity a lot, but if you're careful and focused you can get some real gold out of it. It helps that most people are pretty fucking stupid to begin with and most things out there are complete trash, so the bar is low.
Then you're still not much of a coder.
 
Upvote
160 (163 / -3)
Post content hidden for low score. Show…

niftykev

Ars Scholae Palatinae
771
While I appreciate Benj's diligent work on the AI beat, I remain disappointed at the uncritical acceptance and parroting of the anthropomorphizing language the industry uses to build its hype train, i.e. "confabulation", "hallucination" etc. Why not "error", "random output", "bad data", or if we want to be technical, "bullshit"?
"made the policy up" "invented"

These things all imply intent. There is no intent in an LLM.
 
Upvote
22 (32 / -10)

mcmnky

Ars Tribunus Militum
2,624
Subscriptor
I love the response from the chatbot. “We limit a given session to a single device as a core security principle. Anyway, you can pay more for the functionality”
How do we know that was the chatbot? Maybe the email was from a person, and the later post "correcting" the email was the actual chatbot?
 
Upvote
19 (19 / 0)

Fred Duck

Ars Tribunus Angusticlavius
7,423
No doubt this would absolutely be frustrating, but as someone working in a Support capacity I have to admit it's a little validating to know that A.I. still can't do my job.
Unfortunately, this doesn't seem to be the most important criteria for management in too many instances.

If a human tech support made up a fake policy, lying to the customer and embarrassing the company, he would be fired.
Don't worry. In an attempt to solve the problem, they'll spend greater and greater sums of money, which is similar to your idea.

Hey, I just wanted to let you know that it’s “should have” and “would have.” The increasingly common error “should of” stems from the spoken version of the contraction, “should’ve”, “could’ve”, “would’ve.”

Cheers! 🤓
Good bot.
 
Upvote
26 (26 / 0)

issor

Ars Praefectus
5,628
Subscriptor
I’ve noticed AWS is using AI for first level support as well. Your first response on every reply is basically a summary of what you said and a promise to look into it, which I guess makes it feel like you’re being listened to and taken care of. It is a bit creepy how it is presented as though someone has taken the time to understand your issue, but in reality things have gone nowhere, just buys them time.

I guess if the summary is wrong it gives you a chance to clarify before a human gets involved.
 
Upvote
51 (51 / 0)

ninjaneer

Ars Scholae Palatinae
634
Subscriptor
If a human tech support made up a fake policy, lying to the customer and embarrassing the company, he would be fired.

Making up fake policies on the spot is par for the course for a lot of tech support, especially low-cost, low-training offshore outfits.
 
Upvote
12 (17 / -5)