greatartiste ◴[] No.41901335[source]
For a human who deals with student work or reads job applications, spotting AI-generated work quickly becomes trivially easy. The text seems to use the same general framework (although words are swapped around), and we also see what I call 'word of the week', where whichever 'AI' engine gets hung up on a particular English word, often an unusual one, and uses it at every opportunity. It isn't long before you realise that the adage that this is just autocomplete on steroids is true.
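A rough sketch of that 'word of the week' tell in Python, for what it's worth; the word list and threshold below are made-up assumptions rather than a tested detector:

```python
# Sketch of the "word of the week" tell: flag rare words that suddenly recur
# across many independent submissions. Word list and threshold are illustrative
# assumptions, not a calibrated detector.
from collections import Counter
import re

RARE_WORDS = {"delve", "tapestry", "multifaceted", "intricate", "pivotal"}

def flag_word_of_the_week(submissions: list[str], min_docs: int = 3) -> dict[str, int]:
    """Return rare words appearing in at least `min_docs` separate submissions."""
    doc_counts: Counter[str] = Counter()
    for text in submissions:
        words = set(re.findall(r"[a-z']+", text.lower()))
        for w in words & RARE_WORDS:
            doc_counts[w] += 1
    return {w: n for w, n in doc_counts.items() if n >= min_docs}

essays = [
    "In this essay we delve into the rich tapestry of Victorian trade policy.",
    "The novel tries to delve into a tapestry of multifaceted relationships.",
    "Here I delve into the tapestry of modern supply chains.",
]
print(flag_word_of_the_week(essays))  # flags 'delve' and 'tapestry' (each in 3 essays)
```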

However, programming a computer to do this isn't easy. In a previous job I dealt with plagiarism detectors and soon realised how much garbage they were (and also how easily they are fooled - but that is another story). The staff soon realised what garbage these tools are, so if a student accused of plagiarism decided to argue back, the accusation would be quietly dropped.

replies(14): >>41901440 #>>41901484 #>>41901662 #>>41901851 #>>41901926 #>>41901937 #>>41902038 #>>41902121 #>>41902132 #>>41902248 #>>41902627 #>>41902658 #>>41903988 #>>41906183 #
acchow ◴[] No.41901484[source]
> For a human who deals with student work or reads job applications spotting AI generated work quickly becomes trivially easy. Text seems to use the same general framework (although words are swapped around) also we see what I call 'word of the week'

It's easy to catch people who aren't making the slightest effort to avoid getting caught, right? I could instead feed a corpus of my own writing to ChatGPT and ask it to write in my style.
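A minimal sketch of that approach, using the OpenAI Python SDK; the model name and prompt wording here are assumptions, and any chat model with a large enough context window would do:

```python
# Sketch: ask a chat model to write in the style of supplied writing samples.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def write_in_my_style(my_samples: list[str], assignment: str, model: str = "gpt-4o") -> str:
    """Ask the model to imitate the style of the supplied writing samples."""
    corpus = "\n\n---\n\n".join(my_samples)
    response = client.chat.completions.create(
        model=model,  # illustrative choice, not an endorsement of a specific model
        messages=[
            {"role": "system",
             "content": "Imitate the author's writing style as closely as possible: "
                        "sentence length, vocabulary, punctuation habits, and quirks."},
            {"role": "user",
             "content": f"Samples of my writing:\n\n{corpus}\n\n"
                        f"Write the following in my style:\n{assignment}"},
        ],
    )
    return response.choices[0].message.content
```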

replies(1): >>41901583 #
hau ◴[] No.41901583[source]
I don't believe detection is possible at all if any effort is made beyond prompting a chat-like interface to "generate X". Given a hand-crafted corpus of text, even current LLMs could produce perfect style transfer for a generated continuation. If someone believes it's trivially easy to detect, then they have absolutely no idea what they are dealing with.

I assume most people would make the least amount of effort and simply prompt a chat interface to produce some text; such text is rather detectable. I would like to see some experiments even for this type of detection, though.
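One cheap experiment along those lines: compare perplexity under a small open model, on the (debatable) assumption that low-effort chat output is more statistically predictable than human prose. The model choice and any threshold below are assumptions, not calibrated values:

```python
# Sketch of a perplexity-based check using GPT-2 via Hugging Face transformers.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def perplexity(text: str) -> float:
    """Perplexity of `text` under GPT-2; lower roughly means more predictable."""
    enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
    with torch.no_grad():
        out = model(**enc, labels=enc["input_ids"])
    return float(torch.exp(out.loss))

# Eyeball the gap between known-human and known-generated samples before trusting
# any threshold; this is an experiment, not a verdict.
for label, sample in [
    ("human", "Ye cannae shove yer granny aff a bus, as my gran used to say."),
    ("model", "In conclusion, it is important to note that technology has both advantages and disadvantages."),
]:
    print(label, round(perplexity(sample), 1))
```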

replies(1): >>41901673 #
hnlmorg ◴[] No.41901673[source]
Are you then plagiarising if the LLM is just regurgitating stuff you’d personally written?

The point of these detectors is to spot stuff the students didn’t research and write themselves. But if the corpus is your own written material then you’ve already done the work yourself.

replies(2): >>41901696 #>>41901754 #
throwaway290 ◴[] No.41901696{3}[source]
An LLM is just regurgitating stuff as a matter of principle. You can request someone else's style; people who are easy to detect simply don't do that. But they will learn quickly.
replies(2): >>41902120 #>>41903123 #
A4ET8a8uTh0 ◴[] No.41902120{4}[source]
Yep, sometimes with fun results. I occasionally amuse myself now by asking for X in the writing style of fictional figure Y. It does have its moments.