
378 points by todsacerdoti | 7 comments
grey-area No.44984361
The heart of the article is this conclusion, which I think is correct based on first-hand experience with these tools and with teams trying to use them:

So what good are these tools? Do they have any value whatsoever?

Objectively, it would seem the answer is no.

replies(4): >>44984494, >>44984531, >>44984734, >>44984803
dlachausse No.44984531
AI tools absolutely can deliver value for certain users and use cases. The problem is that they’re not magic, they’re a tool and they have certain capabilities and limitations. A screwdriver isn’t a bad tool just because it sucks at opening beer bottles.
replies(1): >>44984676
ptx No.44984676
So what use cases are those?

It seems to me that the limitations of this particular tool make it suitable only in cases where it doesn't matter if the result is wrong and dangerous as long as it's convincing. This seems to be exclusively various forms of forgery and fraud, e.g. spam, phishing, cheating on homework, falsifying research data, lying about current events, etc.

replies(5): >>44984747, >>44984766, >>44984769, >>44985788, >>44987014
1. barbazoo No.44984769
Extracting structured data from unstructured text at runtime. Some models are really good at that and it’s immensely useful for many businesses.
replies(1): >>44985374
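
For concreteness, a minimal Python sketch of this pattern under stated assumptions: call_llm stands in for whatever model API is actually in use, and the Invoice schema is made up. The point is only that the model's output gets parsed and schema-validated before anything downstream trusts it.

    # Minimal sketch: extract fields into a fixed schema and reject anything
    # that does not validate. call_llm is any caller-supplied function that
    # sends a prompt to the model in use and returns its text response.
    import json
    from typing import Callable

    from pydantic import BaseModel, ValidationError

    class Invoice(BaseModel):
        vendor: str
        total: float
        currency: str

    PROMPT = (
        "Extract the vendor name, total amount and currency from the text below.\n"
        'Respond with one JSON object with keys "vendor", "total", "currency" '
        "and nothing else.\n\nText:\n{text}"
    )

    def extract_invoice(text: str, call_llm: Callable[[str], str]) -> Invoice | None:
        raw = call_llm(PROMPT.format(text=text))
        try:
            return Invoice.model_validate(json.loads(raw))
        except (json.JSONDecodeError, ValidationError):
            return None  # malformed or off-schema output is dropped, not used

Rejecting off-schema output rather than trying to repair it is one way to keep the failure mode discussed in the replies below from silently reaching a database.
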
2. Piskvorrr No.44985374
Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

How do you fix that, when the process is literally "we throw an illegible blob at it and data comes out"? This is not even GIGO (garbage in, garbage out); this is "anything in, synthetic garbage out".

replies(2): >>44985807, >>44985865
3. disgruntledphd2 No.44985807
> Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

I mean, this is much less common than people make it out to be. Assuming the relevant context is in the prompt, you can run a bunch of calls and take the majority vote. It's not trivial, but it's definitely doable.

replies(2): >>45010651, >>45037038
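
A sketch of that majority-vote idea, assuming a single-call extractor such as the hypothetical one above; whether the quorum actually filters out bad answers depends on hallucinations being roughly uncorrelated across runs, which is exactly what the next reply disputes.

    # Sketch of majority voting over repeated extraction calls: accept a value
    # only when a clear quorum of runs agrees on it; otherwise flag for review.
    from collections import Counter
    from typing import Callable

    def extract_with_vote(
        text: str,
        extract_once: Callable[[str], str | None],  # one LLM-backed extraction
        runs: int = 5,
        quorum: float = 0.6,
    ) -> str | None:
        answers = [a for a in (extract_once(text) for _ in range(runs)) if a is not None]
        if not answers:
            return None
        value, count = Counter(answers).most_common(1)[0]
        return value if count / runs >= quorum else None  # no quorum: human review
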
4. barbazoo No.44985865
> Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

You gotta watch for that, for sure, but no, that's not an issue we worry about anymore, at least not for how we're using it here. The text being extracted from is not an illegible blob: it's plain text at that point, and of a certain expected kind, which makes it easier. In general, the more isolated and specific the use case, the better the chances of the whole thing working end to end. Open-ended chat is just a disaster; operating on a narrow set of expectations is much more successful.
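
One concrete form of a "narrow set of expectations" is closing the output space entirely, as in this hypothetical sketch where the model may only pick from a fixed label set and anything else is discarded.

    # Sketch of a narrowly scoped task: classification into a closed label set.
    # A label outside the allowed set can never reach downstream code.
    from typing import Callable

    ALLOWED = {"refund_request", "shipping_question", "complaint", "other"}

    def classify_ticket(text: str, call_llm: Callable[[str], str]) -> str:
        prompt = (
            "Classify this support ticket as exactly one of: "
            + ", ".join(sorted(ALLOWED))
            + ". Reply with the label only.\n\n"
            + text
        )
        label = call_llm(prompt).strip().lower()
        return label if label in ALLOWED else "other"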

5. grey-area No.45010651
I really don't think that's doable, because why do you think the majority output is correct? It's just as likely to be a hallucination.

The problem is that the system has no concept of correctness or a world model.

replies(1): >>45011376
6. disgruntledphd2 No.45011376
Assuming that hallucinations are relatively random, it's true. I do believe that they happen less often when you feed the model decent context, though.
7. Piskvorrr No.45037038
I mean, it's obvious to a human inspecting one specific input and output sample, but how do you do this at scale? (Spoiler: cross your fingers and hope, that's how.)
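
One cheap check that does scale, at least for fields that are supposed to be copied verbatim: require the extracted value to occur literally in the source text and route anything that does not to review. A small sketch with made-up sample data:

    # Automated grounding check: a verbatim field is only trusted if the
    # extracted value actually occurs in the source text.
    def grounded(source: str, extracted: dict[str, str]) -> dict[str, bool]:
        text = source.casefold()
        return {field: value.casefold() in text for field, value in extracted.items()}

    sample = "Invoice from Acme GmbH, total due: 1,204.50 EUR by 2024-06-01."
    print(grounded(sample, {"vendor": "Acme GmbH", "total": "9,999.99"}))
    # {'vendor': True, 'total': False}  -> the fabricated total gets flagged

It only covers verbatim fields, not derived ones, but it needs no human per sample.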