What is going on right now?

(catskull.net)

grey-area ◴[22 Aug 25 13:22 UTC] No.44984361[source]▶

The heart of the article is this conclusion, which I think is correct from first-hand experience with these tools and teams trying to use them:

> So what good are these tools? Do they have any value whatsoever?

> Objectively, it would seem the answer is no.

replies(4): >>44984494 #>>44984531 #>>44984734 #>>44984803 #

dlachausse ◴[22 Aug 25 13:36 UTC] No.44984531[source]▶

>>44984361 #

AI tools absolutely can deliver value for certain users and use cases. The problem is that they’re not magic; they’re a tool, with certain capabilities and limitations. A screwdriver isn’t a bad tool just because it sucks at opening beer bottles.

replies(1): >>44984676 #

1. ptx ◴[22 Aug 25 13:49 UTC] No.44984676[source]▶

>>44984531 #

So what use cases are those?

It seems to me that the limitations of this particular tool make it suitable only in cases where it doesn't matter if the result is wrong and dangerous as long as it's convincing. This seems to be exclusively various forms of forgery and fraud, e.g. spam, phishing, cheating on homework, falsifying research data, lying about current events, etc.

replies(5): >>44984747 #>>44984766 #>>44984769 #>>44985788 #>>44987014 #

2. dlachausse ◴[22 Aug 25 13:55 UTC] No.44984747[source]▶

>>44984676 (TP) #

I personally use it as a starting point for research and for summarizing very long articles.

I’m a mostly self-taught hobbyist programmer, so take this with a grain of salt, but it’s also been great for giving me a small snippet of code to use as a starting point for my projects. I wouldn’t just check whatever it generates directly into version control without testing it and figuring out how it works first. It’s not a replacement for my coding skills, but an augmentation of them.

3. ◴[22 Aug 25 13:57 UTC] No.44984766[source]▶

>>44984676 (TP) #

4. barbazoo ◴[22 Aug 25 13:57 UTC] No.44984769[source]▶

>>44984676 (TP) #

Extracting structured data from unstructured text at runtime. Some models are really good at that and it’s immensely useful for many businesses.

replies(1): >>44985374 #

5. Piskvorrr ◴[22 Aug 25 14:52 UTC] No.44985374[source]▶

>>44984769 #

Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

How do you fix that, when the process is literally "we throw an illegible blob at it and data comes out"? This is not even GIGO, this is "anything in, synthetic garbage out"

replies(2): >>44985807 #>>44985865 #

6. disgruntledphd2 ◴[22 Aug 25 15:28 UTC] No.44985788[source]▶

>>44984676 (TP) #

> So what use cases are those?

I think that as software/data people, we tend to underestimate the number of business processes that are repetitive but require natural language parsing. Examples would include supply chain (basically run on Excel files and email). Traditionally, these were basically impossible to automate because reading free-text emails and updating some system based on that was incredibly hard. LLMs make this much, much easier. This is a big opportunity for lots of companies in normal industries (there's lots of it in tech too).

More generally, LLMs are pretty good at document summarisation and question answering, so with some guardrails (proper context, maybe multiple LLM calls involved) this can save people a bunch of time.

Finally, they can be helpful for broad search queries, but this is much much trickier as you'd need to build decent context offline and use that, which (to put it mildly) is a non-trivial problem.

In the tech world, they are really helpful in writing one to throw away. If you have a few ideas, you can now spec them out and get sort-of-working code from an LLM, which lowers the bar to getting feedback and seeing if the idea works. You really do have to throw it away though, which is now much, much cheaper with LLM technology.

I do think that if we could figure out context management better (which is basically decent internal search for a company) then there's a bunch of useful stuff that could be built, but context management is a really, really hard problem so that's not gonna happen any time soon.
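As a rough illustration of the "guardrails" mentioned above for the email-to-system case: a minimal sketch that refuses to update anything unless the model's reply parses into the expected shape. The `ShipmentUpdate` type, its field names, and the JSON reply format are all assumptions made up for this example, not anything from a real system.

```python
import json
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class ShipmentUpdate:
    # Hypothetical record; field names are assumptions for illustration.
    order_id: str
    new_eta: str  # ISO date, e.g. "2025-09-12"


def parse_update(model_reply: str) -> Optional[ShipmentUpdate]:
    """Return a ShipmentUpdate only if the reply is well-formed, else None."""
    try:
        data = json.loads(model_reply)
    except json.JSONDecodeError:
        return None  # the model replied with prose instead of JSON
    if not isinstance(data, dict):
        return None
    order_id = data.get("order_id")
    new_eta = data.get("new_eta")
    if not isinstance(order_id, str) or not isinstance(new_eta, str):
        return None
    try:
        date.fromisoformat(new_eta)  # reject "next week" and similar
    except ValueError:
        return None
    return ShipmentUpdate(order_id, new_eta)


print(parse_update('{"order_id": "PO-7731", "new_eta": "2025-09-12"}'))
print(parse_update("Sure! The ETA is next week."))
```

This only checks the shape of the reply, not whether the values are right, so anything it returns `None` for would still need to go to a person.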

7. disgruntledphd2 ◴[22 Aug 25 15:29 UTC] No.44985807{3}[source]▶

>>44985374 #

> Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

I mean, this is much less common than people make it out to be. Assuming that the context is there it's doable to run a bunch of calls and take the majority vote. It's not trivial but this is definitely doable.
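A minimal sketch of the "run a bunch of calls and take the majority vote" idea, for concreteness. The `extract` callable stands in for whatever model call you actually make; `fake_extract`, the canned answers, and the `min_share` threshold are assumptions invented for the example.

```python
from collections import Counter
from itertools import cycle
from typing import Callable, Optional


def majority_vote(
    extract: Callable[[str], str],
    text: str,
    runs: int = 5,
    min_share: float = 0.6,
) -> Optional[str]:
    """Call `extract` `runs` times; return the most common answer,
    or None if it does not reach `min_share` of the runs."""
    answers = [extract(text).strip() for _ in range(runs)]
    winner, count = Counter(answers).most_common(1)[0]
    return winner if count / runs >= min_share else None


# Stand-in for a real model call (hypothetical): replays canned answers,
# one of which has two digits swapped.
_canned = cycle(["INV-1042", "INV-1042", "INV-1024", "INV-1042", "INV-1042"])


def fake_extract(text: str) -> str:
    return next(_canned)


result = majority_vote(fake_extract, "Please pay invoice INV-1042 by Friday.")
print(result)  # INV-1042 (4 of 5 runs agree)
```

Returning `None` when there is no clear winner gives you a place to route the input to a human instead of guessing.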

replies(2): >>45010651 #>>45037038 #

8. barbazoo ◴[22 Aug 25 15:34 UTC] No.44985865{3}[source]▶

>>44985374 #

> Except when they "extract" something that wasn't in the source. And now what, assuming you can even detect the tainted data at all?

You gotta watch for that for sure, but no, that's not an issue we worry about anymore, at least not for how we're using it here. The text that's being extracted from is not a "BLOB". It's plain text at that point and of a certain, expected kind, so that makes it easier. In general, the more isolated and specific the use case, the better the chances of the whole thing working end to end. Open-ended chat is just a disaster. Operating on a narrow set of expectations is much more successful.
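One cheap check that fits this kind of narrow, plain-text use case is to flag any extracted value that does not appear verbatim in the source. This is a sketch under assumptions: the function name, the field names, and the sample email are made up, and it is not a description of the setup discussed above.

```python
import re
from typing import Dict, List


def _normalize(s: str) -> str:
    # Collapse whitespace and ignore case so trivial formatting
    # differences do not count as mismatches.
    return re.sub(r"\s+", " ", s).strip().casefold()


def ungrounded_fields(source_text: str, extracted: Dict[str, str]) -> List[str]:
    """Return the names of fields whose value is not a substring of the source."""
    haystack = _normalize(source_text)
    return [
        name
        for name, value in extracted.items()
        if _normalize(value) not in haystack
    ]


email = "Hi, please ship 40 pallets to the Rotterdam depot by 12 September."
extracted = {
    "quantity": "40 pallets",
    "destination": "Rotterdam depot",
    "carrier": "DHL",  # not in the email: an invented value
}
print(ungrounded_fields(email, extracted))  # ['carrier']
```

It only catches values that were made up outright. A real substring assigned to the wrong field, or a value the model legitimately reformatted (say, a date), needs a different check.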

9. mooseling ◴[22 Aug 25 17:08 UTC] No.44987014[source]▶

>>44984676 (TP) #

I started a new job recently, and used ChatGPT tons to learn how to use the new tools: python, opencv, fastapi. I had questions that were too complex for a web search, which ChatGPT answered very coherently! I found it a very good tool to use alongside web search, documentation, and trawling through Stack Overflow.

10. grey-area ◴[25 Aug 25 05:54 UTC] No.45010651{4}[source]▶

>>44985807 #

I really don’t think that’s doable, because why do you think the majority output is correct? It’s just as likely to be a hallucination.

The problem is the system has no concept of correctness or world model.

replies(1): >>45011376 #

11. disgruntledphd2 ◴[25 Aug 25 07:53 UTC] No.45011376{5}[source]▶

>>45010651 #

Assuming that hallucinations are relatively random, it's true. I do believe that they happen less often when you feed the model decent context though.
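To put numbers on the "relatively random" assumption: if each call is right with probability p and calls are independent (a strong assumption, and the one in dispute here), the chance that a strict majority of n calls is right is a binomial tail. A quick sketch, with `p_majority_correct` being a name invented for the example:

```python
from math import comb


def p_majority_correct(p: float, n: int) -> float:
    """Probability that more than half of n independent calls are correct,
    given each call is correct with probability p."""
    need = n // 2 + 1  # smallest count that is a strict majority
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(need, n + 1))


print(round(p_majority_correct(0.9, 5), 5))  # 0.99144
print(round(p_majority_correct(0.5, 5), 5))  # 0.5
```

So voting helps a lot when single calls are mostly right and errors are independent, and does nothing at p = 0.5. If the model makes the same mistake on every call, the independence assumption fails and the vote just repeats the error.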

12. Piskvorrr ◴[27 Aug 25 08:54 UTC] No.45037038{4}[source]▶

>>44985807 #

I mean, it is obvious for a human inspecting the one specific input and output sample, but how do you do this at scale? (Spoiler: cross your fingers and hope, that's how)
