
418 points speckx | 4 comments
jawns ◴[] No.44974805[source]
Full disclosure: I'm currently in a leadership role on an AI engineering team, so it's in my best interest for AI to be perceived as driving value.

Here's a relatively straightforward application of AI that is set to save my company millions of dollars annually.

We operate large call centers, and agents were previously spending 3-5 minutes after each call writing manual summaries of the calls.

We recently switched to using AI to transcribe and write these summaries. Not only are the summaries better than those produced by our human agents, they also free up the human agents to do higher-value work.

It's not sexy. It's not going to replace anyone's job. But it's a huge, measurable efficiency gain.

replies(39): >>44974847 #>>44974853 #>>44974860 #>>44974865 #>>44974867 #>>44974868 #>>44974869 #>>44974874 #>>44974876 #>>44974877 #>>44974901 #>>44974905 #>>44974906 #>>44974907 #>>44974929 #>>44974933 #>>44974951 #>>44974977 #>>44974989 #>>44975016 #>>44975021 #>>44975040 #>>44975093 #>>44975126 #>>44975142 #>>44975193 #>>44975225 #>>44975251 #>>44975268 #>>44975271 #>>44975292 #>>44975458 #>>44975509 #>>44975544 #>>44975548 #>>44975622 #>>44975923 #>>44976668 #>>44977281 #
1. Shank ◴[] No.44974907[source]
Who reads the summaries? Are they even useful to begin with? Or did this just save everyone 3-5 minutes of meaningless work?
replies(2): >>44974979 #>>44975119 #
2. vosper ◴[] No.44974979[source]
AI reads them and identifies trends and patterns, or answers questions from PMs or others?
replies(1): >>44976538 #
3. doorhammer ◴[] No.44975119[source]
Not the OP, but I did work supporting three massive call centers for an F500 ecom.

It's 100% plausible it's busy work, but it could also be for:

- Categorizing calls into broad buckets to see which issues are trending
- Sentiment analysis
- Identifying surges of some novel/unique issue
- Categorizing calls across vendors and doing sentiment analysis that way (looking for upticks in problem calls related to specific TSPs or whatever)
- etc.

False positives and negatives aren't really a problem once you hit a certain scale because you're just looking for trends. If you find one, you go spot-check it and do a deeper dive to get better accuracy.
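The trend-then-spot-check approach could be sketched roughly like this: compare per-category call counts against a baseline and only flag large relative jumps, ignoring tiny buckets where noise dominates. (The category names, counts, and thresholds below are all made up for illustration; this isn't the commenter's actual pipeline.)

```python
from collections import Counter

def flag_surges(weekly_counts, baseline_counts, ratio=2.0, min_calls=50):
    """Flag categories whose weekly call volume is at least `ratio` times
    the baseline, skipping small buckets below `min_calls` where false
    positives/negatives would dominate."""
    flagged = []
    for category, count in weekly_counts.items():
        baseline = baseline_counts.get(category, 0)
        if count >= min_calls and count >= ratio * max(baseline, 1):
            flagged.append(category)
    return flagged

# Hypothetical weekly buckets derived from call summaries
baseline = Counter({"billing": 120, "shipping": 200, "returns": 90})
this_week = Counter({"billing": 130, "shipping": 520, "returns": 95})

print(flag_surges(this_week, baseline))  # ['shipping']
```

Anything flagged this way would then get the manual deep dive (listening to a sample of the flagged calls) to confirm the trend is real.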

Which is also how you end up with some schlep like me listening to a few hundred calls in a day at 8x speed (back when I was a QA data analyst) to verify the bucketing. And when I was doing it, everything was based on phonetic indexing, which I can't imagine comes anywhere close to LLMs in terms of accuracy, and it still provided a ton of business value at scale.

4. cube00 ◴[] No.44976538[source]
AI writes inaccurate summaries and then consumes its own slop so it can hallucinate the answer to the PM's questions after misreading said slop.

Much like dubbing a videotape multiple times, it's going to get worse as you add more layers of text predictors.