(signoz.io)

144 points pranay01 | 1 comments | 27 Sep 25 18:56 UTC | HN request time: 0.197s | source

Show context

gdiamos ◴[28 Sep 25 04:25 UTC] No.45401759[source]▶

LLM app telemetry is important, but I don’t think we have seen the right metrics yet. Nothing has convinced me that they are more useful than modern app telemetry

I don’t think tool calls or prompts or rag hits are it

That’s like saying that C++ app observability is about looking at every sys call and their arguments

Sure, if you are the OS it’s easy to instrument that, but IMO I’d rather just attach to my app and look at the logs

replies(1): >>45402673 #

1. jonnylaw ◴[28 Sep 25 08:22 UTC] No.45402673[source]▶

>>45401759 #

Attaching to the app is impractical to catch regressions in production. LLMs are probabilistic - this means you can have a regression without even changing the code / making a new deployment.

A metric to alert on could be task-completion rate using LLM as a judge or synthetic tests which are run on a schedule. Then the other metrics you mentioned are useful for debugging the problem.

↑

LLM Observability in the Wild – Why OpenTelemetry Should Be the Standard