←back to thread

358 points andrewstetsenko | 1 comments | | HN request time: 0.255s | source
Show context
taysix ◴[] No.44360808[source]
I had a fun result the other day from Claude. I opened a script in Zed and asked it to "fix the error on line 71". Claude happily went and fixed the error on line 91....

1. There was no error on line 91, it did some inconsequential formatting on that line 2. More importantly, it just ignored the very specific line I told it to go to. It's like I was playing telephone with the LLM which felt so strange with text-based communication.

This was me trying to get better at using the LLM while coding and seeing if I could "one-shot" some very simple things. Of course me doing this _very_ tiny fix myself would have been faster. Just felt weird and reinforces this idea that the LLM isn't actually thinking at all.

replies(4): >>44360819 #>>44360879 #>>44360917 #>>44363593 #
toephu2 ◴[] No.44360917[source]
Sounds like operator error to me.

You need to give LLMs context. Line number isn't good context.

replies(3): >>44361316 #>>44364260 #>>44365210 #
meepmorp ◴[] No.44361316[source]
> Line number isn't good context.

a line number is plenty of context - it's directly translatable into a range of bytes/characters in the file

replies(2): >>44361838 #>>44362459 #
mediaman ◴[] No.44361838[source]
It's a tool. It's not a human. A line number works great for humans. Today, they're terrible for LLMs.

I can choose to use a screwdriver to hammer in a nail and complain about how useless screwdrivers are. Or I can realize when and how to use it.

We (including marketing & execs) have made a huge mistake in anthropomorphizing these things, because then we stop treating them like tools that have specific use cases to be used in certain ways, and more like smart humans that don't need that.

Maybe one day they'll be there, but today they are screwdrivers. That doesn't make them useless.

replies(1): >>44363016 #
1. skydhash ◴[] No.44363016[source]
Check the whole ecosystem around editors, grep tools, debuggers, linting and build tools. One common thing about all of this is line (and column) number so you can integrate them together if you want to automate stuff. Like jumping to errors (quickfix in vim,…), search all files and jump to the occurrences (grep mode in emacs,…), etc…