aliljet ◴[] No.45387614[source]
There's a broad misunderstanding here. Context could be infinite, but the real bottleneck is understanding intent late in a multi-step operation. A human can effectively discard or disregard prior information as the narrow window of focus moves to a new task; LLMs seem incredibly bad at this.

Having more context while remaining unable to focus effectively on the latest task is the real problem.

replies(10): >>45387639 #>>45387672 #>>45387700 #>>45387992 #>>45388228 #>>45388271 #>>45388664 #>>45388965 #>>45389266 #>>45404093 #
tptacek ◴[] No.45388271[source]
Asking, not arguing, but: why can't they? You can give an agent access to its own context and ask it to lobotomize itself like Eternal Sunshine. I just did that with a log ingestion agent (broad search to get the lay of the land, which eats a huge chunk of the context window, then narrow searches for weird stuff it spots, then go back and zap the big log search). I assume this is a normal approach, since someone else suggested it to me.
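A minimal sketch of that idea, assuming an OpenAI-style chat-completions client; the tool names (search_logs, forget_results), the result tagging, and the model choice are all made up for illustration, not how the actual log agent works:

    import json
    from openai import OpenAI  # assumed SDK; any chat API with tool calls works the same way

    client = OpenAI()

    # Two hypothetical tools: a stubbed log search, and a "forget" tool that lets
    # the model blank out earlier search results it no longer needs.
    TOOLS = [
        {"type": "function", "function": {
            "name": "search_logs",
            "description": "Search the logs. Results are tagged [result N] so they can be forgotten later.",
            "parameters": {"type": "object",
                           "properties": {"query": {"type": "string"}},
                           "required": ["query"]}}},
        {"type": "function", "function": {
            "name": "forget_results",
            "description": "Erase earlier search results from your context once you are done with them.",
            "parameters": {"type": "object",
                           "properties": {"result_ids": {"type": "array", "items": {"type": "integer"}}},
                           "required": ["result_ids"]}}},
    ]

    def search_logs(query: str) -> str:
        return f"...lines matching {query!r}..."  # stand-in for a real log backend

    def run(messages: list[dict]) -> str:
        next_id = 0
        while True:
            resp = client.chat.completions.create(model="gpt-4o", messages=messages, tools=TOOLS)
            msg = resp.choices[0].message
            if not msg.tool_calls:  # no more tool use: final answer
                return msg.content
            messages.append({"role": "assistant", "content": msg.content,
                             "tool_calls": [tc.model_dump() for tc in msg.tool_calls]})
            for tc in msg.tool_calls:
                args = json.loads(tc.function.arguments)
                if tc.function.name == "search_logs":
                    next_id += 1
                    out = f"[result {next_id}] " + search_logs(args["query"])
                else:
                    # forget_results: overwrite the tagged tool messages in place, so the
                    # big search output stops being re-sent to the model on every turn.
                    for m in messages:
                        if m.get("role") == "tool" and any(
                                m["content"].startswith(f"[result {rid}]")
                                for rid in args["result_ids"]):
                            m["content"] = "(forgotten)"
                    out = "forgotten"
                messages.append({"role": "tool", "tool_call_id": tc.id, "content": out})

The point is just that the model sees tagged results and can call the forget tool on them itself, so the broad search stops occupying the window once the narrow follow-ups are done.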
replies(2): >>45388348 #>>45388456 #
simonw ◴[] No.45388348[source]
This is also the idea behind sub-agents. Claude Code answers questions about things like "where is the code that does X" by firing up a separate LLM running in a fresh context, posing it the question and having it answer back when it finds the answer. https://simonwillison.net/2025/Jun/2/claude-trace/
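This isn't Claude Code's actual implementation, just a sketch of the sub-agent pattern under the same assumptions as above (OpenAI-style client, invented function name, model chosen arbitrarily): the tool call builds a fresh message list, lets a second model do the work there, and only the short answer returns to the parent's context.

    import subprocess
    from openai import OpenAI  # assumed SDK, as in the sketch above

    client = OpenAI()

    def codebase_question(question: str, repo_dir: str = ".") -> str:
        """Hypothetical tool exposed to the main agent; runs in its own context window."""
        # The sub-agent gets its own message list, so whatever it reads while
        # poking around the repo never lands in the parent agent's context.
        listing = subprocess.run(["git", "ls-files"], cwd=repo_dir,
                                 capture_output=True, text=True).stdout
        sub_messages = [
            {"role": "system", "content": "Answer questions about this repository concisely."},
            {"role": "user", "content": f"Files:\n{listing}\n\nQuestion: {question}"},
        ]
        resp = client.chat.completions.create(model="gpt-4o-mini", messages=sub_messages)
        return resp.choices[0].message.content  # only this short string reaches the parent

A fuller version would give the sub-agent its own grep/read tools and loop until it finds the answer, but the context isolation works the same way.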
replies(2): >>45388378 #>>45388417 #
tptacek ◴[] No.45388378[source]
I'm playing with that too (everyone should write an agent; basic sub-agents are incredibly simple --- just tool calls that can make their own LLM calls, or even just a tool call that runs in its own context window). What I like about Eternal Sunshine is that the LLM can just make decisions about what context stuff matters and what doesn't, which is a problem that comes up a lot when you're looking at telemetry data.