The real problem is having more context but still being unable to focus effectively on the latest task.
Coding agents choke on our big C++ code-base pretty spectacularly if asked to reference large files.
I have multiple things I'd love LLMs to attempt to do, but the context window is stopping me.
In fact, I've found LLMs are reasonable at the simple task of refactoring a large file into smaller components, with documentation on what each portion does, even if they can't take in the full context immediately. Doing this then helps the LLM later. I'm also of the opinion that we should be making codebases LLM-compatible, so when the problem comes up I point the LLM that way for 10 minutes and then get back to the actual task once the codebase is in a more reasonable state.
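To illustrate what that kind of refactor usually leaves behind (the file and function names here are hypothetical, just a sketch of the pattern): each extracted piece gets a short header comment saying what it owns and what moved elsewhere, which is exactly the part an LLM can skim later without loading the original monolith:

    // order_pricing.h -- hypothetical result of splitting a huge order.cpp.
    // Responsibility: subtotal math only. Discount and tax rules moved to
    // their own headers; persistence stayed behind in order_store.h.
    #pragma once

    #include <vector>

    struct LineItem {
        double unit_price;
        int quantity;
    };

    // Sums unit_price * quantity over all items; currency and rounding
    // concerns are documented as out of scope for this component.
    double ComputeSubtotal(const std::vector<LineItem>& items);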
In C++, I could see it getting smarter about first checking the .h files, or just grepping for function documentation, before actually trying to pull out parts of the file.
Well, not so much the project organization stuff - it wants to cram everything into one header and has to be browbeaten into keeping implementations out of headers (the split I mean is sketched below).
But language semantics? It's pretty great at those. And when it screws up it's also really good at interpreting compiler error messages.
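For reference, this is the split I have to keep pushing it toward (names are made up): declarations and doc comments in the header, definitions in the .cpp, so the header stays cheap for any tool or agent to read.

    // widget.h -- declarations and doc comments only.
    #pragma once

    #include <string>

    class Widget {
    public:
        explicit Widget(std::string name);

        // Renders to an XML-ish string; the body lives in widget.cpp.
        std::string Render() const;

    private:
        std::string name_;
    };

    // widget.cpp -- definitions stay here, out of the header.
    #include "widget.h"

    #include <utility>

    Widget::Widget(std::string name) : name_(std::move(name)) {}

    std::string Widget::Render() const {
        return "<widget name=\"" + name_ + "\"/>";
    }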
The replies of "well, just change the situation so context doesn't matter" are irrelevant and off-topic. The rationalizations even more so.
I think with appropriate instructions in the system prompt it could probably work on this code-base more like I do: heavy use of Ctrl-, in Visual Studio to jump around and read only the relevant portions.
Tools like Aider create a code map that basically indexes code into a small context, which I think is similar to what we humans do when we try to understand a large codebase.
I'm not sure if Aider can then load only portions of a huge file on demand, but it seems like that should work pretty well.
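As a toy illustration of what I mean by indexing into a small context (my own sketch, not Aider's actual mechanism), even something this crude shrinks a file down to its declaration lines:

    // codemap.cpp -- toy sketch: keep only declaration-looking lines so a
    // model sees what exists in a file without loading any function bodies.
    #include <fstream>
    #include <iostream>
    #include <string>

    int main(int argc, char** argv) {
        if (argc < 2) {
            std::cerr << "usage: codemap <header-or-source-file>\n";
            return 1;
        }
        std::ifstream in(argv[1]);
        if (!in) {
            std::cerr << "could not open " << argv[1] << '\n';
            return 1;
        }
        std::string line;
        int lineno = 0;
        while (std::getline(in, line)) {
            ++lineno;
            // Crude heuristic: a prototype usually has '(' and ends with ';'.
            bool has_paren = line.find('(') != std::string::npos;
            bool ends_semi = !line.empty() && line.back() == ';';
            if (has_paren && ends_semi)
                std::cout << lineno << ": " << line << '\n';
        }
    }

Run over a big header, that gives the model a table of contents instead of the whole file; real tools do this far more robustly with actual parsers.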
That said, some of the models out there (Gemini 2.5 Pro, for example) support a 1M-token context; it's just going to be expensive, and all that input will still probably degrade the model's output somewhat.