Most active commenters

simonw(4)

Claude 3.7 Sonnet and Claude Code

(www.anthropic.com)

Show context

bcherny ◴[24 Feb 25 19:04 UTC] No.43163488[source]▶

Hi everyone! Boris from the Claude Code team here. @eschluntz, @catherinewu, @wolffiex, @bdr and I will be around for the next hour or so and we'll do our best to answer your questions about the product.

replies(82): >>43163527 #>>43163532 #>>43163549 #>>43163554 #>>43163555 #>>43163576 #>>43163585 #>>43163588 #>>43163589 #>>43163592 #>>43163593 #>>43163632 #>>43163642 #>>43163664 #>>43163677 #>>43163733 #>>43163758 #>>43163789 #>>43163803 #>>43163813 #>>43163821 #>>43163893 #>>43163909 #>>43163915 #>>43163921 #>>43163957 #>>43163958 #>>43163992 #>>43164069 #>>43164089 #>>43164102 #>>43164103 #>>43164104 #>>43164111 #>>43164127 #>>43164158 #>>43164329 #>>43164353 #>>43164424 #>>43164482 #>>43164514 #>>43164585 #>>43164616 #>>43164768 #>>43164797 #>>43164819 #>>43164899 #>>43165002 #>>43165057 #>>43165065 #>>43165088 #>>43165091 #>>43165187 #>>43165308 #>>43165355 #>>43165409 #>>43165468 #>>43165499 #>>43165516 #>>43165570 #>>43165578 #>>43165592 #>>43165836 #>>43165884 #>>43165965 #>>43165976 #>>43165995 #>>43166183 #>>43166711 #>>43166748 #>>43167130 #>>43167804 #>>43168626 #>>43168836 #>>43169047 #>>43169107 #>>43169119 #>>43169294 #>>43169310 #>>43173097 #>>43174353 #>>43192161 #

antirez ◴[24 Feb 25 19:50 UTC] No.43164089[source]▶

>>43163488 #

One of the silver bullets of Claude, in the context of coding, is that it does NOT use RAG when you use it via the web interface. Sure, you burn your tokens but the model sees everything and this let it reply in a much better way. Is Claude Code doing the same and just doing document-level RAG, so that if a document is relevant and if it fits, all the document will be put inside the context window? I really hope so! Also, this means that splitting large code bases into manageable file sizes will make more and more sense. Another Q: is the context size of Sonnet 3.7 the same of 3.5? Btw Thanks you so much for Claude Sonnet, in the latest months it changed the way I work and I'm able to do a lot more, now.

replies(1): >>43164253 #

bcherny ◴[24 Feb 25 20:03 UTC] No.43164253[source]▶

>>43164089 #

Right -- Claude Code doesn't use RAG currently. In our testing we found that agentic search out-performed RAG for the kinds of things people use Code for.

replies(1): >>43164503 #

1. marlott ◴[24 Feb 25 20:23 UTC] No.43164503[source]▶

>>43164253 #

Interesting - can you elaborate a little on what you mean by agentic search here?

replies(2): >>43164993 #>>43166130 #

2. antirez ◴[24 Feb 25 21:13 UTC] No.43164993[source]▶

>>43164503 (TP) #

I guess it's what sometimes it's called "self RAG", that is, the agent looks inside the files how a human would be to find that's relevant.

replies(1): >>43165401 #

3. kadushka ◴[24 Feb 25 21:54 UTC] No.43165401[source]▶

>>43164993 #

As opposed to vector search, or…?

replies(2): >>43165926 #>>43166422 #

4. FeepingCreature ◴[24 Feb 25 22:54 UTC] No.43165926{3}[source]▶

>>43165401 #

To my knowledge these are the options:

1. RAG: A simple model looks at the question, pulls up some associated data into the context and hopes that it helps.

2. Self-RAG: The model "intentionally"/agentically triggers a lookup for some topic. This can be via a traditional RAG or just string search, ie. grep.

3. Full Context: Just jam everything in the context window. The model uses its attention mechanism to pick out the parts it needs. Best but most expensive of the three, especially with repeated queries.

Aider uses kind of a hybrid of 2 and 3: you specify files that go in the context, but Aider also uses Tree-Sitter to get a map of the entire codebase, ie. function headers, class definitions etc., that is provided in full. On that basis, the model can then request additional files to be added to the context.

replies(1): >>43166736 #

5. simonw ◴[24 Feb 25 23:20 UTC] No.43166130[source]▶

>>43164503 (TP) #

Since the Claude Code docs suggest installing Ripgrep, my guess is that they mean that Claude Code often runs searches to find snippets to improve in the context.

I would argue that this is still RAG. There's a common misconception (or at least I think it's a misconception) that RAG only counts if you used vector search - I like to expand the definition of RAG to include non-vector search (like Ripgrep in this case), or any other technique where you use Retrieval techniques to Augment the Generation phase.

IR (Information Retrieval) has been around for many decades before vector search become fashionable: https://en.wikipedia.org/wiki/Information_retrieval

replies(2): >>43168083 #>>43168903 #

6. numba888 ◴[24 Feb 25 23:59 UTC] No.43166422{3}[source]▶

>>43165401 #

Does it make sense to use vector search for code? It's more for vague texts. In the code relevant parts can be found by exact name match. (in most cases. both methods aren't exclusive)

replies(1): >>43166718 #

7. simonw ◴[25 Feb 25 00:40 UTC] No.43166718{4}[source]▶

>>43166422 #

Vector search for code can be quite interesting - I've used it for things like "find me code that downloads stuff" and it's worked well. I think text search is usually better for code though.

8. kadushka ◴[25 Feb 25 00:43 UTC] No.43166736{4}[source]▶

>>43165926 #

I'm still not sure I get the difference between 1 and 2. What is "pulls up some associated data into the context" vs ""intentionally"/agentically triggers a lookup for some topic"?

replies(2): >>43168010 #>>43169602 #

9. throwaway314155 ◴[25 Feb 25 03:58 UTC] No.43168010{5}[source]▶

>>43166736 #

1. Tends to use embeddings with a similarity search. Sometimes called "retrieval". This is faster but similarity search doesn't alway work quite as well as you might want it to.

2. Instead lets the agent decide what to bring into context by using tools on the codebase. Since the tools used are fast enough, this gives you effectively "verified answers" so long as the agent didn't screw up its inputs to the tool (which will happen, most likely).

10. wegfawefgawefg ◴[25 Feb 25 04:08 UTC] No.43168083[source]▶

>>43166130 #

rag is an acronym with a pinned meaning now. just like the word drone. drone didnt really mean drone, but drone means drone now. no amount of complaining will fix it. :[

11. jcheng ◴[25 Feb 25 06:39 UTC] No.43168903[source]▶

>>43166130 #

I agree that retrieval can take many forms besides vector search, but do we really want to call it RAG if the model is directing the search using a tool call? That like an important distinction to me and the name "agentic search" makes a lot more sense IMHO.

replies(1): >>43169099 #

12. simonw ◴[25 Feb 25 07:12 UTC] No.43169099{3}[source]▶

>>43168903 #

Yes, I think that's RAG. It's Retrieval Augmented Generation - you're retrieving content to augment the generation.

Who cares if you used vector search for the retrieval?

The best vector retrieval implementations are already switching to a hybrid between vector and FTS, because it turns out BM25 etc is still a better algorithm for a lot of use-cases.

"Agentic search" makes much less sense to me because the term "agentic" is so incredibly vague.

replies(1): >>43169580 #

13. regularfry ◴[25 Feb 25 08:45 UTC] No.43169580{4}[source]▶

>>43169099 #

I think it depends who "you" is. In classic RAG the search mechanism is preordained, the search is done up front and the results handed to the model pre-baked. I'd interpret "agentic search" as anything where the model has potentially a collection of search tools that it can decide how to use best for a given query, so the search algorithm, the query, and the number of searches are all under its own control.

replies(2): >>43174140 #>>43183513 #

14. ◴[25 Feb 25 08:48 UTC] No.43169602{5}[source]▶

>>43166736 #

15. jcheng ◴[25 Feb 25 16:40 UTC] No.43174140{5}[source]▶

>>43169580 #

Exactly. Was the extra information pushed to the model as part of the query? It’s RAG. Did the model pull the extra information in via a tool call? Agentic search.

replies(1): >>43186485 #

16. simonw ◴[26 Feb 25 13:39 UTC] No.43183513{5}[source]▶

>>43169580 #

This is a really useful definition of "agentic search", thanks.

17. regularfry ◴[26 Feb 25 18:24 UTC] No.43186485{6}[source]▶

>>43174140 #

That's far clearer. Yes.

↑