←back to thread

469 points samuelstros | 2 comments | | HN request time: 0s | source
Show context
dingnuts[dead post] ◴[] No.44998611[source]
[flagged]
1. BoorishBears ◴[] No.44998656[source]
If you want to be really stringent, RAG originally referred to going from user query to retrieving information directly based on the query then passing it to an LLM: With CC the LLM is taking the raw user query then crafting its own searches

But realistically lots of RAG systems have LLM calls interleaved for various reasons, so what they probably mean it not doing the usual chunking + embeddings thing.

replies(1): >>44998746 #
2. theptip ◴[] No.44998746[source]
Yeah, TFA clearly explains their point. They mean RAG=vector search, and contrast this with tool calling (eg Grep).