Show HN: FastGraphRAG – Better RAG using good old PageRank

    from fast_graphrag import GraphRAG

    DOMAIN = "Analyze this story and identify the characters. Focus on how they interact with each other, the locations they explore, and their relationships."

    EXAMPLE_QUERIES = [
        "What is the significance of Christmas Eve in A Christmas Carol?",
        "How does the setting of Victorian London contribute to the story's themes?",
        "Describe the chain of events that leads to Scrooge's transformation.",
        "How does Dickens use the different spirits (Past, Present, and Future) to guide Scrooge?",
        "Why does Dickens choose to divide the story into \"staves\" rather than chapters?"
    ]

    ENTITY_TYPES = ["Character", "Animal", "Place", "Object", "Activity", "Event"]

    grag = GraphRAG(
        working_dir="./book_example",
        domain=DOMAIN,
        example_queries="\n".join(EXAMPLE_QUERIES),
        entity_types=ENTITY_TYPES
    )

    with open("./book.txt") as f:
        grag.insert(f.read())

    print(grag.query("Who is Scrooge?").response)

1. yccheok [19 Nov 24 05:16 UTC] No.42180275

>>42174829 (OP) #

Hi,

I’m currently building a Q&A chatbot and facing challenges in addressing the following scenario:

When a user asks:

"What do you mean in your previous statement?"

How does your framework handle retrieving the correct small subset of "raw knowledge" and integrating it into the LLM for a relevant response?

Without relying on external frameworks, I’ve struggled with this issue: https://www.reddit.com/r/LocalLLaMA/comments/1gtzdid/d_optim...

I’d love to know how your framework solves this and whether it can streamline the process.

Thank you!


2. martinkallstrom [19 Nov 24 06:28 UTC] No.42180549

>>42180275 (TP) #

Have you tried allowing the LLM to decide the use of knowledge retrieval, through tool use or a direct query?

3. dheerkt [19 Nov 24 06:35 UTC] No.42180583

>>42180275 (TP) #

If the user asks such a question, your agent should not invoke the RAG at all, but simply answer from the history. You need to focus on your orchestration step.

Search for ReAct agents; you can build one using either LangGraph or Bedrock Agents.
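
A minimal sketch of that orchestration step, for illustration only. It does not use LangGraph or Bedrock Agents; `llm` and `retrieve` are hypothetical callables standing in for whatever model client and RAG lookup you already have, and the SEARCH/HISTORY prompt wording is an assumption, not something from this thread.

```python
from typing import Callable, List, Tuple

Message = Tuple[str, str]  # (role, text), e.g. ("user", "Who is Scrooge?")


def format_history(history: List[Message]) -> str:
    """Render the conversation as plain 'role: text' lines."""
    return "\n".join(f"{role}: {text}" for role, text in history)


def answer(
    question: str,
    history: List[Message],
    llm: Callable[[str], str],             # hypothetical: prompt in, completion out
    retrieve: Callable[[str], List[str]],  # hypothetical: query in, passages out
) -> str:
    transcript = format_history(history)

    # Step 1: let the model decide whether retrieval is needed at all.
    decision = llm(
        "Decide whether answering the last user message needs a document search.\n"
        "Reply with exactly one word: SEARCH or HISTORY.\n\n"
        f"{transcript}\nuser: {question}"
    ).strip().upper()

    # Step 2: only hit the RAG index when the router asked for it.
    passages = retrieve(question) if decision.startswith("SEARCH") else []
    context = "\n".join(passages)

    # Step 3: answer from the history, plus retrieved context if there is any.
    return llm(
        f"Conversation:\n{transcript}\n\n"
        f"Context:\n{context}\n\n"
        f"user: {question}"
    )
```

With this shape, a follow-up like "What do you mean in your previous statement?" should be routed to HISTORY, so the retriever is never called and the answer comes from the transcript alone.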

4. Tsarp [19 Nov 24 11:40 UTC] No.42182362

>>42180275 (TP) #

After a lot of experimentation, the only thing that worked in a chat-style application was to pass the last 4-5 messages (ideally the entire conversation history) and ask an LLM to summarize the question in the context of the conversation.

Without that, it often failed when users asked something like "Can you expand point 2?" or "Give a detailed example of the above".
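
A rough sketch of that rewrite step, under stated assumptions: `llm` is a hypothetical prompt-in, text-out callable, the prompt wording is made up for illustration, and `max_messages=5` just mirrors the "last 4-5 messages" figure above.

```python
from typing import Callable, List, Tuple

Message = Tuple[str, str]  # (role, text)


def condense_question(
    question: str,
    history: List[Message],
    llm: Callable[[str], str],  # hypothetical: prompt in, completion out
    max_messages: int = 5,
) -> str:
    """Rewrite a follow-up into a standalone question using recent history."""
    recent = history[-max_messages:] if max_messages > 0 else []
    if not recent:
        # Nothing to resolve against, so the question is already standalone.
        return question

    transcript = "\n".join(f"{role}: {text}" for role, text in recent)
    prompt = (
        "Rewrite the final user message as a standalone question, "
        "resolving references such as 'point 2' or 'the above'.\n\n"
        f"{transcript}\nuser: {question}\n\nStandalone question:"
    )
    rewritten = llm(prompt).strip()
    # Fall back to the original text if the model returns nothing usable.
    return rewritten or question
```

The rewritten question, not the raw follow-up, is what then goes to retrieval.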

Current implementation (I have 3 indexes) is to provide the query plus past messages and ask an LLM to break it down into:

- Overall ask
- BM25-optimized question
- Keywords
- Semantic-optimized question

Then perform RAG + rerank, and pass the top N passages along with the overall ask to the second LLM call.
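
The two-call flow could look roughly like the sketch below. Several things here are assumptions rather than the actual setup: `llm` and the per-index search functions are hypothetical callables, the mapping of one breakdown field to each of the three indexes is a guess, and reciprocal rank fusion is swapped in as a simple stand-in for whatever reranker is really used.

```python
from typing import Callable, Dict, List, Tuple

Message = Tuple[str, str]  # (role, text)
FIELDS = ("Overall ask", "BM25 optimized question",
          "Keywords", "Semantic optimized question")


def parse_breakdown(text: str) -> Dict[str, str]:
    """Parse 'Label: value' lines from the first LLM call."""
    out: Dict[str, str] = {}
    for line in text.splitlines():
        label, sep, value = line.partition(":")
        if sep and label.strip() in FIELDS:
            out[label.strip()] = value.strip()
    return out


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Merge ranked lists; stand-in for a real reranker (assumption)."""
    scores: Dict[str, float] = {}
    for ranking in rankings:
        for rank, passage in enumerate(ranking, start=1):
            scores[passage] = scores.get(passage, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda p: -scores[p])


def answer_with_rag(
    query: str,
    history: List[Message],
    llm: Callable[[str], str],                        # hypothetical LLM client
    indexes: Dict[str, Callable[[str], List[str]]],   # field name -> search fn
    top_n: int = 3,
) -> str:
    transcript = "\n".join(f"{role}: {text}" for role, text in history)

    # First LLM call: break the query down using the past messages.
    breakdown = parse_breakdown(llm(
        "Break the user's request down. Answer with one line per label:\n"
        + "\n".join(f"{field}:" for field in FIELDS)
        + f"\n\nPast messages:\n{transcript}\n\nQuery: {query}"
    ))
    overall = breakdown.get("Overall ask", query)

    # Query each index with its own variant, falling back to the overall ask.
    rankings = [search(breakdown.get(field, overall))
                for field, search in indexes.items()]
    passages = reciprocal_rank_fusion(rankings)[:top_n]

    # Second LLM call: top N passages plus the overall ask.
    return llm(f"Question: {overall}\n\nPassages:\n" + "\n".join(passages))
```

Keying `indexes` by field name keeps the routing explicit, e.g. `{"BM25 optimized question": bm25_search, "Keywords": keyword_search, "Semantic optimized question": vector_search}`, where the three search functions are placeholders for your own.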