
230 points taikon | 2 comments | source
dcreater ◴[] No.42546890[source]
Yet another RAG/knowledge graph implementation.

At this point, the onus is on the developer to prove its value through A/B comparisons against traditional RAG. No person or team has the bandwidth to try out this (n + 1)th solution.

replies(2): >>42546979 #>>42550357 #
ertdfgcvb ◴[] No.42546979[source]
I enjoy the explosion of tools; only time will tell which ones stand the test of time. This is my day job, so I never get tired of new tools, but I can see how non-industry folks would find it overwhelming.
replies(2): >>42547052 #>>42551832 #
trees101 ◴[] No.42547052[source]
Can you expand on that? Where do the big enterprise orgs' products fit in, e.g. Microsoft's and Google's? What are the leading providers as you see them? As an outsider it is bewildering: first I hear that llama_index is good, then I hear that it's overcomplicated slop. What sources or resources are reliable on this? How can we develop anything that will still stand in 12 months' time?
replies(4): >>42547244 #>>42547418 #>>42548475 #>>42551860 #
ankit219 ◴[] No.42548475[source]
> How can we develop anything that will still stand in 12 months time?

At the pace things are moving, likely nothing will. You will have to keep making changes as you see newer things. One thing in your favor (arguably) is that every technique is highly dependent on the dataset and the problem you are solving, so even if you don't have the latest one implemented, you will be okay as long as your evals and metrics are good. If this helps: skip the details, understand the basics, and go for your own implementation. One thing to look out for is new SOTA LLM releases and the jumps in capability. E.g., they did not announce it, but 4o started doing very well on vision (GPT-4 was okay; 4o is empirically quite a bit better). These jumps help when you update your pipeline.
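The "as long as your evals and metrics are good" advice can be sketched concretely. Below is a minimal, hypothetical eval harness (the `toy_pipeline` function and the exact-match metric are illustrative assumptions, not anyone's actual stack): score your pipeline on a fixed question/answer set, and re-run the same score whenever you swap in a newer technique or model.

```python
# Minimal sketch of a dataset-specific eval harness: score a pipeline
# against a small set of (question, expected answer) pairs, so pipeline
# changes (new model, new retrieval step) can be compared on one metric.

def exact_match_score(pipeline, eval_set):
    """Fraction of eval questions whose answer contains the expected string."""
    hits = 0
    for question, expected in eval_set:
        answer = pipeline(question)
        if expected.lower() in answer.lower():
            hits += 1
    return hits / len(eval_set)

# Toy pipeline standing in for a real RAG system (hypothetical).
def toy_pipeline(question):
    canned = {"capital of France?": "The capital of France is Paris."}
    return canned.get(question, "I don't know.")

eval_set = [
    ("capital of France?", "Paris"),
    ("capital of Spain?", "Madrid"),
]

print(exact_match_score(toy_pipeline, eval_set))  # 0.5
```

The point is the stable yardstick, not the metric itself: with a fixed eval set, "should we adopt technique n + 1?" becomes a number comparison instead of a guess.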

replies(1): >>42549064 #
dartos ◴[] No.42549064[source]
Well, new LLMs keep coming out at a rapid rate, but since they are all trying to model language, they should be fairly interchangeable and will likely converge.

It’s not hard for a product to swap the underlying LLM for a given task.

replies(1): >>42549390 #
ankit219 ◴[] No.42549390[source]
I meant not a jump in text-generation ability, but something like adding a completely new modality. With 4o, you can have a multimodal embedding space and provide more relevant context to a model in fewer tokens (and with higher accuracy). Ideally everyone gets there eventually, but upgrading your pipeline is about getting the latest functionality faster, not just slightly better generation.
replies(1): >>42550339 #
dartos ◴[] No.42550339[source]
Well they did.

Then Google did.

Then llava.

The issue is that this technology has no moat (other than the cost to create models and datasets).

There’s not a lot of secret sauce you can use that someone else can’t trivially replicate, given the resources.

It’s going to come down to good ol’ product design and engineering.

The issue is OpenAI doesn’t seem to care about what their users want. (I don’t think their users know what they want either, but that’s another discussion.)

They want more money to make bigger models in the hope that nobody else can or will.

They want to achieve regulatory capture as their moat.

For all their technical abilities at scaling LLM training and inference, I don’t get the feeling that they have great product direction.