←back to thread

230 points taikon | 1 comments | | HN request time: 0.195s | source
1. mentalgear ◴[] No.42547671[source]
It has come to the point that we need benchmarks for (Graph)-Rag systems now, same as we have for pure LLMs. However vendors will certainly then optimize for the popular ones, so we need a good mix of public, private and dynamic eval datasets.