←back to thread

177 points Runonthespot | 1 comments | | HN request time: 0s | source
Show context
CuriouslyC ◴[] No.45161101[source]
I actually have a WIP library for this, the indexing server isn't where I want it just yet, but I have an entire agent toolkit that does this stuff, and the indexing server is quite advance, with self-tuning, raptor/lsp integration, solves for optimal result set using knapsack, etc.

https://github.com/sibyllinesoft/grimoire

replies(1): >>45162275 #
threecheese ◴[] No.45162275[source]
I have to know, what is the Lens SPI? The link in your readme is broken, and Kagi results for this cannot possibly be right.
replies(1): >>45162926 #
1. CuriouslyC ◴[] No.45162926[source]
Lens is basically a rust local first mmapped file base search store, it combines RAPTOR with LSP, semantic vectors and a dual dense/sparse encoding, and can learn a function over those to tune the weights of the relevance sources adaptively per query using your data. It also uses linear programming to select an "efficient" set of results that minimizes mutual information between result atoms -- regular rag/rerank pipelines just dump the top K, but those often have a significant amount of overlap so you bloat context for no benefit.