Show HN: I replaced vector databases with Git for AI memory (PoC)


BenoitP ◴[21 Aug 25 07:25 UTC] No.44970042[source]▶

I'm failing to grasp how it solves or replaces what vector DBs were created for in the first place (high-dimensional neighborhood searching, where the space to be searched grows as distance^dimension).

replies(4): >>44970063 #>>44970064 #>>44970076 #>>44970631 #

alexmrv ◴[21 Aug 25 07:28 UTC] No.44970063[source]▶

>>44970042 #

Super simplistic example, but say I mention my daughter, who is 9.

Then I mention she is 10.

A few years later she is 12, but now I call her by her name.

I have struggled to get any of the RAG approaches to handle this effectively. It is also 3 entries, but 2 of them are no longer useful; they are nothing but noise in the system.
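To make the noise concrete, here is a rough sketch of those three entries and a "latest entry wins" filter. The `Memory` record, the `current_facts` helper, and the years are all made up for illustration; this is not how any particular RAG stack or the PoC works.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Memory:
    subject: str    # who the statement is about
    attribute: str  # which property was mentioned
    value: str      # what was said
    recorded: int   # year the statement was made (hypothetical)


# The three mentions from the example, with assumed years.
log = [
    Memory("daughter", "age", "9", 2022),
    Memory("daughter", "age", "10", 2023),
    Memory("daughter", "age", "12", 2025),
]


def current_facts(entries):
    """Keep only the newest entry per (subject, attribute) pair."""
    latest = {}
    for m in sorted(entries, key=lambda m: m.recorded):
        # Later entries overwrite earlier ones for the same key.
        latest[(m.subject, m.attribute)] = m
    return list(latest.values())


facts = current_facts(log)
print(len(log), "entries stored,", len(facts), "still useful")
```

A plain similarity search would happily return all three entries. Note that this sketch does not handle the harder part, which is recognising that her name and "my daughter" refer to the same subject.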

replies(5): >>44970126 #>>44970254 #>>44970511 #>>44970533 #>>44974081 #

PeterStuer ◴[21 Aug 25 08:46 UTC] No.44970511[source]▶

>>44970063 #

That is because basic RAG is not very useful as a long-term knowledge base. You have to actively annotate and transform data for it to become useful knowledge. I have the same problem in the regulation domain, which also constantly evolves.

In your case, you do not want to store the age as a fact without context. It is better to transform the relative fact (age) into an absolute fact (year of birth), or to contextualize it enough that it becomes absolute (age 10 in 2025).
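A minimal sketch of that transformation, assuming a hypothetical `AgeObservation` record and helper names of my own choosing. Without knowing the birthday, the derived birth year can be off by one, so treat the result as approximate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgeObservation:
    subject: str        # who the statement is about
    age: int            # the relative fact as stated
    observed_year: int  # the context that makes it absolute


def approx_birth_year(obs: AgeObservation) -> int:
    """Turn 'age N in year Y' into an approximate year of birth.

    The true year is Y - N or Y - N - 1, depending on whether the
    birthday had already passed when the statement was made.
    """
    return obs.observed_year - obs.age


def approx_age_in(obs: AgeObservation, year: int) -> int:
    """Re-derive the age for any later year from the absolute fact."""
    return year - approx_birth_year(obs)


obs = AgeObservation(subject="daughter", age=10, observed_year=2025)
print(approx_birth_year(obs))    # 2015
print(approx_age_in(obs, 2027))  # 12
```

Stored this way, the fact never goes stale: the age is computed at query time instead of being overwritten every year.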

replies(1): >>44970732 #