I wonder how this compares to KBLaM [1], which also has a preprocessing step that turns a large amount of reference material into something the LLM can attend to directly. One obvious difference is the modified attention mechanism they call "rectangular attention" (rough sketch of my understanding below). The paper has been posted on HN a few times, but it hasn't generated any discussion yet.
[1]: Introducing KBLaM: Bringing plug-and-play external knowledge to LLMs | https://www.microsoft.com/en-us/research/blog/introducing-kb...
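For concreteness, here is how I read the "rectangular" part from the blog post: the knowledge base is preprocessed offline into key/value vectors ("knowledge tokens"), and at inference time only the prompt tokens act as queries, attending over the knowledge tokens plus the prompt tokens. The score matrix is therefore N x (M + N) rather than square, so cost grows linearly with the knowledge base size M. This is just a toy sketch of my understanding, not their actual code; the function name and shapes are made up for illustration.

    import torch
    import torch.nn.functional as F

    def rectangular_attention(prompt_q, prompt_k, prompt_v, kb_k, kb_v):
        # prompt_q/k/v: (N, d) -- queries/keys/values for the N prompt tokens
        # kb_k, kb_v:   (M, d) -- precomputed "knowledge token" key/value pairs
        N, d = prompt_q.shape
        M = kb_k.shape[0]

        keys = torch.cat([kb_k, prompt_k], dim=0)    # (M + N, d)
        values = torch.cat([kb_v, prompt_v], dim=0)  # (M + N, d)

        # Rectangular score matrix: N queries vs (M + N) keys.
        scores = prompt_q @ keys.T / d ** 0.5        # (N, M + N)

        # Causal mask applies only to the prompt-token block; every prompt
        # token may look at every knowledge token.
        causal = torch.tril(torch.ones(N, N)).bool()
        mask = torch.cat([torch.ones(N, M).bool(), causal], dim=1)
        scores = scores.masked_fill(~mask, float("-inf"))

        return F.softmax(scores, dim=-1) @ values    # (N, d)

Since the knowledge tokens never act as queries and don't attend to each other, adding more of them only adds columns to the score matrix, which is presumably where the claimed linear scaling in KB size comes from.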