Show HN: Muscle-Mem, a behavior cache for AI agents

1. mindwok ◴[15 May 25 02:13 UTC] No.43991197[source]▶

It's becoming increasingly clear that memory and context are the bottlenecks in advancing usage of AI. I can't help but feel there needs to be a general, perhaps even built into the model, solution for this - everyone seems to be building something on top that is roughly the same thing.

replies(4): >>43991636 #>>43997414 #>>43998967 #>>44001182 #

2. ramoz ◴[15 May 25 03:43 UTC] No.43991636[source]▶

>>43991197 (TP) #

Karpathy had a similar interesting take the other day

https://x.com/karpathy/status/1921368644069765486

replies(2): >>43994547 #>>43997565 #

3. FisherKK ◴[15 May 25 12:57 UTC] No.43994547[source]▶

>>43991636 #

Skill Library!

4. hnuser123456 ◴[15 May 25 17:41 UTC] No.43997414[source]▶

>>43991197 (TP) #

Fine tuning should be combined with inference in some way. However this requires keeping the model loaded at high enough precision for backprop to work.

Instead of hundreds of thousands of us downloading the latest and greatest model that won't fundamentally update one bit until we're graced with the next one, I would think we should all be able to fine-tune the weights so that it can naturally memorize new additional info and preferences without using up context length.

5. hnuser123456 ◴[15 May 25 17:57 UTC] No.43997565[source]▶

>>43991636 #

I'm starting up experiments with having agents write system prompts for sub-agents. Specifically, have the LLM build, test, and validate a small, simple tool, and once validated, add it to its own system prompt listing available tools.

Anyone else experimenting with letting LLMs generate their own or sub-agent system prompts?

6. pacjam ◴[15 May 25 20:26 UTC] No.43998967[source]▶

>>43991197 (TP) #

check out Letta - the OSS codebase (https://github.com/letta-ai/letta) is basically focused on solving the memory/context problem in a generalized way (via "agentic context management"). if you're more interested in papers, we also worked on MemGPT and more recently sleep-time compute (https://arxiv.org/abs/2504.13171)

replies(1): >>43999253 #

7. edunteman ◴[15 May 25 20:55 UTC] No.43999253[source]▶

>>43998967 #

Love your sleep time stuff! It's an inspiration for Muscle Mem

8. tom_m ◴[16 May 25 01:58 UTC] No.44001182[source]▶

>>43991197 (TP) #

Absolutely. The "intelligence" isn't complete without a memory. In fact there's a whole lot more to it than that. The LLM is one component, a logic factory, but there's so much more than the LLM and the memory.

In fact, systems should be LLM agnostic or use different models for different needs.

I don't believe building something into the model will ever be the solution though. It is interesting what Google is trying to do with model caching but at the end of the day I believe the strength of agents here will rely heavily upon modularity.