(mistral.ai)

701 points mfiguiere | 4 comments | 21 May 25 14:21 UTC | HN request time: 0.444s | source

Show context

christophilus ◴[22 May 25 02:36 UTC] No.44058247[source]▶

What hardware are y'all using when you run these things locally? I was thinking of pre ordering the Framework desktop[0] for this purpose, but I wouldn't mind having a decent laptop that could run it (ideally Linux).

[0] https://frame.work/desktop

replies(4): >>44058269 #>>44058281 #>>44058363 #>>44058499 #

1. zackify ◴[22 May 25 03:30 UTC] No.44058499[source]▶

>>44058247 #

M4 max 128gb ram.

LM studio MLX with full 128k context.

It works well but has a long 1 minute initial prompt processing time.

I wouldn’t buy a laptop for this, I would wait for the new AMD 32gb gpu coming out.

If you want a laptop I even consider my m4 max too slow to use more than just here or there.

It melts if you run this and battery goes down asap. Have to use it docked for full speed really

replies(3): >>44058814 #>>44062894 #>>44112621 #

2. pram ◴[22 May 25 04:34 UTC] No.44058814[source]▶

>>44058499 (TP) #

Yep I have an M4 Max Studio with 128GB of RAM, even the Q8 GGUF fits in memory with 131k context. Memory pressure at 45% lol

3. discordance ◴[22 May 25 15:12 UTC] No.44062894[source]▶

>>44058499 (TP) #

How many tokens per second are you both getting?

4. bicepjai ◴[28 May 25 03:59 UTC] No.44112621[source]▶

>>44058499 (TP) #

Do you also have tokens per second metric ?

↑

Devstral