
Devstral (mistral.ai)
701 points by mfiguiere | 1 comment
simonw ◴[] No.44053886[source]
The first number I look at these days is the file size via Ollama, which for this model is 14GB https://ollama.com/library/devstral/tags

I find that on my M2 Mac that number is a rough approximation of how much memory the model needs (usually plus about 10%), which matters because I want to know how much RAM I will have left for running other applications.

Anything below 20GB tends not to interfere too much with the other stuff I'm running. This model looks promising!
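
(A minimal sketch of that heuristic, assuming Ollama is running locally on its default port 11434; GET /api/tags does return each pulled model's size in bytes, while the 10% overhead figure is just the rule of thumb above:)

    # Estimate RAM needed for a locally pulled Ollama model:
    # on-disk size plus ~10% overhead, per the heuristic above.
    import requests

    def estimated_ram_gb(model_name: str, overhead: float = 0.10) -> float:
        tags = requests.get("http://localhost:11434/api/tags").json()
        for model in tags.get("models", []):
            if model["name"].startswith(model_name):
                return model["size"] / 1e9 * (1 + overhead)  # size is in bytes
        raise ValueError(f"model {model_name!r} not pulled locally")

    print(f"~{estimated_ram_gb('devstral'):.1f} GB")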

replies(4): >>44054806 #>>44056502 #>>44059216 #>>44059888 #
nico ◴[] No.44056502[source]
Any agentic dev software you could recommend that runs well with local models?

I’ve been using Cursor and I’m kind of disappointed. I get better results just going back and forth between the editor and ChatGPT

I tried localforge and aider, but they are kinda slow with local models

replies(6): >>44056637 #>>44057592 #>>44058473 #>>44059316 #>>44064049 #>>44071582 #
zackify ◴[] No.44058473[source]
I used Devstral today with Cline and OpenHands. Worked great in both.

About 1 minute of initial prompt processing time on an M4 Max.

Using LM Studio because the Ollama API breaks if you set the context to 128k.
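
(For reference, a sketch of the configuration being described, assuming Ollama's REST API on its default port; num_ctx is Ollama's context-window option, and 131072 tokens is the 128k setting the comment says breaks:)

    # Request a 128k-token context window from Ollama's generate endpoint.
    import requests

    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": "devstral",
            "prompt": "Refactor this function to be iterative.",
            "options": {"num_ctx": 131072},  # 128k context
            "stream": False,
        },
    )
    print(resp.json()["response"])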

replies(2): >>44060526 #>>44062026 #
elAhmo ◴[] No.44060526[source]
How is it great that it takes 1 minute for initial prompt processing?
replies(2): >>44080572 #>>44112119 #
zackify ◴[] No.44112119[source]
Haha, great as in surprisingly good at some simple things that nothing else has been able to do locally for me.

The 1-minute time to first token sucks and has me dreaming of the day we get 3-4x the memory bandwidth.
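
(A rough back-of-envelope on why bandwidth is the dream, using assumed figures: Apple quotes ~546 GB/s memory bandwidth for the M4 Max, and Devstral's quantized weights are ~14GB per the Ollama listing. Decode is roughly bandwidth-bound, since every generated token streams all the weights once; prompt prefill, the 1-minute part, is more compute-bound, but faster memory still helps:)

    # All figures are assumptions, not measurements.
    bandwidth_gb_s = 546   # Apple's quoted M4 Max memory bandwidth
    weights_gb = 14        # Devstral quantized weights, per the Ollama listing
    ceiling = bandwidth_gb_s / weights_gb  # each token reads the full weights
    print(f"~{ceiling:.0f} tokens/s decode ceiling")  # ~39 tokens/s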