Devstral

(mistral.ai)
701 points | mfiguiere | 3 comments
1. gyudin No.44053907
Super weird benchmarks
replies(1): >>44053952 #
2. avereveard No.44053952
From what I gather, it's fine-tuned to use OpenHands specifically, so it shows value on those benchmarks that target a whole system as a black box (i.e. agent + LLM) more than on ones that directly target the LLM's inputs/outputs.
replies(1): >>44057074 #
3. amarcheschi No.44057074
Yup, the 1st comment says this: https://www.reddit.com/r/LocalLLaMA/comments/1kryybf/mistral...