←back to thread

Devstral

(mistral.ai)
701 points mfiguiere | 2 comments | | HN request time: 0.402s | source
Show context
gyudin ◴[] No.44053907[source]
Super weird benchmarks
replies(1): >>44053952 #
1. avereveard ◴[] No.44053952[source]
from what I gather it's finetuned to use OpenHand specifically so shows value on thsoe benchmark that target a whole system as a blackbox (i.e. agent + llm) more than directly target the llm input/outputs
replies(1): >>44057074 #
2. amarcheschi ◴[] No.44057074[source]
Yup the 1st comment says this https://www.reddit.com/r/LocalLLaMA/comments/1kryybf/mistral...