Devstral

(mistral.ai)
701 points mfiguiere | 1 comments
gyudin ◴[] No.44053907[source]
Super weird benchmarks
replies(1): >>44053952 #
avereveard ◴[] No.44053952[source]
From what I gather, it's fine-tuned to use OpenHands specifically, so it shows value on those benchmarks that target a whole system as a black box (i.e. agent + LLM) more than on ones that directly target the LLM's inputs/outputs.
replies(1): >>44057074 #