←back to thread

370 points meetpateltech | 1 comments | | HN request time: 0.205s | source
1. haffi112 ◴[] No.44006459[source]
(watching live) I'm wondering how it performs on the METR benchmark (https://metr.org/blog/2025-03-19-measuring-ai-ability-to-com...).