←back to thread

210 points vincirufus | 3 comments | | HN request time: 0.711s | source
1. Jcampuzano2 ◴[] No.45145915[source]
Hmm with the lower context length I'm wonder how it holds up for problems requiring slightly larger context given we know most models tend to degrade fairly quickly with context length.

Maybe it's best for shorter tasks or condensed context?

I find it interesting the number of models latching onto Claude codes harness. I'm still using Cursor for work and personal but tried out open code and Claude for a bit. I just miss having the checkpoints and whatnot.

replies(1): >>45145994 #
2. CuriouslyC ◴[] No.45145994[source]
https://fiction.live/stories/Fiction-liveBench-Feb-21-2025/o...
replies(1): >>45146834 #
3. saretup ◴[] No.45146834[source]
Interesting, although how hard is it to add a sorting functionality to the table?