←back to thread

469 points samuelstros | 2 comments | | HN request time: 0s | source
Show context
OtherShrezzing ◴[] No.44998715[source]
I think it’s just that the base model is good at real world coding tasks - as opposed to the types of coding tasks in the common benchmarks.

If you use GitHub Copilot - which has its own system level prompts - you can hotswap between models, and Claude outperforms OpenAI’s and Google’s models by such a large margin that the others are functionally useless in comparison.

replies(4): >>44998798 #>>44998867 #>>45001236 #>>45001252 #
1. paool ◴[] No.45001252[source]
It's not just the base model

Try using opus with cline in vs code. Then use Claude code.

I don't know the best way to quantify the differences, but I know I get more done in CC.

replies(1): >>45004174 #
2. afarah1 ◴[] No.45004174[source]
But is it a game changer vs CoPilot in Agent mode with Claude 4 Sonnet?

Because it's twice the price and doesn't even have a trial.

I feel like if it were a game changer, like Cursor once was vs Ask mode with GPT, it would be worth it, but CoPilot has come a long way and the only up-to-date comparisons I've read point to it being marginally better or the same, but twice the price.