
Alignment is capability

(www.off-policy.com)
106 points drctnlly_crrct | 1 comment
xnorswap No.46192597
I've only been using it a couple of weeks, but in my opinion, Opus 4.5 is the biggest jump in tech we've seen since ChatGPT 3.5.

The difference between juggling Sonnet 4.5 / Haiku 4.5 and just using Opus 4.5 for everything is night & day.

Unlike Sonnet 4.5, which merely showed promise of being able to go off and complete complex tasks, Opus 4.5 seems genuinely capable of doing so.

Sonnet needed hand-holding and correction at almost every step. Opus just needs correction and steering at an early stage, and sometimes will push back and correct my understanding of what's happening.

It has astonished me with its ability to produce easy-to-read PDFs via Typst, and it has produced large documents outlining how to approach very tricky tech migration tasks.

Sonnet would get there eventually, but not without a few rounds of dealing with compilation errors or hallucinated data. Opus seems to like doing "And let me just check my assumptions" searches, which make all the difference.

replies(5): >>46192783 #>>46192922 #>>46193718 #>>46194371 #>>46196267 #
1. throw310822 No.46193718
Cursor with Claude 4.5 Opus has been writing all my code for the past few days. It's exhilarating: I can describe features and they get added to my code in a matter of seconds, minutes at most. It gets almost everything right, certainly more than I would on the first try. I only hand-code parts that are small and tricky, and provide guidance on the general architecture, where to put things and how to organise them. It's an incredible way of working; the only nagging doubt is how long it will last before employers decide they don't need me in the loop at all.