
1480 points sandslash | 1 comments
mentalgear ◴[] No.44316934[source]
Meanwhile, this morning I asked Claude 4 to write a simple EXIF normalizer. After two rounds of prompting it to double-check its code, I still had to point out that it makes no sense to load the entire image for re-orientation if the EXIF orientation is fine in the first place.
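
For illustration, the check described above can be done without decoding any pixels. What follows is a rough sketch, not the code from that session: it assumes JPEG input, uses only the Python standard library, and the function names (`read_exif_orientation`, `needs_reorientation`) are made up for this example. It walks the JPEG segment headers, reads the EXIF orientation tag (0x0112) from the APP1 block, and stops before the pixel data.

```python
import struct
from typing import BinaryIO, Optional

ORIENTATION_TAG = 0x0112  # standard EXIF/TIFF tag number for Orientation


def _orientation_from_tiff(tiff: bytes) -> Optional[int]:
    """Scan the first IFD of a TIFF block for the orientation tag."""
    if len(tiff) < 8:
        return None
    if tiff[:2] == b"II":
        endian = "<"  # little-endian ("Intel") byte order
    elif tiff[:2] == b"MM":
        endian = ">"  # big-endian ("Motorola") byte order
    else:
        return None
    (ifd_offset,) = struct.unpack(endian + "I", tiff[4:8])
    if ifd_offset + 2 > len(tiff):
        return None
    (count,) = struct.unpack(endian + "H", tiff[ifd_offset:ifd_offset + 2])
    for i in range(count):
        entry = ifd_offset + 2 + 12 * i  # each IFD entry is 12 bytes
        if entry + 12 > len(tiff):
            return None
        (tag,) = struct.unpack(endian + "H", tiff[entry:entry + 2])
        if tag == ORIENTATION_TAG:
            # Orientation is a SHORT stored in the first 2 bytes of the value field.
            (value,) = struct.unpack(endian + "H", tiff[entry + 8:entry + 10])
            return value
    return None


def read_exif_orientation(stream: BinaryIO) -> Optional[int]:
    """Return the EXIF orientation of a JPEG stream, or None if not found.

    Only segment headers and the APP1 payload are read; pixel data is never touched.
    """
    if stream.read(2) != b"\xff\xd8":  # SOI marker: not a JPEG otherwise
        return None
    while True:
        header = stream.read(4)
        if len(header) < 4 or header[0] != 0xFF:
            return None
        marker = header[1]
        (length,) = struct.unpack(">H", header[2:4])
        if marker == 0xDA:  # start of scan: compressed image data follows
            return None
        payload_len = length - 2  # the length field counts its own 2 bytes
        if marker == 0xE1:  # APP1, which may hold EXIF (or something else, e.g. XMP)
            payload = stream.read(payload_len)
            if payload[:6] == b"Exif\x00\x00":
                return _orientation_from_tiff(payload[6:])
        else:
            stream.seek(payload_len, 1)  # skip segments we do not care about


def needs_reorientation(orientation: Optional[int]) -> bool:
    """Orientation 1 (or a missing tag) means the image is already upright."""
    return orientation in range(2, 9)
```

Only files for which `needs_reorientation` returns true would then need to be handed to an image library for the actual rotation; everything else can be left alone.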

Vibe vs reality, and anyone actually working in the space daily can attest to how brittle these systems are.

Maybe this changes in SWE with more automated tests in verifiable simulators, but the real world is far too complex to simulate in its vastness.

replies(7): >>44317104 #>>44317116 #>>44317136 #>>44317214 #>>44317305 #>>44317622 #>>44317741 #
ramon156 ◴[] No.44317136[source]
The real question is how long it'll take until they're not brittle
replies(3): >>44317160 #>>44317197 #>>44317483 #
yahoozoo ◴[] No.44317483[source]
“Treat it like a junior developer” … 5 years later … “Treat it like a junior developer”
replies(2): >>44317582 #>>44317623 #
TeMPOraL ◴[] No.44317623[source]
Usable LLMs are 3 years old at this point. ChatGPT, not GitHub Copilot, is the marker.
replies(1): >>44320349 #
1. LtWorf ◴[] No.44320349[source]
Usable for fun, yes.