(www.youtube.com)

1480 points sandslash | 1 comments | 19 Jun 25 00:33 UTC | HN request time: 0.231s | source

Show context

mentalgear ◴[19 Jun 25 09:33 UTC] No.44316934[source]▶

Meanwhile, I asked this morning Claude 4 to write a simple EXIF normalizer. After two rounds of prompting it to double-check its code, I still had to point out that it makes no sense to load the entire image for re-orientating if the EXIF orientation is fine in the first place.

Vibe vs reality, and anyone actually working in the space daily can attest how brittle these systems are.

Maybe this changes in SWE with more automated tests in verifiable simulators, but the real world is far to complex to simulate in its vastness.

replies(7): >>44317104 #>>44317116 #>>44317136 #>>44317214 #>>44317305 #>>44317622 #>>44317741 #

ramon156 ◴[19 Jun 25 10:13 UTC] No.44317136[source]▶

>>44316934 #

The real question is how long it'll take until they're not brittle

replies(3): >>44317160 #>>44317197 #>>44317483 #

yahoozoo ◴[19 Jun 25 11:08 UTC] No.44317483[source]▶

>>44317136 #

“Treat it like a junior developer” … 5 years later … “Treat it like a junior developer”

replies(2): >>44317582 #>>44317623 #

TeMPOraL ◴[19 Jun 25 11:31 UTC] No.44317623[source]▶

>>44317483 #

Usable LLMs are 3 years old at this point. ChatGPT, not Github Copilot, is the marker.

replies(1): >>44320349 #

1. LtWorf ◴[19 Jun 25 16:49 UTC] No.44320349[source]▶

>>44317623 #

Usable for fun yes.

↑

Andrej Karpathy: Software in the era of AI [video]