←back to thread

296 points todsacerdoti | 1 comments | | HN request time: 0.842s | source
Show context
blixt ◴[] No.44370298[source]
I’m starting to think “The Bitter Lesson” is a clever sounding way to give shade to people that failed to nail it on their first attempt. Usually engineers build much more technology than they actually end up needing, then the extras shed off with time and experience (and often you end up building it again from scratch). It’s not clear to me that starting with “just build something that scales with compute” would get you closer to the perfect solution, even if as you get closer to it you do indeed make it possible to throw more compute at it.

That said the hand coded nature of tokenization certainly seems in dire need of a better solution, something that can be learned end to end. And It looks like we are getting closer with every iteration.

replies(3): >>44370492 #>>44370692 #>>44370704 #
1. RodgerTheGreat ◴[] No.44370492[source]
The bitter lesson says more about medium-term success at publishable results than it does about genuine scientific progress or even success in the market.