174 points Philpax | 1 comment
codingwagie ◴[] No.43719845[source]
I just used o3 to design a distributed scheduler that scales to 1M+ schedules a day. It was perfect, and did better than two weeks of thought around the best way to build this.
replies(8): >>43719906 #>>43720086 #>>43720092 #>>43721143 #>>43721297 #>>43722293 #>>43723047 #>>43727685 #
davidsainez ◴[] No.43720086[source]
While impressive, I'm not convinced that improved performance on tasks of this nature is indicative of progress toward AGI. Building a scheduler is a well-studied problem space. Something like the ARC benchmark is much more indicative of progress toward true AGI, but probably still insufficient.
replies(2): >>43720972 #>>43721178 #
1. codingwagie ◴[] No.43720972[source]
The other models failed at this miserably. There were also specific technical requirements I gave it related to my tech stack.