I just used o3 to design a distributed scheduler that scales to 1M+ sxchedules a day. It was perfect, and did better than two weeks of thought around the best way to build this.
is this because the LLM actually reasoned on a better design or because it found a better design in its "database" scoured from another tenured engineer.
Ignoring the copyright issues, credit issues, and any ethical concerns... this approach doesn't work for anything not in the "database", it's not AGI and the tangential experience is barely relevant to the article.