There is another common factor among the top machines: a large majority are based on HPE Slingshot networking (7 of the top 10 by my count).
Without blindingly fast networking, otherwise blinding numerical performance dims quite a lot. This is why the Cerebras numbers on heavy numerical problems are competitive only up to a pretty severe ceiling. Below that point, their on-wafer interconnects suffice; above it, they cannot scale the data communication bandwidth the problem needs.
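
To put rough numbers on that ceiling, here is a minimal roofline-style sketch. It assumes sustained throughput is simply the smaller of peak compute and what the link can feed. Every figure below is made up for illustration and is not a measurement of Slingshot, Cerebras, or any real system; the function and constant names are mine.

```python
def sustained_flops(peak_flops: float, link_bytes_per_s: float,
                    flops_per_byte: float) -> float:
    """Roofline-style bound: the slower of compute and communication wins."""
    return min(peak_flops, link_bytes_per_s * flops_per_byte)


# Illustrative, hypothetical numbers only.
PEAK = 1.0e15        # raw compute, FLOP/s
FAST_LINK = 1.0e13   # bytes/s, stands in for a fast local interconnect
SLOW_LINK = 1.0e11   # bytes/s, stands in for a path that cannot keep up
INTENSITY = 200.0    # FLOPs performed per byte communicated

fast = sustained_flops(PEAK, FAST_LINK, INTENSITY)
slow = sustained_flops(PEAK, SLOW_LINK, INTENSITY)

print(f"fast link: {fast / PEAK:.0%} of peak")
print(f"slow link: {slow / PEAK:.0%} of peak")
```

With these toy inputs the fast link leaves the machine compute-bound, while the slow link caps it at a small fraction of peak. That is the dimming effect in miniature: the arithmetic units are identical in both cases, and only the data path changed.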