Most active commenters
  • baybal2(3)
  • MrBuddyCasino(3)

←back to thread

1080 points antipaul | 18 comments | | HN request time: 0s | source | bottom
Show context
zdw ◴[] No.25066465[source]
AMD's Zen 3 (Ryzen 5xxx series) are beating the Apple M1 in single core score: https://browser.geekbench.com/v5/cpu/singlecore

As another datapoint Ian (of Anandtech) estimated that the M1 would need to be clocked at 3.25Ghz to match Zen 3, and these systems are showing a 3.2Ghz clock: https://twitter.com/IanCutress/status/1326516048309460992

replies(9): >>25066469 #>>25066520 #>>25066537 #>>25066720 #>>25067051 #>>25067086 #>>25068425 #>>25068547 #>>25069628 #
YetAnotherNick ◴[] No.25066720[source]
No, they aren't. All of the top results have crazy overclocking and liquid cooling. You need to look the numbers here: https://browser.geekbench.com/processor-benchmarks. Top end Zen 3 is slightly lower than M1.
replies(1): >>25066806 #
trynumber9 ◴[] No.25066806[source]
Not exactly.

You can check the clock speeds: https://browser.geekbench.com/v5/cpu/4620493.gb5

Up to 5050MHz is stock behavior for the 5950X and it's using standard DDR4 3200 memory.

replies(2): >>25067097 #>>25067439 #
1. baybal2 ◴[] No.25067097[source]
Yet it still makes it very clear: a properly implemented ARM core can easily bury an X86 of equivalent size because of inherent advantage of not having to pay interest on 40 years of technical debt in the ISA.
replies(3): >>25067150 #>>25068040 #>>25068302 #
2. throwaway2048 ◴[] No.25067150[source]
ISA has very little to do with it, ARM is almost as old as x86.
replies(2): >>25067288 #>>25067399 #
3. MrBuddyCasino ◴[] No.25067288[source]
ARM went through multiple iterations of its ISA. They don’t need to run 40 year old code.
replies(1): >>25067448 #
4. GreenHeuristics ◴[] No.25067399[source]
AArch64/ARM64 was developed from the ground up, not bolted on to the old 32-bit ISA
5. The_Colonel ◴[] No.25067448{3}[source]
X86 CPUs are not really "running" X86 ISA since Pentium Pro (1995), they are translating on-the-fly X86 instructions to microcode which is actually getting executed. ARM CPUs are also not executing ARM ISA directly and doing translation as well.

Simpler ARM ISA has advantages in very small / energy efficient CPUs since the silicon translation logic can be smaller but this advantage grows increasingly irrelevant when you are scaling to bigger, faster cores.

IMHO these days ISA implications on performance and efficiency are being overstated.

replies(2): >>25067471 #>>25068037 #
6. MrBuddyCasino ◴[] No.25067471{4}[source]
Yes, those are widely known fact. There are aspects of the ISA that do constrain performance and cannot be easily worked around, eg the memory model which is more relaxed on ARM.
7. baybal2 ◴[] No.25068037{4}[source]
> IMHO these days ISA implications on performance and efficiency are being overstated.

Noooo, besides simply copying instructions 1-to-1, the process is way to involved, and imposes 40 years old assumptions on memory model, and many other things, which greatly limits the amount of way you can interact with the CPU, adds to transistor count, and makes making efficient compilers really hard.

replies(1): >>25070468 #
8. vbezhenar ◴[] No.25068040[source]
Because 5 nm better than 7 nm. That's about it. AMD Zen will be on par with Apple Silicone when they'll use 5 nm process.
replies(1): >>25068719 #
9. Fnoord ◴[] No.25068302[source]
AMD64 (x86-64) runs x86-32 at near-native speed, but it isn't x86-32. As someone who was an early adopter of Linux/AMD64 I know first-hand backwards compatibility is very important. Apple knows, hence Rosetta. Every time they switch architecture, they invest into backwards compatibility. As a counter-example, Itanium wasn't good with backwards compatibility.
replies(1): >>25070951 #
10. whizzter ◴[] No.25068719[source]
Actually i'd bet that you're both wrong. What M1 does well isn't that "ARM-is-better" or that they're using a smaller process (even if both factors probably plays into helping the M1 chips edge a few %).

Rather i suspect that the main benefit that M1 has in many real world benchmarks is that it has on-chip memory, cache-miss latency is a huge cost in the real world (why games has drifted towards DoD internals), so sidestepping that issue to a large extent by integrating memory on-die gives it a great boost.

I'm betting once they've reverse engineered the M1 perf, we will see multi-GB caches on AMD/Intel chips within 4 years.

replies(3): >>25069586 #>>25069884 #>>25081011 #
11. qayxc ◴[] No.25069586{3}[source]
There's nothing to "reverse engineer" there: M1 has 4x the L1 cache and a wider bus. That's it.

This cannot be implemented in AMD's current 7nm process due to size restrictions.

The SoC-side of the story is also contrary to the very core design of a general purpose CPU. RAM, GPU, and extension cards for specialised tasks are already covered by 3rd party products on the PCIe and USB4 buses and AMD has no interest in cannibalising their GPU and console business...

With their upcoming discrete GPUs and accelerator cards, Intel might be in the same boat w.r.t. SoC design.

12. FrojoS ◴[] No.25069884{3}[source]
The M1 has an L1 cache with less than 0.5 KB per core and an L2 with about 4 MB shared by all cores. https://en.wikipedia.org/wiki/Apple_M1
13. tomxor ◴[] No.25070468{5}[source]
Interesting point. So on the one hand we have all these layers in the CPU to abstract away things in the ISA that are not ideal for block level implementation... but on the other hand compilers are still targeting that high level ISA... and ironically they also have their own more general abstraction, the intermediate representation.

I'm probably not the first or last to suggest this but... it seems awfully tempting to say: why can't we throw away the concept of maintaining binary comparability yet and target some level of "internal" ISA directly (if intel/AMD could provide such an interface in parallel to the high level ISA)... with the accepted cost of knowing that ISA will change in not necessarily forward compatible ways between CPU revisions.

From the user's perspective we'd either end up with more complex binary distribution, or needing to compile for your own CPU FOSS style when you want to escape the performance limitations of x86.

replies(1): >>25072099 #
14. thrwyoilarticle ◴[] No.25070951[source]
What was the Linux landscape like for AMD64 early adopters?
replies(1): >>25071030 #
15. Fnoord ◴[] No.25071030{3}[source]
Debian was quick with adopting it (they've always been very cross-platform focused), in contrast to say Windows (which took a lot longer). On Linux, a lot worked, but not everything. Slowly but surely more got ported to AMD64. What didn't work? Especially pre-compiled proprietary software was not available (IIRC Nvidia drivers? At the very least games). You had to have x86-32 userland installed. Which adds up to higher diskspace requirement. Nowadays, diskspace requirement is negligible, and x86-32 userland is less relevant (on AMD64/x86-64). I would assume the 4 GB limit eventually made games swap to AMD64 as well.

Back then, Intel was still betting on Itanium. It was a time when AMD was ahead of Intel. Wintel lasted longer, and its only since the smartphone revolution they got caught up. In hindsight, even a Windows computer on Intel gave a user more freedom than the locked down stuff on say iOS. OTOH, sometimes user freedom is a bad thing, arguably if the user isn't technically inclined or if you can sell a locked down platform like PlayStation or Xbox for relatively cheap (kind of like the printer business).

I'm sure other people can add to this as well. :-)

replies(1): >>25082341 #
16. MrBuddyCasino ◴[] No.25072099{6}[source]
I think IBM mainframes do something like this. Software is distributed as bytecode, and then compiled to machine code to CPU-specific assembly.
17. baybal2 ◴[] No.25081011{3}[source]
> What M1 does well isn't that "ARM-is-better"

Of course, not all to it, but denying that having to emulate a 40 years old ISA does not place a huge cost on transistor count, and efficiency is impossible.

18. thrwyoilarticle ◴[] No.25082341{4}[source]
Thanks