As another datapoint Ian (of Anandtech) estimated that the M1 would need to be clocked at 3.25Ghz to match Zen 3, and these systems are showing a 3.2Ghz clock: https://twitter.com/IanCutress/status/1326516048309460992
As another datapoint Ian (of Anandtech) estimated that the M1 would need to be clocked at 3.25Ghz to match Zen 3, and these systems are showing a 3.2Ghz clock: https://twitter.com/IanCutress/status/1326516048309460992
You can check the clock speeds: https://browser.geekbench.com/v5/cpu/4620493.gb5
Up to 5050MHz is stock behavior for the 5950X and it's using standard DDR4 3200 memory.
Simpler ARM ISA has advantages in very small / energy efficient CPUs since the silicon translation logic can be smaller but this advantage grows increasingly irrelevant when you are scaling to bigger, faster cores.
IMHO these days ISA implications on performance and efficiency are being overstated.
Desktop CPUs differ from the mobile CPUs mainly in how much can they boost more/all cores.
Isn't the M1 fabbed on TSMC 5nm? Zen 3 is on 7nm. If a Zen 3 APU will run close to Apple Silicon I will be mightily impressed.
Noooo, besides simply copying instructions 1-to-1, the process is way to involved, and imposes 40 years old assumptions on memory model, and many other things, which greatly limits the amount of way you can interact with the CPU, adds to transistor count, and makes making efficient compilers really hard.
(that was sarcasm. My take is this performance is impressive but you should not be surprised if it does not completely outperform CPUs that should be less efficient)
* 3800X (105W desktop) scores 2855
* 4900H (45W mobile) scores 2707 or 95% of 3800X
* 4750U (15W mobile) scores 2596 or 91% of 3800X
Rather i suspect that the main benefit that M1 has in many real world benchmarks is that it has on-chip memory, cache-miss latency is a huge cost in the real world (why games has drifted towards DoD internals), so sidestepping that issue to a large extent by integrating memory on-die gives it a great boost.
I'm betting once they've reverse engineered the M1 perf, we will see multi-GB caches on AMD/Intel chips within 4 years.
This cannot be implemented in AMD's current 7nm process due to size restrictions.
The SoC-side of the story is also contrary to the very core design of a general purpose CPU. RAM, GPU, and extension cards for specialised tasks are already covered by 3rd party products on the PCIe and USB4 buses and AMD has no interest in cannibalising their GPU and console business...
With their upcoming discrete GPUs and accelerator cards, Intel might be in the same boat w.r.t. SoC design.
I'm probably not the first or last to suggest this but... it seems awfully tempting to say: why can't we throw away the concept of maintaining binary comparability yet and target some level of "internal" ISA directly (if intel/AMD could provide such an interface in parallel to the high level ISA)... with the accepted cost of knowing that ISA will change in not necessarily forward compatible ways between CPU revisions.
From the user's perspective we'd either end up with more complex binary distribution, or needing to compile for your own CPU FOSS style when you want to escape the performance limitations of x86.
Source:
[1]: https://browser.geekbench.com/processors/amd-ryzen-7-3700x
[2]: https://browser.geekbench.com/processors/amd-ryzen-7-3700u
Back then, Intel was still betting on Itanium. It was a time when AMD was ahead of Intel. Wintel lasted longer, and its only since the smartphone revolution they got caught up. In hindsight, even a Windows computer on Intel gave a user more freedom than the locked down stuff on say iOS. OTOH, sometimes user freedom is a bad thing, arguably if the user isn't technically inclined or if you can sell a locked down platform like PlayStation or Xbox for relatively cheap (kind of like the printer business).
I'm sure other people can add to this as well. :-)
Renoir is 7nm Zen 2 aka the 4000 series. https://en.wikichip.org/wiki/amd/cores/renoir
Matisse is also 7nm Zen 2 aka the desktop 3000 series. https://en.wikichip.org/wiki/Matisse
Picasso is 12nm Zen+ aka the mobile 3000 series. https://en.wikichip.org/wiki/amd/cores/picasso
* 3800XT = 1357 (100%)
* 4800H = 1094 (~80%)
* 4800U = 1033 (~76%)
I would expect a 5800U to score at best around 1500, but realistically closer to 1300-1450. That's still behind the M1, but pretty darn close for being behind a node (and will still probably be faster for applications that would require x86 translation).
I understand you are being sarcastic, but no, that's not what's not what I'm saying.
It is Apple Silicon that is faster (at least on paper). I'm saying I think even though AMD will have worse perf/watt, I think it will get impressively close despite it's less efficient fabrication process.