TPDE-LLVM: Faster LLVM -O0 Back-End

1. testdelacc1 ◴[03 Sep 25 08:10 UTC] No.45113380[source]▶

LLVM is the code generation backend used in several languages, like Rust and one of the many compilers for C and C++ (clang). Code generated by these compilers is considered “fast/performant” thanks to LLVM.

The problem with LLVM has always been that it takes a long time to produce code. The post in the link promises a new backend that produces a slower artifact, but does so 10-20x quicker. This is great for debug builds.

This doesn’t mean the compilation as a whole gets quicker. There are 3 steps in compilation

- Front end: transforms source code into an LLVM intermediation representation (IR)

- Backend: this is where LLVM comes in. It accepts LLVM IR and transforms it into machine code

- Linking: a separate program links the artifacts produced by LLVM.

How long does each step take? Really depends on the program we’re trying to compile. This blog post contains timings for one example program (https://blog.rust-lang.org/2023/11/09/parallel-rustc/) to give you an idea. It also depends on whether LLVM is asked to produce a debug build (not performant, but quicker to produce) or a release build (fully optimised, takes longer).

The 10-20x improvement described here doesn’t work yet for clang or rustc, and when it does it will only speed up the backend portion. Nevertheless, this is still an incredible win for compile times because the other two steps can be optimised independently. Great work by everyone involved.

replies(3): >>45113440 #>>45113555 #>>45113671 #

2. tialaramex ◴[03 Sep 25 08:23 UTC] No.45113440[source]▶

>>45113380 (TP) #

IMO the worst problem with LLVM isn't that it's slow, the worst problem is that its IR has poorly defined semantics or its team doesn't actually deliver those semantics and a bug ticket saying "Hey, what gives?" goes in the pile of never-never tickets, making it less useful as a compiler backend even if it was instant.

This is the old "correctness versus performance" problem and we already know that "faster but wrong" isn't meaningfully faster it's just wrong, anybody can give a wrong answer immediately and so that's not at all useful.

replies(2): >>45113616 #>>45119553 #

3. ObscureScience ◴[03 Sep 25 08:44 UTC] No.45113555[source]▶

>>45113380 (TP) #

[flagged]

replies(2): >>45113655 #>>45113984 #

4. randomNumber7 ◴[03 Sep 25 08:55 UTC] No.45113616[source]▶

>>45113440 #

What is the alternative though for a new language though? Transpiring to C or hacking s.th. by using the GCC backend?

replies(3): >>45113660 #>>45113723 #>>45113897 #

5. testdelacc1 ◴[03 Sep 25 09:01 UTC] No.45113655[source]▶

>>45113555 #

No it wasn’t. There’s no way I can prove that it wasn’t.

But I can prove that this comment wasn’t LLM generated -> fuck you.

(LLMs don’t swear)

replies(1): >>45113830 #

6. tialaramex ◴[03 Sep 25 09:02 UTC] No.45113660{3}[source]▶

>>45113616 #

The easy alternative? There isn't one.

The really difficult thing would be to write a new compiler backend with a coherent IR that everybody understands and you'll stick to. Unfortunately you can be quite certain that after you've done the incredible hard work to build such a thing, a lot of people's assessment of your backend will be:

1. The code produced was 10% slower than LLVM, never use this, speed is all that matters anyway and correctness is irrelevant.

2. This doesn's support the Fongulab Splox ZV406 processor made for six years in the 1980s, whereas LLVM does, therefore this is a waste of time.

replies(2): >>45115237 #>>45116172 #

7. aengelke ◴[03 Sep 25 09:05 UTC] No.45113671[source]▶

>>45113380 (TP) #

In terms of runtime performance, the TPDE-generated code is comparable with and sometimes a bit faster than LLVM -O0.

I agree that front-ends are a big performance problem and both rustc and Clang (especially in C++ mode) are quite slow. For Clang with LLVM -O0, 50-80% is front-end time, with TPDE it's >98%. More work on front-end performance is definitely needed; maybe some things can be learned from Carbon. With mold or lld, I don't think linking is that much of a problem.

We now support most LLVM-IR constructs that are frequently generated by rustc (most notably, vectors). I just didn't get around to actually integrate it into rustc and get performance data.

> The 10-20x improvement described here doesn’t work yet for clang

Not sure what you mean here, TPDE can compile C/C++ programs with Clang-generated LLVM-IR (95% of llvm-test-suite SingleSource/MultiSource, large parts of the LLVM monorepo).

8. IshKebab ◴[03 Sep 25 09:12 UTC] No.45113723{3}[source]▶

>>45113616 #

Or native code generation. Depends on what your performance goals are. It would be cool if there was a standard IR that languages could target - something more suitable than C.

replies(3): >>45113762 #>>45113862 #>>45114658 #

9. ahartmetz ◴[03 Sep 25 09:17 UTC] No.45113762{4}[source]▶

>>45113723 #

Hm. SPIR-V is a standard IR, but AFAIU not really the kind of IR that you need for communication inside a compiler. It wasn't designed for that.

10. tialaramex ◴[03 Sep 25 09:32 UTC] No.45113830{3}[source]▶

>>45113655 #

Maybe HN should add "Don't accuse comments of being LLM generated" to the guidelines, because this sure seems like it'll be in the same category as people moaning that they were downvoted or more closely people saying "Have you read the link?"

replies(2): >>45113898 #>>45114009 #

11. pjmlp ◴[03 Sep 25 09:36 UTC] No.45113862{4}[source]▶

>>45113723 #

Being pursued since UNCOL in 1958, each attempt eventually only works out for a specific set of languages, due to politics or market forces.

12. pjmlp ◴[03 Sep 25 09:40 UTC] No.45113897{3}[source]▶

>>45113616 #

Produce a dumb machine code quality, enough to bootstrapt it, and go from there.

Move away from classical UNIX compiler pipelines.

However in current times, I would rather invest into LLM improvements into generating executables directly, the time to mix AI into compiler development has come, and classical programming languages are just like doing yet another UNIX clone, in terms of value.

replies(1): >>45114370 #

13. testdelacc1 ◴[03 Sep 25 09:41 UTC] No.45113898{4}[source]▶

>>45113830 #

I feel like a fuck you to the accuser is sufficient. It proves that you’re not an LLM and is a reasonable response to an unfounded accusation.

LLMs decline when asked to say fuck you. Gemini: “I am unable to respond to that request.” Claude: “I’d rather not use profanity unprompted.”

But allowing a fuck you would need a modification to the rules anyway, I suppose.

14. tomhow ◴[03 Sep 25 09:58 UTC] No.45113984[source]▶

>>45113555 #

Please don't do this here. If a comment seems unfit for HN, please flag it and email us at hn@ycombinator.com so we can have a look.

15. tomhow ◴[03 Sep 25 10:03 UTC] No.45114009{4}[source]▶

>>45113830 #

We've talked about this but we're not adding it to the guidelines. It's already covered indirectly by the established guidelines, and "case law" (in the form of moderator replies) makes it explicit.

16. taminka ◴[03 Sep 25 11:04 UTC] No.45114370{4}[source]▶

>>45113897 #

mm, a non deterministic compiler with no way to verify correctness, what could go wrong lol

replies(1): >>45115740 #

17. rafaelmn ◴[03 Sep 25 11:51 UTC] No.45114658{4}[source]▶

>>45113723 #

WASM ?

replies(2): >>45115506 #>>45119481 #

18. mamcx ◴[03 Sep 25 13:05 UTC] No.45115237{4}[source]▶

>>45113660 #

Ok, but then if this were done, then you could also emit LLVM after. It probably get worse timings, but, allow to make the transition palatable

19. nerpderp82 ◴[03 Sep 25 13:25 UTC] No.45115506{5}[source]▶

>>45114658 #

WASM !

20. pjmlp ◴[03 Sep 25 13:46 UTC] No.45115740{5}[source]▶

>>45114370 #

Ask C and C++ developers, they are used to it, and still plenty of critical software keeps being written with them.

replies(2): >>45116969 #>>45119546 #

21. derefr ◴[03 Sep 25 14:23 UTC] No.45116172{4}[source]▶

>>45113660 #

> The really difficult thing would be to write a new compiler backend with a coherent IR that everybody understands and you'll stick to.

But why would you bother, when with those same skills and a lot less time, you could fork LLVM, correct its IR semantics yourself (unilaterally), and then push people to use your fork?

(I.e. the EGCS approach to forcing the upstream to fix their shit.)

> This doesn's support the Fongulab Splox ZV406 processor made for six years in the 1980s, whereas LLVM does, therefore this is a waste of time.

AFAIK, the various Fongulab Sploxes that LLVM has targets for, are mostly there to act as forcing functions to keep around features that no public backend would otherwise rely on, because proprietary, downstream backends rely on those features. (See e.g. https://q3k.org/lanai.html — where the downstream ISA of interest is indeed proprietary, but used to be public before an acquisition; so the contributor [Google] upstreamed an implementation of the old public ISA target.)

replies(1): >>45118563 #

22. taminka ◴[03 Sep 25 15:30 UTC] No.45116969{6}[source]▶

>>45115740 #

C and C++ compilers are deterministic and have guarantees of correctness similar to that of other languages (esp ones that share share the same llvm backend)

replies(2): >>45117344 #>>45117394 #

23. pjmlp ◴[03 Sep 25 16:04 UTC] No.45117344{7}[source]▶

>>45116969 #

Provided you're clever enough to avoid UB land mines, and compiler specific implementation non portable behaviours.

24. Kranar ◴[03 Sep 25 16:08 UTC] No.45117394{7}[source]▶

>>45116969 #

C++ compilers are not required to be deterministic and in practice are not, at least as far as "same source code produces same observable behavior". Things that can introduce non-determinism include the order in which symbols are linked, static variable initialization, floating point operations (unless you use strict mode, which is not mandated by the standard), and this is ignoring the obvious stuff like unspecified behavior which is specifically defined as behavior which can differ between different runs on the same system.

Also correctness guarantees? Hahaha... I'll pretend you didn't just claim C++ has correctness guarantees on par with other languages, LLVM or otherwise. C++ gives you next to nothing with respect to correctness guarantees.

25. tialaramex ◴[03 Sep 25 17:45 UTC] No.45118563{5}[source]▶

>>45116172 #

Thanks for the link about Lanai although that site's cert has expired (very recently too) so it's slightly annoying (or of course bad guys are attacking me)

As to the first point, I suspect this is a foundational problem. Like, suppose you realise the concrete used to make a new skyscraper was the wrong mixture. In a sense this is a small change, there's nothing wrong with the elevators, the windows, cabling, furnishing, air conditioning, and so on. But, to "fix" this problem you need to tear down the skyscraper and replace it. Ouch.

I may be wrong, I have never tried to solve this problem. But I fear...

26. IshKebab ◴[03 Sep 25 19:18 UTC] No.45119481{5}[source]▶

>>45114658 #

It's probably not too bad of an option these days tbf. Are there any optimising WASM compilers that can get close to native performance?

27. saagarjha ◴[03 Sep 25 19:25 UTC] No.45119546{6}[source]▶

>>45115740 #

Excellent point, undefined behavior is exactly like an LLM. Surely this is what “alignment” in the standard is talking about.

28. saagarjha ◴[03 Sep 25 19:26 UTC] No.45119553[source]▶

>>45113440 #

Can you point to cases where you feel this has caused harm that you feel outweighs the collective time people spend waiting for LLVM builds?