boulder: Numerous toolchain changes #381

ReillyBrogan · 2025-01-01T02:13:39Z

Note: I bundled these together because most of them are pretty small and I worked on all of them at the same time, but I can split them out if needed (they're separate commits per-change).

Changes:

Add build-id flags and enable it by default
Default to compressing debug symbols with zstd. This reduces on-disk space of debug symbols significantly at the cost of higher package sizes.
Define -flto=%(jobs) for GCC as "thin" LTO since it behaves roughly similar to the LLVM equivalent. Add -flto=%(jobs) -flto-partition=one as a new "full" variant of LTO for GCC.
Add -ffat-lto-objects flags and enable them by default. These are needed if building static archives with LTO that are used by the alternate toolchain (IE if the static archives are built with GCC then fat-lto-objects is needed in order to link them with LLVM). Interestingly this also reduced binary sizes with LTO over not having the flag, likely because more information was preserved to the linker.
Add some error flags recommended by Gentoo which can provide an early warning that LTO is likely to result in runtime issues. Enable by default.
Enable thin LTO by default. Several other distros have switched to this as the default by now and it's better to deal with the fallout of it now rather than when we have a few thousand extra packages.
Add direct support for mold including automatically adding it as a builddep and setting the appropriate error flags
Changed many dependencies added by macros to be binary(*) dependencies instead of named ones.

Testing:

Did many, many builds between LLVM/GCC/Mold with the new flags in order to ensure that they worked everywhere as expected.

joebonrichie · 2025-01-06T14:22:24Z

LGTM

tarkah

LGTM

ikeycode · 2025-01-09T00:23:06Z

crates/stone_recipe/src/lib.rs

@@ -48,6 +48,8 @@ pub struct Recipe {
    pub tuning: Vec<KeyValue<Tuning>>,
    #[serde(default, deserialize_with = "stringy_bool")]
    pub emul32: bool,
+    #[serde(default, deserialize_with = "stringy_bool")]
+    pub mold: bool,


what happens when you want to support another experimental linker? Another root-level key? This part needs more consideration before changing here.

ikeycode · 2025-01-09T00:23:50Z

boulder/src/build/job/phase.rs

        flags
            .iter()
            .filter_map(|flag| flag.get(tuning::CompilerFlag::Rust, toolchain)),
    );

+    if recipe.parsed.mold {


surely we can use substitutions for the ld plugin and not export full format strings here

Not sure what you mean? We only need to add the flag for mold, not substitute an existing one.

ikeycode · 2025-01-09T00:25:14Z

boulder/data/macros/arch/base.yaml

+            c         : "-Werror=odr -Werror=strict-aliasing"
+            cxx       : "-Werror=odr -Werror=strict-aliasing"
+
+    # Enable build-id (ON)


im not really happy with this one - some times we'll encounter packages that ignore the LDFLAGS partly or fully, and sometimes we do it on purpose (for example bootstrap of toolchain) - so we need the build-id guarantee baked into the toolchain itself. We also rely on build-id for generation of debuginfo assets so I don't actually see a reason to support disabling this? Lastly our current default is actually xxhash, why the undocumented switch to sha1?

Way ahead of you on baking it into the toolchain:

LLVM: https://github.com/serpent-os/recipes/blob/llvm-cleanup/l/llvm/pkg/patches/config/0001-lld-Always-enable-build-id-and-use-20-byte-hashes.patch

Mold: https://github.com/serpent-os/recipes/blob/blake3/m/mold/pkg/0001-Set-default-settings-for-compression-and-build.patch

binutils: (Has the upstream default of --build-id=sha1 already).

The purpose of allowing --build-id=none to be set is moreso a just-in-case thing. Since we're setting it at the toolchain level having --build-id=sha1 in the flags is mostly so that packagers can see that it's there, and since it's baked in just removing the flag isn't going to disable it in the toolchain. We need an explicit --build-id=none to turn it off.

I don't really see it being necessary to disable, but some things like golang handle their own buildid generation and it's better to have the escape than not I think.

Lastly our current default is actually xxhash, why the undocumented switch to sha1?

We discussed this in Matrix already but I'm going to open up an issue to track these flag changes.

Signed-off-by: Reilly Brogan <[email protected]>

Changes: - Move "fat" GCC LTO configuration to "thin" as it's a better match in terms of what it does - Add a "fat" GCC LTO configuration that is much more similar to fat LTO with LLVM Signed-off-by: Reilly Brogan <[email protected]>

These flags enable both traditional bitcode and LLVM/GCC intermediate object language to be built at the same time. This is necessary when using a static library built with LTO on LLVM/GCC with the other. Also, from some testing this also seems to improve the ability for LTO to minimize the binary size further. This _does_ increase the size of the static archive files but not to a massive degree and it allows for more low-level LTO with packages that use the same toolchain and use static archives built in another package. Tested with both LLVM and GCC, full LTO, thin LTO, and no LTO. Mold was also tested with both LLVM and GCC. Signed-off-by: Reilly Brogan <[email protected]>

As recommended by the Gentoo wiki, this adds a number of compiler flags that indicate a high probability of runtime-related LTO issues. They are enabled by default. Signed-off-by: Reilly Brogan <[email protected]>

Thin LTO for both GCC and LLVM has relatively little build-time loss compared to non-LTO and consistently sees improvements to binary size and runtime memory use. Enable it by default following in the footsteps of several other distributions that have done the same. Signed-off-by: Reilly Brogan <[email protected]>

…efault These reduce on-disk size of debug symbols significantly. They are fully supported by LLVM, GCC/Binutils, Mold, and elfutils. Signed-off-by: Reilly Brogan <[email protected]>

Signed-off-by: Reilly Brogan <[email protected]>

Mold is often noticeably faster than LLD (and much faster than BFD) and is already used by numerous packages for that reason. Make it more ergonomic to use by adding a top-level `mold` key which adds it as a builddep and sets environmental flags appropriately to use it. Tested with both the gnu and LLVM toolchains. Signed-off-by: Reilly Brogan <[email protected]>

Signed-off-by: Reilly Brogan <[email protected]>

ikeycode · 2025-01-17T02:31:47Z

Per matrix convo toolchain was just a busted-ass notion and we'll clean up in port to KDL

ReillyBrogan requested review from ikeycode, ermo and tarkah as code owners January 1, 2025 02:13

tarkah approved these changes Jan 6, 2025

View reviewed changes

ikeycode requested changes Jan 9, 2025

View reviewed changes

ReillyBrogan mentioned this pull request Jan 10, 2025

llvm: Cleanups and improvements serpent-os/recipes#535

Merged

ReillyBrogan added 9 commits January 13, 2025 20:35

boulder-data: Add build-id and enable by default

62c49c7

Signed-off-by: Reilly Brogan <[email protected]>

boulder-data: Update GCC LTO configuration

1a49ebc

Changes: - Move "fat" GCC LTO configuration to "thin" as it's a better match in terms of what it does - Add a "fat" GCC LTO configuration that is much more similar to fat LTO with LLVM Signed-off-by: Reilly Brogan <[email protected]>

boulder-data: Add early-warning error flags for LTO

ac82602

As recommended by the Gentoo wiki, this adds a number of compiler flags that indicate a high probability of runtime-related LTO issues. They are enabled by default. Signed-off-by: Reilly Brogan <[email protected]>

boulder-data: Add flags for compressing debug symbols and enable by d…

3429204

…efault These reduce on-disk size of debug symbols significantly. They are fully supported by LLVM, GCC/Binutils, Mold, and elfutils. Signed-off-by: Reilly Brogan <[email protected]>

boulder-data: Remove trivial null character

ffe8a27

Signed-off-by: Reilly Brogan <[email protected]>

boulder-data: Use binary(*) deps instead of named ones

0a484cd

Signed-off-by: Reilly Brogan <[email protected]>

ReillyBrogan force-pushed the tooling-flags-2025 branch from 691374e to 0a484cd Compare January 14, 2025 02:35

ikeycode approved these changes Jan 17, 2025

View reviewed changes

ikeycode merged commit ecbd710 into main Jan 17, 2025
2 checks passed

ikeycode deleted the tooling-flags-2025 branch January 17, 2025 02:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

boulder: Numerous toolchain changes #381

boulder: Numerous toolchain changes #381

ReillyBrogan commented Jan 1, 2025

joebonrichie commented Jan 6, 2025

tarkah left a comment

ikeycode Jan 9, 2025

ikeycode Jan 9, 2025

ReillyBrogan Jan 10, 2025

ikeycode Jan 9, 2025

ReillyBrogan Jan 10, 2025

ikeycode commented Jan 17, 2025

boulder: Numerous toolchain changes #381

boulder: Numerous toolchain changes #381

Conversation

ReillyBrogan commented Jan 1, 2025

joebonrichie commented Jan 6, 2025

tarkah left a comment

Choose a reason for hiding this comment

ikeycode Jan 9, 2025

Choose a reason for hiding this comment

ikeycode Jan 9, 2025

Choose a reason for hiding this comment

ReillyBrogan Jan 10, 2025

Choose a reason for hiding this comment

ikeycode Jan 9, 2025

Choose a reason for hiding this comment

ReillyBrogan Jan 10, 2025

Choose a reason for hiding this comment

ikeycode commented Jan 17, 2025