[EVM] Improvements #759

vladimirradosavljevic · 2025-01-21T13:51:23Z

No description provided.

ARGUMENT instructions should always be located at the beginning of a MF`s entry basic block and be ordered in ascending order of their operand values.

Original idea and some code parts were taken from the Ethereum`s compiler (solc) stackification algorithm.

Signed-off-by: Vladimir Radosavljevic <[email protected]>

This patch adds pseudo jumps, call and ret instructions to fix machine verifier after stackification and to reduce complexity added with bundles. Signed-off-by: Vladimir Radosavljevic <[email protected]>

Signed-off-by: Vladimir Radosavljevic <[email protected]>

- Removed EVMControlFlowGraph/EVMControlFlowGraphBuilder classes - Functionality that analyzes machine CFG and provides query methods was moved to EVMMachineCFGInfo class - Information about MBB terminators is represented in EVMMBBTerminatorsInfo class - StackSlot, Stack and Operation definitions were moved to EVMStackModel - Replaced almost all the std::map/std::set with llvm counterparts in EVMStackLayoutGenerator

…es to be treated as loads (#99999) This change avoids deleting `!willReturn` intrinsics for which the return value is unused when building the SDAG. Currently, calls to read-only intrinsics not marked with `IntrWillReturn` cannot be deleted at the LLVM IR level but may be deleted when building the SDAG. These calls are unsafe to remove from the IR because the functions are `!willReturn` and should also be unsafe to remove fromthe SDAG for the same reason. This change aligns the behavior of the SDAG to that of LLVM IR. This change also requires that intrinsics not have the `Throws` attribute to be treated as loads for the same reason.

Signed-off-by: Vladimir Radosavljevic <[email protected]>

This way, we can share address space AliasAnalysis implementation between the backends. Signed-off-by: Vladimir Radosavljevic <[email protected]>

Signed-off-by: Vladimir Radosavljevic <[email protected]>

This way, we can share SHA3ConstFolding implementation between the backends. Signed-off-by: Vladimir Radosavljevic <[email protected]>

…alls Signed-off-by: Vladimir Radosavljevic <[email protected]>

Signed-off-by: Vladimir Radosavljevic <[email protected]>

Currently, it is only allowed to have memmove where src and dst are from address space 1 (HEAP). Since MemCpyOptPass can change memmove to memcpy, allow this case, and don't issue an error. Signed-off-by: Vladimir Radosavljevic <[email protected]>

Signed-off-by: Vladimir Radosavljevic <[email protected]>

For EVM, transformations to shift are preferable. Signed-off-by: Vladimir Radosavljevic <[email protected]>

Signed-off-by: Vladimir Radosavljevic <[email protected]>

…context instructions Signed-off-by: Vladimir Radosavljevic <[email protected]>

Since these instructions are cheaper than move, it is beneficial to rematerialize them. Signed-off-by: Vladimir Radosavljevic <[email protected]>

This looks like a rather weird change, so let me explain why this isn't as unreasonable as it looks. Let's start with the problem it's solving. ``` define signext i32 @overlap_live_ranges(ptr %arg, i32 signext %arg1) { bb: %i = icmp eq i32 %arg1, 1 br i1 %i, label %bb2, label %bb5 bb2: ; preds = %bb %i3 = getelementptr inbounds nuw i8, ptr %arg, i64 4 %i4 = load i32, ptr %i3, align 4 br label %bb5 bb5: ; preds = %bb2, %bb %i6 = phi i32 [ %i4, %bb2 ], [ 13, %bb ] ret i32 %i6 } ``` Right now, we codegen this as: ``` li a3, 1 li a2, 13 bne a1, a3, .LBB0_2 lw a2, 4(a0) .LBB0_2: mv a0, a2 ret ``` In this example, we have two values which must be assigned to a0 per the ABI (%arg, and the return value). SelectionDAG ensures that all values used in a successor phi are defined before exit the predecessor block. This creates an ADDI to materialize the immediate in the entry block. Currently, this ADDI is not sunk into the tail block because we'd have to split a critical edges to do so. Note that if our immediate was anything large enough to require two instructions we *would* split this critical edge. Looking at other targets, we notice that they don't seem to have this problem. They perform the sinking, and tail duplication that we don't. Why? Well, it turns out for AArch64 that this is entirely an accident of the existance of the gpr32all register class. The immediate is materialized into the gpr32 class, and then copied into the gpr32all register class. The existance of that copy puts us right back into the two instruction case noted above. This change essentially just bypasses this emergent behavior aspect of the aarch64 behavior, and implements the same "always sink immediates" behavior for RISCV as well.

…ges for cheap instructions Signed-off-by: Vladimir Radosavljevic <[email protected]>

Break critical edges in MachineSink optimizations for instructions that are marked with isAsCheapAsAMove in tablegen. Signed-off-by: Vladimir Radosavljevic <[email protected]>

akiramenai · 2025-01-21T14:44:14Z

@vladimirradosavljevic is there a reason to merge it to ef-stackification rather than main?

vladimirradosavljevic · 2025-01-21T14:50:06Z

@vladimirradosavljevic is there a reason to merge it to ef-stackification rather than main?

I tested this and regenerated tests against ef-stackification, so we don't need to do it when we merge ef-stackification. My suggestion is to merge this after ef-stackification is merged into main. Wdyt?

PavelKopyl · 2025-01-21T20:58:26Z

llvm/test/CodeGen/EVM/context-remat.ll

+target datalayout = "E-p:256:256-i256:256:256-S256-a:256:256"
+target triple = "evm"
+
+declare void @use(i256)


Though compilation to asm works well here, attempt to emit an object file will cause a crash. That's because for EVM we do not support compilation units.
I think it's better to define @use as an empty, no-inline function at the point of a future work on assembler.

Ah, didn't know about that, thanks for this info. Even though we have this issue with emitting an object files, do you think it is better to change that over simplicity in tests? Do you think we will use some of these tests for future work on assembler?

I guess we may use these files for asm parser testing, but ok let's leave them as is. It's not a bit deal to change when need.

PavelKopyl and others added 30 commits December 5, 2024 14:19

[EVM] Fix creation of bundles with terminators

2706295

[EVM] Fix ordering of ARGUMENT instructions

f2be3b6

ARGUMENT instructions should always be located at the beginning of a MF`s entry basic block and be ordered in ascending order of their operand values.

[EVM] Add split critical edges pass

ab9f9a9

[EVM] Disable memcopy expansion

df6fe0d

[EVM] Re-enable inlining

7d595b6

[EVM] Add backward propagation (BP) stackification

9909ac7

Original idea and some code parts were taken from the Ethereum`s compiler (solc) stackification algorithm.

[EVM] Support commutable operations in BP stackification algorithm

5d0ad54

[EVM] Remove FunctionInfo struct (#742)

6c9275d

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Introduce pseudo jumps, call and ret instructions

18eab55

This patch adds pseudo jumps, call and ret instructions to fix machine verifier after stackification and to reduce complexity added with bundles. Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Move generation of JUMPDEST to the AsmPrinter phase

916ff45

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Refactor EVMOptimizedCodeTransform

90ceec5

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Refactor EVMAssembly

b979918

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Merge EVMOptimizedCodeTransform and EVMAssembly into one file

3a37352

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Address comments

e059414

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Replace std::containers/algorithms with the llvm's counterparts

f5dde02

[EVM] Add pre-commit test for Update intrinsic attributes

d951427

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Update intrinsic attributes

55fff1e

Signed-off-by: Vladimir Radosavljevic <[email protected]>

Generalize EraVMAAResult and move it to the common part

f158224

This way, we can share address space AliasAnalysis implementation between the backends. Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add pre-commit tests for Add implementation of AliasAnalysis

de4069d

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add implementation of AliasAnalysis

b120e84

Signed-off-by: Vladimir Radosavljevic <[email protected]>

Generalize EraVMSHA3ConstFolding and move it to the common part

a3aeeab

This way, we can share SHA3ConstFolding implementation between the backends. Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add pre-commit test for Add support for constant folding SHA3 c…

7afe1c1

…alls Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add support for constant folding SHA3 calls

8713d09

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add pre-commit test for Don't avoid transformations to shift

cb4860f

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Don't avoid transformations to shift

fc37ca0

For EVM, transformations to shift are preferable. Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add pre-commit test for Set that jumps are expensive

c2ef7f3

Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Set that jumps are expensive

54123d4

Signed-off-by: Vladimir Radosavljevic <[email protected]>

vladimirradosavljevic and others added 5 commits January 17, 2025 16:22

[EVM] Add pre-commit test for Allow rematerialization of some of the …

6e4910d

…context instructions Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Allow rematerialization of some of the context instructions

a77843c

Since these instructions are cheaper than move, it is beneficial to rematerialize them. Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Add pre-commit test for Enable MachineSink to break critical ed…

ab83521

…ges for cheap instructions Signed-off-by: Vladimir Radosavljevic <[email protected]>

[EVM] Enable MachineSink to break critical edges for cheap instructions

f3f9173

Break critical edges in MachineSink optimizations for instructions that are marked with isAsCheapAsAMove in tablegen. Signed-off-by: Vladimir Radosavljevic <[email protected]>

vladimirradosavljevic requested review from akiramenai and PavelKopyl January 21, 2025 13:51

PavelKopyl reviewed Jan 21, 2025

View reviewed changes

akiramenai force-pushed the ef-stackification branch from 3635263 to d6a6e38 Compare January 23, 2025 11:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EVM] Improvements #759

[EVM] Improvements #759

vladimirradosavljevic commented Jan 21, 2025

akiramenai commented Jan 21, 2025

vladimirradosavljevic commented Jan 21, 2025

PavelKopyl Jan 21, 2025

vladimirradosavljevic Jan 22, 2025

PavelKopyl Jan 22, 2025

[EVM] Improvements #759

Are you sure you want to change the base?

[EVM] Improvements #759

Conversation

vladimirradosavljevic commented Jan 21, 2025

akiramenai commented Jan 21, 2025

vladimirradosavljevic commented Jan 21, 2025

PavelKopyl Jan 21, 2025

Choose a reason for hiding this comment

vladimirradosavljevic Jan 22, 2025

Choose a reason for hiding this comment

PavelKopyl Jan 22, 2025

Choose a reason for hiding this comment