Skip to content

Latest commit

 

History

History
68 lines (46 loc) · 3.21 KB

README.md

File metadata and controls

68 lines (46 loc) · 3.21 KB

bootstrappingCIL

Bootstrapping

From a Command Prompt at the project root:

bootstrappingCIL> bscil0\bscil0.exe < bscil1\bscil1.bscil0 > bscil1\bscil1.exe
bootstrappingCIL> bscil1\bscil1.exe < bscil2\bscil2.bscil1 > bscil2\bscil2.exe

And verifying each language can self-host:

bootstrappingCIL> bscil0\bscil0.exe < bscil0\bscil0.bscil0 > bs0.exe
bootstrappingCIL> fc /b bscil0\bscil0.exe bs0.exe

bootstrappingCIL> bscil1\bscil1.exe < bscil1\bscil1.bscil1 > bscil1.exe
bootstrappingCIL> bscil1.exe < bscil1\bscil1.bscil1 > bs1.exe
bootstrappingCIL> fc /b bscil1.exe bs1.exe

Running Tests

Install-Module Pester

$pesterConfig = New-PesterConfiguration
$pesterConfig.Run.SkipRemainingOnFailure = 'Run'
Invoke-Pester -Configuration $pesterConfig

Windows and Ubuntu test status: Build status

Background

The process of writing and running a .NET program has several steps.

First, a high-level program like C# or VB is compiled to Common Intermediate Language (CIL) (or MSIL). Next, the generated CIL is assembled to a binary Portable Executable (PE) file. Finally, to run the binary an execution engine converts CIL to native machine code as the program runs.

Common Intermediate Language (CIL) is the assembly language of the .NET framework, and represents the op codes in the executable. Unlike other assembly languages like x86 or ARM that exactly describe the instructions that will run on the CPU, CIL describes operations in a hypothetical virtual machine

"Bootstrapping" is a term used to describe the origin of a compiler for a language, written in that language itself. While being able to run a C# compiler written in C# may seem like a chicken-and-egg situation, evolution is again the answer. The solution is to make a very simple tool written in an existing language, and then use that iteratively to produce more advanced tools.

The goal of this project is to write an assembler that transforms CIL assembly files into PE files. The goal isn't to simulate the operations in the execution engine; just producing a valid PE file. Also, the goal of bootstrapping the CIL means each iteration of better tools should move closer to proper CIL, and not add higher-level C# features.

It's not reasonable to ask people interested in this project to just trust some executables off the internet. For that reason we'll try to limit the number and complexity of any committed binary executables. Each program is open for review, and includes documentation to explain what all the bytes of the program are doing. But at the same time, .NET is not the best technology to bootstrap to remove all trust dependencies (start from an FPGA if that is your goal).

This should be an educational exploration. Buckle up, and let's get started!

References

EMCA-335 describes in great detail the binary file format assemblers must produce.

Visual Studio's ildasm.exe is helpful to view PE files, but this project doesn't copy any code from other tools.