WACC Compiler

A WACC Compiler written in Rust by Jordan Hall, Bartłomiej Cieślar, Panayiotis Gavriil and Oliver Killane. Compiling WACC programs to ARM assembly.

This compiler was developed from the 24th of January to the 4th of march 2022 as part of the second year WACC Compiler project at Imperial College.

Our final report detailing the compiler design & extensions added to the compiler can be found in the /docs.

What is WACC?

WACC is a basic language in the procedural paradigm. It supports basic if statements, while loops, functions, modules, standard IO and a simple static type system.

begin
    bool collatz(int n) is
        println n;
        if n == 1 then
            return true
        else 
            bool x = false;
            if (n % 2) == 0 then
                x = call collatz(n / 2) # tail call optimised
            else
                x = call collatz(3 * n + 1) # tail call optimised
            fi;
            return x
        fi
    end

    int num = 0;
    print "Enter the value you want to check: ";
    read num;
    println "Checking...\n";
    bool _ = call collatz(num);
    print num;
    println " does eventually go to 1!"
end

Imperial provides access to the WACC reference compiler. While no source code access is provided, it is useful to check the expected behaviour of WACC Programs.

Compiler Design

Why Rust?

We chose Rust for several reasons:

High Performance - this enabled us to perform intensive optimisation passes, generate complex error messages and run large tests quickly.
Memory Safety - enabled helped us work quickly with complex graph data structures without high risk of significant memory/concurrency bugs.
Wonderful Syntax - simple, low-boilerplate, and highly expressive - making it a joy to work with.
Supporting Infrastructure - cargo made managing dependencies a breeze, as well as running our extensive tests. Rustdoc allowed for extensive documentation (updated and hosted on gitlab pages by our CI).
To improve our Rust through a large project!

We decided to use nightly rust in order to take advantage of features such as box patterns (boxes used heavily in our AST), box syntax, let chains, and more.

Design philosophy

We designed our compiler to be as extendable as possible. By separating it into multiple intermediate representations we allow for new backends, optimisations passes, and potentially entire language frontends to be implemented and connected.

This modularity also allowed us to test large compiler components in isolation, with printouts for the various representations useful for debugging.

Optimisations

The main two components of our backend are the ThreeCode (general three-address code) and ArmCode (arm based representation) both of which are control flow graphs.

Architecture non-specific optimisations such as inlining, constant propagation, dead code elimination and tail-call optimisation are done no the ThreeCode representation. This enabled us to inspect optimisations on the threecode independent of the backend arm translation.

Instruction selection (including optimisations for constants - e.g reorganising comparisons), register allocation (using our own, fast allocation algorithm) and stack organisation are all done by the ArmCode.

Get Started

Prerequisites

Cargo. This will allow nightly rust and dependencies to be installed and used.

We heavily recommend using Rust Analyser with VSCode when exploring this repository.

Build

The provided makefile can be used (used by Imperial's Testing environment).

make         # make the debug version - unoptimised (cargo build)
make release # make release (optimised) (cargo build --release)

Or alternatively cargo:

cargo build
cargo build --release

Documentation

cargo doc
sensible-browser target/doc/compile/index.html # open in browser

Test

Note: In order for the behaviour checking tests to run, the arm cross compiler and qemu need to be installed (see Assembling and Emulating)

cargo test

Using the compiler

A command line interface built using clap is provided.

wacc_33 0.6.9
Jordan Hall, Bartłomiej Cieślar, Panayiotis Gavriil and Oliver Killane
WACC compiler

USAGE:
    compile [OPTIONS] <FILE>

ARGS:
    <FILE>    
  
OPTIONS:
    -a, --arm-temp                  Print the backend representations (arm with temporaries)
        --const-branch              Enable constant branch optimization
        --const-prop                Enable constant propagation
        --dead-code                 Enable dead code elimination
    -f, --final-three-code          Print the three code optimised threecode of the program
    -h, --help                      Print help information
    -i, --ir-print                  Print the intermediate representation generated
        --inlining <MODE>           Set the function inlining mode [default: off] [possible values:
                                    off, low, medium, high]
    -o, --outputpath <FILE>         The name of the output file
    -O                              Enable all optimizations
        --tail-call                 run tail call optimisation
    -u, --unoptimised-three-code    Print the three code representation of the program
    -V, --version                   Print version information

Assembling and Emulating

For this project we targetted ARM, specifically the ARM1176JZF-S processor.

Hence while working on x86, we must cross compile compiler generated assembly to ARM, and them emulate.

# ARM cross-compiler
sudo apt install gcc-arm-linux-gnueabi

# Install QEMU for emulation
sudo apt install qemu-user
sudo apt install qemu-system

We can then assemble and execute a program using the following script.

#!/bin/bash
# assemble-emulate: Takes the assembly file name, and executable name as arguments.

# Assemble for the ARM1176JZF-S processor
arm-linux-gnueabi-gcc -o $2 -mcpu=arm1176jzf-s -mtune=arm1176jzf-s $1

# Emulate using qemu (-L flag sets path prefix for elf binary interpreter)
qemu-arm -L /usr/arm-linux-gnueabi/ $2

We can them use this script as:

./compile source.wacc -o compiled.s
assemble-emulate compiled.s executable

User input and output then use the terminal normally.

To provide user input from a file:

assemble-emulate compiled.s executable < inputs.txt

Name		Name	Last commit message	Last commit date
Latest commit History 244 Commits
docs		docs
examples		examples
src		src
stdlib		stdlib
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
Cargo.toml		Cargo.toml
Makefile		Makefile
README.md		README.md
rust-toolchain.toml		rust-toolchain.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WACC Compiler

What is WACC?

Compiler Design

Why Rust?

Design philosophy

Optimisations

Get Started

Prerequisites

Build

Documentation

Test

Using the compiler

Assembling and Emulating

About

Contributors 3

Languages

OliverKillane/WACC-Compiler

Folders and files

Latest commit

History

Repository files navigation

WACC Compiler

What is WACC?

Compiler Design

Why Rust?

Design philosophy

Optimisations

Get Started

Prerequisites

Build

Documentation

Test

Using the compiler

Assembling and Emulating

About

Topics

Resources

Stars

Watchers

Forks

Contributors 3

Languages