ppci 0.1.0 documentation

Table of contents

Introduction

The ppci project is a compiler, assembler, linker and build-system written entirely in python. The project contains everything from language front-end to code generation. It contains a front-end for the c3 language, can optimize this code and generate ARM-code.

The project contains the following:

  • Language front-ends: Brainfuck, C3 language
  • Various code optimizers
  • Backends for various platforms: ARM, Thumb, Python
  • Assembler
  • Linker

Quickstart

Install ppci in a virtual environment:

$ python -m venv sandbox
$ source sandbox/bin/activate
(sandbox) $ pip install ppci
(sandbox) $ ppci-build.py -h

If ppci installed correcly, you will get a help message.

Download and unzip the examples bundle project here examples.zip.

stm32f4 example

To build the blinky project do the following:

$ cd examples/blinky
$ ppci-build.py
$ ls blinky.hex

Flash the hexfile using your flashtool of choice on the stm32f4discovery board and enjoy the magic.

arduino example

To build the arduino blink led example, first build the example:

$ cd examples/arduino
$ ppci-build.py

Next flash the hexfile using avrdude for example:

$ avrdude -v -P /dev/ttyACM0 -c arduino -p m328p -U flash:w:blinky.hex

Installation

ppci can be installed with pip:

$ pip install ppci

To check if the installation was successful, run the following command:

$ ppci-build.py -h

If all is well, the help message for the ppci build tool appears.

Command line tools

This section describes the usage of some commandline tools installed with ppci.

ppci-build.py

This command can be used to construct build files (see Build system).

Take for example the stm32f4 led project build file:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
<project name="Examples" default="all">
    <import name="ppci.buildtasks" />

    <target name="all" depends="blinky">
    </target>

    <property name="arch" value="arm"/>

    <target name="blinky">
        <assemble source="startup_stm32f4.asm" target="thumb" 
            output="obj/startup.o" />
        <compile target="thumb"
            sources="blinky.c3;arch.c3;../../librt/io.c3"
            includes="stm32f4xx.c3"
            output="obj/blinky.o" />
        <link output="obj/blinky.elf" layout="stm32f4.mmap"
            target="thumb"
            objects="obj/startup.o;obj/blinky.o" />
        <objcopy objectfile="obj/blinky.elf"
            imagename="flash"
            format="hex"
            output="blinky.hex" />
    </target>

</project>

To build this project, run ppci-build.py in the same directory:

$ cd test/data/stm32f4xx
$ ppci-build.py

Or specify the buildfile:

$ ppci-build.py -f test/data/stm32f4xx/build.xml

ppci-c3c.py

Instead of relying on a build system, the C3 language compiler can also be activated stand alone.

$ ppci-c3c.py --target arm examples/snake/game.c3

ppci-asm.py

Invoke the assembler.

ppci-hexutil.py

Utility to handle hex files.

C3 language

Introduction

As an example language, the c3 language was created. As pointed out clearly in c2lang, the C language is widely used, but has some strange contraptions:

  • The include system. This results in lots of code duplication and file creation. Why would you need filenames in source code?
  • The comma statement: x = a(), 2; assigns 2 to x, after calling function a.
  • C is difficult to parse with a simple parser. The parser has to know what a symbol is when it is parsed.
  • Etc...

For these reasons (and of course, for fun), C3 was created.

Example

The hello world example in c3:

module hello;
import io;

function void main()
{
    io.println("Hello world");
}

Language reference

Modules

Modules in C3 live in file, and can be defined in multiple files. Modules can import each other by using the import statement.

For example:

pkg1.c3:

module pkg1;
import pkg2;

pkg2.c3:

module pkg2;
import pkg1;

Functions

Function can be defined by using the function keyword, followed by a type and the function name.

module example;

function void compute()
{
}

function void main()
{
    main();
}

Variables

Variables require the var keyword, and can be either global are function local.

module example;

var int global_var;

function void compute()
{
    var int x = global_var + 13;
    global_var = 200 - x;
}

Types

Types can be specified when a variable is declared, and also typedeffed.

module example;
var int number;
var int* ptr_num;
type int* ptr_num_t;
var ptr_num_t number2;

If statement

The following code example demonstrates the if statement. The else part is optional.

module example;

function void compute(int a)
{
    var int b = 10;
    if (a > 100)
    {
        b += a;
    }

    if (b > 50)
    {
        b += 1000;
    }
    else
    {
        b = 2;
    }
}

While statement

The while statement can be used as follows:

module example;

function void compute(int a)
{
    var int b = 10;
    while (b > a)
    {
        b -= 1;
    }
}

For statement

The for statement works like in C. The first item is initialized before the loop. The second is the condition for the loop. The third part is executed when one run of the loop is done.

module example;

function void compute(int a)
{
    var int b = 0;
    for (b = 100; b > a; b -= 1)
    {
        // Do something here!
    }
}

Build system

It can be convenient to bundle a series of build steps into a script, for example a makefile. Instead of depending on make, yet another build tool was created. The build specification is specified in xml. Much like msbuild and Ant.

A project can contain a build.xml file which describes how the project should be build. The name of the file can be build.xml or another filename. This file can than be given to ppci-build.py.

An example build file:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
<project name="Snake" default="snake">
    <import name="ppci.buildtasks" />

    <target name="snake">
        <assemble source="lm3s6965/startup.asm" target="thumb" 
            output="startup.o" />
        <compile target="thumb" 
            sources="snake/main.c3;lm3s6965/bsp.c3;../librt/io.c3;snake/game.c3"
            output="snake.o" report="snake_report.html"/>
        <link output="snake.elf" layout="lm3s6965/memlayout.mmap" 
            target="thumb"
            objects="startup.o;snake.o" />
        <objcopy objectfile="snake.elf" imagename="flash" format="bin" 
            output="snake.bin" />
    </target>

</project>

Projects

The root element of a build file is the project tag. This tag contains a name and optionally a default target attribute. When no target is given when building the project, the default target is selected.

Targets

Like make, targets can depend on eachother. Then one target is run, the build system makes sure to run depending targets first. Target elements contain a list of tasks to perform.

Tasks

The task elements are contained within target elements. Each task specifies a build action. For example the link task takes multiple object files and combines those into a merged object.

API

Instead of using the commandline, it is also possible to use the build functions of ppci via the buildfunctions.

For example to assemble, compile, link and objcopy, code could look like:

1
2
3
4
5
6
7
 from ppci.buildfunctions import assemble, c3compile, link, objcopy

 march = "thumb"
 o1 = assemble('start.asm', march)
 o2 = c3compile(['source_code.c3'], [], march)
 o3 = link([o2, o1], 'mem.ld', march)
 objcopy(o3, 'code', 'bin', 'output.bin')

buildfunctions module

This module contains a set of handy functions to invoke compilation, linking and assembling.

ppci.buildfunctions.assemble(source, target)

Invoke the assembler on the given source, returns an object containing the output.

ppci.buildfunctions.bf2ir(source, target)

Compile brainfuck source into ir code

ppci.buildfunctions.bfcompile(source, target, reporter=<ppci.reporting.DummyReportGenerator object>)

Compile brainfuck source into binary format for the given target

ppci.buildfunctions.c3compile(sources, includes, target, reporter=<ppci.reporting.DummyReportGenerator object>)

Compile a set of sources into binary format for the given target

ppci.buildfunctions.c3toir(sources, includes, target, reporter=<ppci.reporting.DummyReportGenerator object>)

Compile c3 sources to ir code using the includes and for the given target

ppci.buildfunctions.construct(buildfile, targets=())

Construct the given buildfile. Raise task error if something goes wrong

ppci.buildfunctions.fix_file(f)

Determine if argument is a file like object or make it so!

ppci.buildfunctions.fix_object(o)

Try hard to load an object

ppci.buildfunctions.fix_target(tg)

Try to return an instance of the Target class

ppci.buildfunctions.get_compiler_rt_lib(target)

Gets the runtime for the compiler. Returns an object with the compiler runtime for the given target

ppci.buildfunctions.ir_to_object(ir_modules, target, reporter=<ppci.reporting.DummyReportGenerator object>)

Translate the given list of IR-modules into object code for the given target

ppci.buildfunctions.ir_to_python(ir_modules, f, reporter=<ppci.reporting.DummyReportGenerator object>)

Convert ir-code to python code

Links the iterable of objects into one using the given layout

ppci.buildfunctions.objcopy(obj, image_name, fmt, output_filename)

Copy some parts of an object file to an output

ppci.buildfunctions.optimize(ir_module, reporter=<ppci.reporting.DummyReportGenerator object>)

Run a bag of tricks against the ir-code. This is an in-place operation!

Utilities

Hexfile manipulation

class ppci.utils.hexfile.HexFile

Represents an intel hexfile

>>> from utils import HexFile
>>> h = HexFile()
>>> h.dump()
Hexfile containing 0 bytes
>>> h.addRegion(0, bytes([1,2,3]))
>>> h
Hexfile containing 3 bytes

Yacc

// .. automodule:: yacc

Burg

Bottom up rewrite generator

This script takes as input a description of patterns and outputs a matcher class that can match trees given the patterns.

Patterns are specified as follows:

reg -> ADDI32(reg, reg) 2 (. add NT0 NT1 .)
reg -> MULI32(reg, reg) 3 (. .)

or a multiply add:

reg -> ADDI32(MULI32(reg, reg), reg) 4 (. muladd $1, $2, $3 .)

The general specification pattern is:

[result] -> [tree] [cost] [template code]

Trees

A tree is described using parenthesis notation. For example a node X with three child nodes is described as:

X(a, b, b)

Trees can be nested:

X(Y(a, a), a)

The ‘a’ in the example above indicates an open connection to a next tree pattern.

In the example above ‘reg’ is a non-terminal. ADDI32 is a terminal. non-terminals cannot have child nodes. A special case occurs in this case:

reg -> rc

where ‘rc’ is a non-terminal. This is an example of a chain rule. Chain rules can be used to allow several variants of non-terminals.

The generated matcher uses dynamic programming to find the best match of the tree. This strategy consists of two steps:

  • label: During this phase the given tree is traversed in a bottom up way. each node is labelled with a possible matching rule and the corresponding cost.
  • select: In this step, the tree is traversed again, selecting at each point the cheapest way to get to the goal.

Compiler design

This chapter describes the design of the compiler. The compiler consists a frontend, mid-end and back-end. The frontend deals with source file parsing and semantics checking. The mid-end performs optimizations. This is optional. The back-end generates machine code. The front-end produces intermediate code. This is a simple representation of the source. The back-end can accept this kind of representation.

digraph x {
rankdir="LR"
1 [label="c3 source file"]
10 [label="c3 front end" ]
11 [label="language X front end" ]
20 [label="mid end" ]
30 [label="back end for X86" ]
31 [label="back end for ARM" ]
40 [label="object file"]
1 -> 10
10 -> 20 [label="IR-code"]
11 -> 20 [label="IR-code"]
20 -> 30 [label="IR-code"]
20 -> 31 [label="IR-code"]
30 -> 40
}

C3 Front-end

For the front-end a recursive descent parser is created for the c3 language. This is a subset of the C language with some additional features.

digraph c3 {
rankdir="LR"
1 [label="source text"]
10 [label="lexer" ]
20 [label="parser" ]
40 [label="code generation"]
99 [label="IR-code object"]
1 -> 10
10 -> 20
20 -> 40
40 -> 99
}

class ppci.c3.Lexer(diag)

Generates a sequence of token from an input stream

class ppci.c3.Parser(diag)

Parses sourcecode into an abstract syntax tree (AST)

class ppci.c3.CodeGenerator(diag)

Generates intermediate (IR) code from a package. The entry function is ‘genModule’. The main task of this part is to rewrite complex control structures, such as while and for loops into simple conditional jump statements. Also complex conditional statements are simplified. Such as ‘and’ and ‘or’ statements are rewritten in conditional jumps. And structured datatypes are rewritten.

Type checking is done in one run with code generation.

class ppci.c3.Builder(diag, target)

Generates IR-code from c3 source. Reports errors to the diagnostics system.

Brainfuck frontend

The compiler has a front-end for the brainfuck language.

class ppci.bf.BrainFuckGenerator(target)

Brainfuck is a language that is so simple, the entire front-end can be implemented in one pass.

IR-code

The intermediate representation (IR) of a program de-couples the front end from the backend of the compiler.

See IR-code for details about all the available instructions.

IR-code

Front ends generate this IR-code. Backends transform it into machine code.

The IR-code is implemented in the ir package.

class ppci.ir.Module(name)

Container unit for variables and functions.

class ppci.ir.Function(name, module=None)

Represents a function.

class ppci.ir.Block(name, function=None)

Uninterrupted sequence of instructions with a label at the start.

A block contains a sequence of statements.

class ppci.ir.Load(address, name, ty, volatile=False)

Load a value from memory

class ppci.ir.Store(value, address, volatile=False)

Store a value into memory

class ppci.ir.Const(value, name, ty)

Represents a constant value

class ppci.ir.Binop(a, operation, b, name, ty, loc=None)

Generic binary operation

class ppci.ir.Call(function, arguments, name, ty, loc=None)

Call a function with some arguments

class ppci.ir.Jump(target)

Jump statement to another block within the same function

class ppci.ir.CJump(a, cond, b, lab_yes, lab_no)

Conditional jump to true or false labels.

Optimalization

The IR-code generated by the front-end can be optimized in many ways. The compiler does not have the best way to optimize code, but instead has a bag of tricks it can use.

class ppci.opt.transform.ModulePass

Base class of all optimizing passes. Subclass this class to implement your own optimization pass

class ppci.opt.mem2reg.Mem2RegPromotor

Tries to find alloc instructions only used by load and store instructions and replace them with values and phi nodes

class ppci.opt.transform.LoadAfterStorePass

Remove load after store to the same location.

[x] = a
b = [x]
c = b + 2

transforms into:

[x] = a
c = a + 2
class ppci.opt.transform.DeleteUnusedInstructionsPass

Remove unused variables from a block

class ppci.opt.transform.RemoveAddZeroPass

Replace additions with zero with the value itself. Replace multiplication by 1 with value itself.

class ppci.opt.transform.CommonSubexpressionEliminationPass

Replace common sub expressions with the previously defined one.

Back-end

The back-end is more complicated. There are several steps to be taken here.

  1. Canonicalization
  2. Tree creation
  3. Instruction selection
  4. register allocation
  5. Instruction emission
  6. TODO: Peep hole optimization?

Specification languages

Introduction

DRY

Do not repeat yourself (DRY). This is perhaps the most important idea to keep in mind when writing tools like assemblers, disassemblers, linkers, debuggers and compiler code generators. Writing these tools can be a repetitive and error prone task.

One way to achieve this is to to write a specification file for a specific processor and generate from this file the different tools. The goal of a machine description file is to describe a file and generate tools like assemblers, disassemblers, linkers, debuggers and simulators.

digraph x {
    1 [label="CPU specification"]
    2 [label="spec compiler"]
    10 [label="assembler"]
    11 [label="compiler back-end"]
    12 [label="simulator"]
    1 -> 2
    2 -> 10
    2 -> 11
    2 -> 12
}

Background

There are several existing languages to describe machines in a Domain Specific Language (DSL). Examples of these are:

  • Tablegen (llvm)
  • cgen (gnu)
  • LISA (Aachen)
  • nML (Berlin)
  • SLED (Specifying representations of machine instructions (norman ramsey and Mary F. Fernandez)) [sled]

Concepts to use in this language:

  • Single stream of instructions
  • State stored in memory
  • Pipelining
  • Instruction semantics

Optionally a description in terms of compiler code generation can be attached to this. But perhaps this clutters the description too much and we need to put it elsewhere.

The description language can help to expand these descriptions by expanding the permutations.

Example specifications

For a complete overview of ADL (Architecture Description Language) see [overview].

llvm
def IMUL64rr : RI<0xAF, MRMSrcReg, (outs GR64:$dst),
                                   (ins GR64:$src1, GR64:$src2),
                   "imul{q}\t{$src2, $dst|$dst, $src2}",
                   [(set GR64:$dst, EFLAGS,
                       (X86smul_flag GR64:$src1, GR64:$src2))],
                   IIC_IMUL64_RR>,
                TB;
LISA
<insn> BC
{
  <decode>
  {
    %ID: {0x7495, 0x0483}
    %cond_code: { %OPCODE1 & 0x7F }
    %dest_address: { %OPCODE2 }
  }
  <schedule>
  {
    BC1(PF, w:ebus_addr, w:pc) |
    BC2(PF, w:pc), BC3(IF) |
    BC4(ID) |
    <if> (condition[cond_code])
    {
      BC5(AC) |
      BC6(PF), BC7(ID), BC8(RE) |
      BC9(EX)
    }
    <else>
    {
      k:NOP(IF), BC10(AC, w:pc) |
      BC11(PF), BC12(ID), BC13(RE) |
      k:NOP(ID), BC14(EX) |
      k:NOP(ID), k:NOP(AC) |
      k:NOP(AC), k:NOP(RE) |
      k:NOP(RE), k:NOP(EX) |
      k:NOP(EX)
    }
  }
  <operate>
  {
    BC1.control: { ebus_addr = pc++; }
    BC2.control: { ir = mem[ebus_addr]; pc++ }
    BC10.control: { pc = (%OPCODE2) }
  }
}
SLED
patterns
  nullary is any of [ HALT NEG COM SHL SHR READ WRT NEWL NOOP TRA NOTR ],
    which is op = 0 & adr = { 0 to 10 }
constructors
  IMULb        Eaddr            is      (grp3.Eb;    Eaddr) & IMUL.AL.eAX
nML
type word = card(16)
type absa = card(9)
type disp = int(4)
type off = int(6)
mem PC[1,word]
mem R[16,word]
mem M[65536,word]
var L1[1,word]
var L2[1,word]
var L3[1,word]
mode register(i:card(4)) = R[i]
  syntax = format(”R%s”, i)
  image = format(”%4b”, i)
mode memory = ind | post | abs
mode ind(r:register, d:disp) = M[r+d]
  update = {}
  syntax = format(”@%s(%d)”, r.syntax, d)
  image = format(”0%4b%4b0”, r.image, d)
mode post(r:register, d:disp) = M[r+d]
  update = { r = r + 1; }
  syntax = format(”@%s++(%d)”, r.syntax, d)
  image = format(”0%4b%4b1”, r.image, d)
mode abs(a : absa) = M[a]
  update = {}
  syntax = format(”%d”, a)
  image = format(”1%9b”, a)
op instruction( i : instr )
  syntax = i.syntax
  image = i.image
  action = {
    PC = PC + 1;
    i.action;
  }
op instr = move | alu | jump
op move(lore:card(1), r:register, m:memory)
  syntax = format(”MOVE%d %s %s”, lore, r.syntax, m.syntax)
  image = format(”0%1b%4b%10b”, lore, r.image, m.image)
  action = {
    if ( lore ) then r = m;
    else m = r;
    endif;
    m.update;
  }
op alu(s1:register, s2:register, d:reg, a:aluop)
  syntax = format(”%s %s %s %s”, a.syntax, s1.syntax, s2.syntax, d.syntax)
  image = format(”10%4b%4b%4b%2b”, s1.image, s2.image, d.image, a.image)
  action = {
    L1 = s1; L2 = s2; a.action; d = L3;
  }
op jump(s1:register, s2:register, o:off)
  syntax = format(”JUMP %s %s %d”, s1.syntax, s2.syntax, o)
  image = format(”11%4b%4b%6b”, s1.image, s2.image, o)
  action = {
   if ( s1 >= S2 ) then PC = PC + o;
   endif;
  }
op aluop = and | add | sub | shift;
op and() syntax = ”and” image = ”00” action = { L3 = L1 & L2; }
op add() syntax = ”add” image = ”10” action = { L3 = L1 + L2; }
op sub() syntax = ”sub” image = ”01” action = { L3 = L1 - L2; }
Design

The following information must be captured in the specification file:

  • Assembly textual representation
  • Binary representation
  • Link relocations
  • Mapping from compiler back-end
  • Effects of instruction (semantics)
[sled]http://www.cs.tufts.edu/~nr/toolkit/
[overview]http://esl.cise.ufl.edu/Publications/iee05.pdf

Code generator

Target independent code generator part. The target is provided when the generator is created.

Canonicalize

During this phase, the IR-code is made simpler. Also unsupported operations are rewritten into function calls. For example soft floating point is introduced here.

Tree building

From IR-code a tree is generated which can be used to select instructions.

The process of instruction selection is preceeded by the creation of a selection dag (directed acyclic graph). The dagger take ir-code as input and produces such a dag for instruction selection.

A DAG represents the logic (computation) of a single basic block.

Instruction selection

The instruction selection phase takes care of scheduling and instruction selection. The output of this phase is a one frame per function with a flat list of abstract machine instructions.

To select instruction, a tree rewrite system is used. This is also called bottom up rewrite generator (BURG). See pyburg.

Register allocation

The selected instructions are used to select correct registers.

class ppci.codegen.registerallocator.RegisterAllocator

Target independent register allocator.

Algorithm is iterated register coalescing by Appel and George.

Chaitin’s algorithm: remove all nodes with less than K neighbours. These nodes can be colored when added back.

The process consists of the following steps:

  • build interference graph from the instruction list
  • remove low degree non move related nodes.
  • (optional) coalesc registers to remove redundant moves
  • (optional) spill registers
  • select registers

TODO: Implement different register classes

code emission

Code is emitted using the outputstream class. The assembler and compiler use this class to emit instructions to. The stream can output to object file or to a logger.

class ppci.binutils.outstream.OutputStream

Interface to generator code with.

Development

This chapter descibes how to develop on ppci.

Communication

Join the #ppci irc channel on freenode!

Source code

The sourcecode of the project is located at these repositories:

Continuous integration

The compiler is tested for linux:

and for windows:

Running the testsuite

To run the unit tests with the compiler, use pytest:

$ pytest -v test/

Or use the unittest module:

$ python -m unittest discover -s test

In order to test ppci versus different versions of python, tox is used. To run tox, simply run in the root directory:

$ tox

Building the docs

The docs can be build locally by using sphinx. Make sure that ppci is on your PYTHONPATH

$ export PYTHONPATH=your_ppci_root
$ cd docs
$ sphinx-build -b html . build

Alternatively the tox docs environment can be used:

$ tox -e docs