Plume 🦚

About

Plume is a statically typed programming language that compiles to Web Assembly and ARM. I use this project to mess around with compilers stuff. Plume probably wouldn't be very useful in production.

Here's an overview of the syntax: https://www.eddiehatfield.com/blog/plume-syntax

How things work

Check out src/Parsing for the abstract syntax tree, lexer, and parser, for which I used Haskell's Parsec library.

All of the semantic validation/type-checking takes place in src/Semantics, mainly in Validation.hs. This also builds up a symbol table that I use later on for the code generation phases.

Next, the AST is translated to a three-address code (TAC) IR, which you can see in src/Ir/Tac/Translation.hs.

Finally, this TAC IR is handed off to the backend. If compiling for WebAssembly, the code is translated to an internal representation of WebAssembly copied from the official spec.

Otherwise, if compiling for ARM, the TAC IR virtual registers get mapped to physical registers or stack memory. This is called register allocation, or RegAlloc in the codebase. At the moment, it just spills everything to the stack.

Then, this TAC gets emitted as textual ARM assembly.

Then, it gets linked against libc by compiling with gcc for ARM.

To compile an ARM binary, you'll need arm-linux-gnueabihf-gcc (or at least that's how I do it). Then, run the following:

$ chmod u+x compile-arm.sh 
$ ./compile-arm.sh <plumefile>

If you have your own ARM assembler setup you can use that instead.

Roadmap

AST definition
Parsing
Type-checking
Translation to TAC
Translation from TAC to Wasm
Emit valid Wasm programs
Emit valid ARM programs
More stuff

More stuff:

Adding an ARM backend
- Naive register allocation (spill everything)
- CodeGen ARM assembly
- Smarter register allocation, based on linear-scan
Control flow graphs from TAC

random idea for CFG stuff

I won't lie, the data structure for this is hard to get right. I think I need to build it incrementally to make sure I don't do too premature design. It would be really cool (and maybe necessary) to make it agnostic of the actual instruction data type being used, maybe I could add a type class for telling whether an instruction is a branch or not?

It would be super cool to output some kind of LaTeX visualization for the CFG data structure, which I don't think would be that hard?? It would be super useful for debugging this nonsense.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Plume 🦚

About

How things work

Roadmap

random idea for CFG stuff

Files

README.md

Latest commit

History

README.md

File metadata and controls

Plume 🦚

About

How things work

Roadmap

random idea for CFG stuff