In this lab, you extend your code generator to handle expressions, statements, methods without local parameters, and classes without fields.
- Write MiniJava programs and corresponding Jasmin programs which cover the following constructs:
- all kinds of unary and binary expressions,
- object and array creation,
this
expressions,- method calls without arguments,
- block statements,
if
statements,while
statements,- methods without parameters and local variables,
- classes without fields.
- Extend your code generator to handle these constructs.
You need to submit your MiniJava project with a pull request against branch assignment11
on GitHub.
Your GitHub repository contains a step-by-step procedure how to file such a request.
This project should contain a README.md
with a short paragraph explaining the organisation of your Stratego files.
When you solved challenges, you should provide another paragraph sketching your approach.
The deadline for submissions is December 17th, 17:59.
You can earn up to 25 points for your example MiniJava programs and their corresponding Jasmin programs. We will focus on completeness and correctness of your examples. Furthermore, you can earn up to 65 points for your code generator:
- transformation (55 points)
- classes (8 points)
- methods (7 points)
- statements (13 points)
- block statements (2 points)
if
statements (5 points)while
loops (6 points)
- expressions (27 points)
- logic constants (1 point)
- unary expressions (5 points)
- binary expressions, including array subscript (14 points)
- object and array creation (1 point)
this
expressions (1 point)- method calls (5 points)
- challenges (10 points)
- overlays (2 points)
- reusable method descriptors (2 points)
- stack limits (6 points)
You can earn up to 5 points for the organisation of your Stratego files and up to 5 points for the quality of your code. We focus on readability in general, meaningful variable names and the consistent use of Stratego paradigms. We will consider the fact that Stratego is new to you.
Before you extend your code generator, you should come up with small example MiniJava programs, which cover the following constructs:
- all kinds of unary and binary expressions,
- object creation,
this
expressions,- method calls without arguments,
- block statements,
if
statements,while
statements,- methods without parameters and local variables,
- classes without fields.
Write corresponding Jasmin programs, which you expect to be the result of a MiniJava-to-Jasmin compiler. Generate Java class files from them and run them. Improve your programs until they run without errors.
Do not write all the MiniJava programs and the corresponding Jasmin code in one go. Instead, you should extend your code generator once you figured out the Jasmin code for a certain construct. This way, you can go back and forth between manually writing Jasmin code and extending the code generator.
You now need to extend stmt-to-jbc
and exp-to-jbc
to cover all statements except assignments.
-
Provide a rule for
stmt-to-jbc
, which translates a block statement from MiniJava into a sequence of Java bytecode instructions. This rule should callstmt-to-jbc
recursively to translate the inner statements to Java bytecode sequences. -
Provide rules for
exp-to-jbc
, which translate logic constantstrue
andfalse
from MiniJava into sequences of Java bytecode instructions. -
Provide rules for
stmt-to-jbc
, which translateif
andwhile
statements from MiniJava into sequences of Java bytecode instructions. These rules should callstmt-to-jbc
recursively to translate inner statements to Java bytecode sequences. The JVM documentation discusses an efficient translation pattern forwhile
loops.These statements require labels. You can apply
newname
from Stratego's standard library to a string, to obtain a fresh name starting with the given string. -
Provide rules for
exp-to-jbc
, which translate unary and binary expressions from MiniJava into sequences of Java bytecode instructions. These rules should callexp-to-jbc
recursively to translate subexpressions to Java bytecode sequences. Furthermore, they should call a strategyop-to-jbc
to translate unary and binary operators to Java bytecode instructions. The only exception is the&&
operator, which needs to be treated differently, due to its lazy evaluation semantics.
You can test each rule by selecting a code fragment in the MiniJava editor and running your code generation builder.
At this point, you can generate only a single Jasmin class file from the main class of a MiniJava program. In general, you need to generate a class file for each class in a MiniJava program. To achieve this, you need to adapt the interface of your builders. Currently, you return a pair of file name and file content:
generate-jbc: (selected, position, ast, path, project-path) -> (filename, result)
Spoofax takes this information, creates the file if needed, and (over-)writes its content. You can also return a constant None()
:
generate-jbc: (selected, position, ast, path, project-path) -> None()
This tells Spoofax not to create or write any files. Instead, you need to create and write any files on your own.
- Adapt
generate-jbc
to usewrite-file
from the last lab's challenge in order to generate multiple Jasmin class files. If you didn't do this challenge, here's the implementation forwrite-file
:
write-file(|dir):
f@JBCFile(JBCHeader(_, _, JBCClass(_, cn), _, _, _, _, _, _, _), _, _) -> fn
where
fn := $[[dir]/[cn].j];
<fputs; fclose> (<jasmin-pp> f, <fopen> (fn, "w"));
<refresh-workspace-file> fn
-
Provide rules for
class-to-jbc
, which translate classes from MiniJava into Jasmin class files. -
Extend your builder for Java class files to handle multiple class files.
Now you need to define a strategy method-to-jbc
to handle methods without local variables.
-
Provide a rule for
method-to-jbc
, which translates a method without parameters and local variables from MiniJava into a Jasmin method. This rule should callstmt-to-jbc
to translate the statements of the method to a Java bytecode sequence. -
Provide a rule for
exp-to-jbc
, which translatesthis
expressions from MiniJava into sequences of Java bytecode instructions. -
Provide a rule for
exp-to-jbc
, which translates method calls without arguments from MiniJava into sequences of Java bytecode instructions. This rule should callexp-to-jbc
recursively to translate subexpressions to Java bytecode sequences. For this rule, you need to know the name of the class containing the method and the type of the method. You can extract the class name from the URI of the method name. The following strategies might be useful:nabl-uri
extracts the URI from an annotated name.nabl-uri-parent
rewrites an URI to the URI of the enclosing scope.nabl-uri-name
rewrites an URI to the name of the definition it identifies.
You can query the type associated with the method name with
get-type
. This requires access to the index and the task engine, which needs to be setup. You should do this in your builder strategies by adding the following strategy calls:index-setup(|<language>, project-path); task-setup(|project-path)
-
Extend the rule for
class-to-jbc
, which handles empty classes, in order to include code generation for methods. -
Provide a rule for
exp-to-jbc
, which translates object creation expressions into sequences of Java bytecode instructions.
Challenges are meant to distinguish excellent solutions from good solutions. Typically, they are less guided and require more investigation and programming skills.
Code generation rules become hard to read, when you construct large ASTs.
Often, these rules inject only small parts into a skeleton AST.
For example, the only variable parts in the default constructor of a class are its name c
and the name of its super class p
:
JBCMethod(
[PUBLIC()]
, Init()
, JBCMethodDesc([], Void())
, [ JBCLimitStack("1"), JBCLimitLocals("1")
, JBCVarDecl("0", "this", Reference(CRef(c)), LabelRef("start"), LabelRef("end"))
, JBCLabel("start")
, ALOAD_0()
, INVOKESPECIAL(JBCMethodRef(CRef(p), MRef(Init()), JBCMethodDesc([], Void())))
, RETURN()
, JBCLabel("end")
]
)
Furthermore, you might use the same pattern in different rules, e.g. in code generation rules for the main class and for ordinary classes.
One way to avoid code duplication, are helper strategies.
For example, you can have a strategy to-jbc-constructor
which rewrites a pair of class and super class names to the default constructor.
Alternatively, you can use overlays, which allow the specification of pattern abstractions.
Overlays are specified in their own section. They are named, can be parametrised, and can be nested:
overlays
DEFAULT-CONSTRUCTOR(c, p) =
JBCMethod(
[PUBLIC()]
, Init()
, JBCMethodDesc([], Void())
, [ JBCLimitStack("1"), JBCLimitLocals("1")
, THIS-DECLARATION(c)
, JBCLabel("start")
, ALOAD_0()
, INVOKE-CONSTRUCTOR(p)
, RETURN()
, JBCLabel("end")
]
)
You can use overlays like terms in your rules, both for matching and building:
rules
class-to-jbc:
Class(c, e, fs, ms) -> JBCFile(header, jbc-fs, [ DEFAULT-CONSTRUCTOR(c, p) | jbc-ms ])
where ...
You should identify AST patterns in your code generation rules and come up with overlays to improve the readability of these rules.
In Java bytecode, method declarations and method calls include a method descriptor.
You can construct such an descriptor from the type associated with a method name.
In the current setup, you do this construction once for a method declaration and once for each method call.
However, the descriptor for a call is the same as for the declaration the call refers to.
You can avoid the reconstruction by storing a custom property on the method name.
There are two variants to achieve this.
In the first variant, you store method descriptors (MethodDescriptor
).
In the second variant, you store method references (MethodRef
).
First, you need to declare a property descriptor
for namespace Method
in a NaBL file:
properties
descriptor of Method: MethodDescriptor
Next, you need to store the property at a method declaration by defining a rewrite rule for nabl-prop-site
:
nabl-prop-site(|lang, ctx, uris, states, implicits):
Method(ty, mname, param*, var*, stmt*, exp) -> <fail>
where
descr := ... // create descriptor
; <store-descriptor(|ctx, descr)> mname
Finally, you can access the descriptor in your code generation rules with get-descriptor
:
method-to-jbc:
Method(ty, mname, param*, var*, stmt*, exp) -> ...
where
descr := <get-descriptor> mname
; ...
exp-to-jbc:
Call(e, mname, e*) -> ...
where
descr := <get-descriptor> mname
; ...
In an alternative approach, you can store complete method references instead of just method descriptors. The property declaration in NaBL and the rewrite rule to store the property are similar to the first variant. But this variant will require you to extract descriptors from method references at definition sites:
method-to-jbc:
Method(ty, mname, param*, var*, stmt*, exp) -> ...
where
ref := <get-reference> mname
; descr := ... // extract descriptor from ref
; ...
As a benefit, the rule for method calls becomes simpler:
exp-to-jbc:
Call(e, mname, e*) -> ...
where
ref := <get-reference> mname
; ...
A stack limit directive tells the Java Virtual Machine the maximum number of elements at the operand stack.
To give a precise limit, you need to write a strategy stack-limit
that maps MiniJava expressions and statements to a stack limit ( IMPORTANT: Do not do this analysis on the Java bytecode level).
For expressions, you need to consider the number of elements already on the stack before the expression is evaluated. You can either pass this number as a term parameter or adapt intermediate results accordingly.
For statements, you do not need this information since statements should not leave any elements on the stack.
The following strategies might be useful:
inc
rewrites an integer number to its successor.max
rewrites a pair of integer numbers to the maximum of both numbers.addi
rewrites a pair of integer numbers to the sum of both numbers.foldr(s1, s2, f)
right-folds a list.s1
yields the starting point of the folding,s2
is the folding strategy, andf
is applied to each element just before each folding step.