Proposal: Introduce Intermediate Representation (IR) for Linter #68

notJoon · 2024-09-14T08:55:01Z

Background

Currently, tlin directly uses the go/ast and go/parser pacakge to analyze and process the gno code. While this approach has served us very well, it presents challenges in terms of scalability, maintability, and future language divergence.

Proposal

I propose introducing an Intermediate Representation (IR) layer between AST and our linter rulws. This IR will abstract away the specifics of AST, providing a more flexible and maintainable foundation for our linter.

Pros and Cons

Pros

Abstraction: Simplifies rule implementation by hiding AST complexities.
Maintainability: Easier to update and modify rules without deep AST background knowledge.
Language Divergence: Allows for easier adaptation as gno evolves differently from Go.

Cons

Initial Complexity: Require upfront design and implementation effort.
Learning Curve: Need to learn the new IR structure.
Potential Performance Overhead: Additional layer may introduce some performance cost.

Proof of Concept Implementation

Here's basic PoC for the IR structure and usage:

package linter

import "go/token"

// Node represents any node in our IR
type Node interface {
    Pos() token.Pos
    End() token.Pos
}

// Expression represents any expression in our IR
type Expression interface {
    Node
    exprNode()
}

// Statement represents any statement in our IR
type Statement interface {
    Node
    stmtNode()
}

// Concrete IR nodes
type (
    Ident struct {
        NamePos token.Pos
        Name    string
    }

    FuncDecl struct {
        Name *Ident
        Body *BlockStmt
    }

    BlockStmt struct {
        Statements []Statement
    }
)

// Implement necessary interface methods
func (i *Ident) Pos() token.Pos     { return i.NamePos }
func (i *Ident) End() token.Pos     { return token.Pos(int(i.NamePos) + len(i.Name)) }
func (i *Ident) exprNode()          {}

func (f *FuncDecl) Pos() token.Pos  { return f.Name.Pos() }
func (f *FuncDecl) End() token.Pos  { return f.Body.End() }
func (f *FuncDecl) stmtNode()       {}

// AST to IR conversion (simplified)
func AST2IR(file *ast.File) *IR {
    // Implementation
}

// Rule interface
type Rule interface {
    Check(node Node) []Issue
}

// Sample rule implementation
type UnusedVariableRule struct{}

func (r *UnusedVariableRule) Check(node Node) []Issue {
    // Implementation using IR
}

// Linter execution
func RunLinter(ir *IR, rules []Rule) []Issue {
    var issues []Issue
    for _, rule := range rules {
        issues = append(issues, applyRule(ir, rule)...)
    }
    return issues
}

Handling Language Divergence

As out language evolves and potentially diverges from Go, the IR approach offers significant advantages:

Abstraction Layer: The IR serves as a buffer between the language-specific AST and out linter rulws. When language changes occur, we can update the IR generation process without modifying all the rules.
Custom Nodes: We can introduce custom IR nodes for new language features without waiting for or relying on updates to the go's package.
Gradual Migration: We can incrementally move away from go/ast (and more related packages) by implementing our own parser that generates our IR directly, allowing for a smooth transition as the language diverge.
Feature Toogling: The IR can include flags or variants to represent language specific feeatures, allowing rules to adapt their behavior based on the target language version.

The text was updated successfully, but these errors were encountered:

notJoon added the C-propossal Category: sumitting RFC or proposal label Sep 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: Introduce Intermediate Representation (IR) for Linter #68

Proposal: Introduce Intermediate Representation (IR) for Linter #68

notJoon commented Sep 14, 2024

Proposal: Introduce Intermediate Representation (IR) for Linter #68

Proposal: Introduce Intermediate Representation (IR) for Linter #68

Comments

notJoon commented Sep 14, 2024

Background

Proposal

Pros and Cons

Pros

Cons

Proof of Concept Implementation

Handling Language Divergence