TODO

Essential

  • SSA is broken by simplify_loads() & branches rewriting/simplification

  • attributes of struct, union & enums are ignored (and maybe others too). This requires correct support for __packed which itself needs partial and unaligned loads & stores (wip)

  • add support for bitwise enums (wip)

Documentation

  • document the API

  • document the limitations of modifying ptrlists during list walking

  • document the data structures

  • document flow of data / architecture / code structure

Core

  • if a variable has its address taken but in an unreachable BB then its MOD_ADDRESSABLE may be wrong and it won’t be SSA converted.

    • let kill_insn() check killing of SYMADDR,

    • add the sym into a list and

    • recalculate the addressability before memops’s SSA conversion

  • bool_ctype should be split into internal 1-bit / external 8-bit

Testsuite

  • there are 60 failing tests. They should be fixed (but most are non-trivial to fix).

Misc

  • GCC’s -Wenum-compare / clangs’s -Wenum-conversion -Wassign-enum

  • parse _attribute((fallthrough))

  • add support for format(printf()) (WIP by Ben Dooks)

  • make use of UNDEFs (issues warnings, simplification, … ?)

  • make memory accesses more explicit: add EXPR_ACCESS (wip)

  • it would be nice to do our own parsing of floating point (wip)

  • some header files needed for crypto/ need __vector or __fp16

  • some even need __complex

Optimization

  • a lot of small simplifications are waiting to be upstreamed

  • the domtree need to be rebuilt (or updated)

  • critical edges need to be split

  • the current way of doing CSE uses a lot of time

  • add SSA based DCE

  • add SSA based PRE

  • Add SSA based SCCP

  • add a pass to inline small functions during simplification.

  • use better/more systematic use of internal verification framework

  • tracking of operands size should be improved (WIP)

  • OP_INLINE is sometimes in the way

  • would be nice to strictly separate phases that don’t changes the CFG and thus the dominance tree.

IR

  • OP_SET should return a bool, always

  • add IR instructions for va_arg() & friends

  • add a possibility to import of file in “IR assembly”

  • dump the symtable

  • dump the CFG

LLVM

  • fix …

Internal backends

  • it would be nice the upstream the code generator

  • add a pass to transform 3-addresses code to 2-addresses

  • add some basic register allocation

  • add a pass to order the BBs and changes 2-ways CBR into one-way branches

  • what can be done for x86?

  • add support to add constraints in the MD rules

Longer term/to investigate

  • attributes are represented as ctypes’s alignment, modifiers & contexts but plenty of attributes doesn’t fit, for example they need arguments.

    • format(printf, …),

    • section(“…”)

    • assume_aligned(alignment[, offsert])

    • error(“message”), warning(“message”)

  • should support “-Werror=…” ?

  • All warning messages should include the option how to disable it. For example:

    “warning: Variable length array is used.”

    should be something like:

    “warning: Variable length array is used. (-Wno-vla)”

  • ptrlists must not have elements removed while being iterated; this should somehow be enforced.

  • having ‘struct symbol’ used to represent symbols and types is quite handy but it also creates lots of problems and complications

  • Possible mixup of symbol for a function designator being not a pointer? This seems to make evaluation of function pointers much more complex than needed.

  • extend test-inspect to inspect more AST fields.

  • extend test-inspect to inspect instructions.