AustenX (sometimes just called "Austen") is a parser generator that uses Parsing Expression Grammars (PEGs) and an algorithm derived from packrat parsing. Unlike other PEG parsers, Austen currently uses an initial tokenisation step to convert the input into tokens, which are then handled by the grammar parser. This tokenisation can be done as part of the Austen package, and it allows a particular token to be a member of more than one token class.
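
To illustrate what it means for one token to belong to more than one token class, here is a minimal Java sketch. It is not AustenX's API or generated output: the names `MultiClassTokenSketch`, `TokenClass`, `Token` and `tokenise` are hypothetical, and the whitespace-splitting tokeniser is an assumption made only to keep the example short.

```java
import java.util.ArrayList;
import java.util.EnumSet;
import java.util.List;
import java.util.Set;

public class MultiClassTokenSketch {

    /** Hypothetical token classes; these are not AustenX's own names. */
    enum TokenClass { IDENTIFIER, KEYWORD, NUMBER }

    /** A token carries its text plus every class it belongs to. */
    static final class Token {
        final String text;
        final Set<TokenClass> classes;

        Token(String text, Set<TokenClass> classes) {
            this.text = text;
            this.classes = classes;
        }

        /** True if this token is a member of the given class. */
        boolean is(TokenClass c) {
            return classes.contains(c);
        }
    }

    /** Assumed tokeniser: splits on whitespace, then tests each word against every class. */
    static List<Token> tokenise(String input) {
        List<Token> tokens = new ArrayList<>();
        for (String word : input.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue; // blank input yields a single empty string
            }
            Set<TokenClass> classes = EnumSet.noneOf(TokenClass.class);
            if (word.matches("[0-9]+")) {
                classes.add(TokenClass.NUMBER);
            }
            if (word.matches("[A-Za-z_][A-Za-z0-9_]*")) {
                classes.add(TokenClass.IDENTIFIER);
            }
            if (word.equals("if") || word.equals("else")) {
                classes.add(TokenClass.KEYWORD);
            }
            tokens.add(new Token(word, classes));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Print each token with the set of classes it was assigned.
        for (Token t : tokenise("if x 42")) {
            System.out.println(t.text + " -> " + t.classes);
        }
    }
}
```

In this sketch the word `if` ends up in both the `IDENTIFIER` and `KEYWORD` classes, so a grammar rule written against either class could accept it. The grammar stage then decides which reading applies in context, instead of the tokeniser committing to one class up front.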

In essence, Austen is a tool for generating program code that parses text files, driven by a specialised language that describes the syntax and grammar of the text to be read. Currently, only Java code can be generated.


AustenX has a number of significant features. These can be summarised as follows:



Please see also the common libraries.