antlr

Class CppCodeGenerator


public class CppCodeGenerator
extends CodeGenerator

Generate MyParser.cpp, MyParser.hpp, MyLexer.cpp, MyLexer.hpp and MyParserTokenTypes.hpp

Field Summary

(package private) boolean
DEBUG_CPP_CODE_GENERATOR
protected static String
NONUNIQUE
(package private) int
astVarNumber
static int
caseSizeThreshold
(package private) String
commonExtraArgs
(package private) String
commonExtraParams
(package private) String
commonLocalVars
(package private) String
currentASTResult
(package private) RuleBlock
currentRule
(package private) Hashtable
declaredASTVariables
Used to keep track of which AST variables have been defined in a rule (except for the #rule_name and #rule_name_in var's
(package private) String
exceptionThrown
protected boolean
genAST
protected boolean
genHashLines
(package private) String
labeledElementASTInit
(package private) String
labeledElementASTType
(package private) String
labeledElementInit
(package private) String
labeledElementType
(package private) String
lt1Value
protected boolean
noConstructors
protected String
outputFile
protected int
outputLine
protected boolean
saveText
protected int
syntacticPredLevel
(package private) String
throwNoViable
(package private) Hashtable
treeVariableMap
(package private) boolean
usingCustomAST

Fields inherited from class antlr.CodeGenerator

BITSET_OPTIMIZE_INIT_THRESHOLD, DEBUG_CODE_GENERATOR, DEFAULT_BITSET_TEST_THRESHOLD, DEFAULT_MAKE_SWITCH_THRESHOLD, TokenTypesFileExt, TokenTypesFileSuffix, analyzer, antlrTool, behavior, bitsetTestThreshold, bitsetsUsed, charFormatter, currentOutput, grammar, makeSwitchThreshold, tabs

Constructor Summary

CppCodeGenerator()
Create a C++ code-generator using the given Grammar.

Method Summary

protected void
_print(String s)
Output a String to the currentOutput stream.
protected void
_printAction(String s)
Print an action without leading tabs, attempting to preserve the current indentation level for multi-line actions Ignored if string is null.
protected void
_println(String s)
Output a String followed by newline, to the currentOutput stream.
protected int
addSemPred(String predicate)
Adds a semantic predicate string to the sem pred vector These strings will be used to build an array of sem pred names when building a debugging parser.
protected int
countLines(String s)
void
exitIfError()
void
gen()
Generate the parser, lexer, treeparser, and token types in C++
void
gen(ActionElement action)
Generate code for the given grammar element.
void
gen(AlternativeBlock blk)
Generate code for the given grammar element.
void
gen(BlockEndElement end)
Generate code for the given grammar element.
void
gen(CharLiteralElement atom)
Generate code for the given grammar element.
void
gen(CharRangeElement r)
Generate code for the given grammar element.
void
gen(LexerGrammar g)
Generate the lexer C++ files
void
gen(OneOrMoreBlock blk)
Generate code for the given grammar element.
void
gen(ParserGrammar g)
Generate the parser C++ file
void
gen(RuleRefElement rr)
Generate code for the given grammar element.
void
gen(StringLiteralElement atom)
Generate code for the given grammar element.
void
gen(TokenRangeElement r)
Generate code for the given grammar element.
void
gen(TokenRefElement atom)
Generate code for the given grammar element.
void
gen(TreeElement t)
Generate code for the given grammar element.
void
gen(TreeWalkerGrammar g)
Generate the tree-parser C++ files
void
gen(WildcardElement wc)
Generate code for the given grammar element.
void
gen(ZeroOrMoreBlock blk)
Generate code for the given grammar element.
protected void
genASTDeclaration(AlternativeElement el)
protected void
genASTDeclaration(AlternativeElement el, String node_type)
protected void
genASTDeclaration(AlternativeElement el, String var_name, String node_type)
protected void
genAlt(Alternative alt, AlternativeBlock blk)
Generate an alternative.
protected void
genBitsets(Vector bitsetList, int maxVocabulary, String prefix)
Generate all the bitsets to be used in the parser or lexer Generate the raw bitset data like "long _tokenSet1_data[] = {...};" and the BitSet object declarations like "BitSet _tokenSet1 = new BitSet(_tokenSet1_data);" Note that most languages do not support object initialization inside a class definition, so other code-generators may have to separate the bitset declarations from the initializations (e.g., put the initializations in the generated constructor instead).
protected void
genBitsetsHeader(Vector bitsetList, int maxVocabulary)
protected void
genBlockInitAction(AlternativeBlock blk)
Generate the initaction for a block, which may be a RuleBlock or a plain AlternativeBLock.
protected void
genBlockPreamble(AlternativeBlock blk)
Generate the header for a block, which may be a RuleBlock or a plain AlternativeBlock.
void
genBody(LexerGrammar g)
void
genBody(ParserGrammar g)
void
genBody(TreeWalkerGrammar g)
protected void
genCases(BitSet p)
Generate a series of case statements that implement a BitSet test.
CppBlockFinishingInfo
genCommonBlock(AlternativeBlock blk, boolean noTestForSingle)
Generate common code for a block of alternatives; return a postscript that needs to be generated at the end of the block.
protected void
genHeader(String fileName)
Generate a header that is common to all C++ files
void
genInclude(LexerGrammar g)
void
genInclude(ParserGrammar g)
void
genInclude(TreeWalkerGrammar g)
void
genInitFactory(Grammar g)
void
genLineNo(GrammarElement el)
Generate a #line or // line depending on options
void
genLineNo(Token t)
Generate a #line or // line depending on options
void
genLineNo(int line)
Generate a #line or // line depending on options
void
genLineNo2()
Generate a #line or // line depending on options
protected void
genMatch(GrammarAtom atom)
protected void
genMatch(BitSet b)
protected void
genMatchUsingAtomText(GrammarAtom atom)
protected void
genMatchUsingAtomTokenType(GrammarAtom atom)
void
genNextToken()
Generate the nextToken() rule.
void
genRule(RuleSymbol s, boolean startSymbol, int ruleNum, String prefix)
Gen a named rule block.
void
genRuleHeader(RuleSymbol s, boolean startSymbol)
protected void
genSemPred(String pred, int line)
protected void
genSemPredMap(String prefix)
Write an array of Strings which are the semantic predicate expressions.
protected void
genSynPred(SynPredBlock blk, String lookaheadExpr)
void
genTokenStrings(String prefix)
Generate a static array containing the names of the tokens, indexed by the token type values.
protected void
genTokenTypes(TokenManager tm)
Generate the token types C++ file
String
getASTCreateString(String str)
Get a string for an expression to generate creating of an AST node
String
getASTCreateString(GrammarAtom atom, String str)
Get a string for an expression to generate creating of an AST node
String
getASTCreateString(Vector v)
Get a string for an expression to generate creation of an AST subtree.
protected String
getLookaheadTestExpression(Alternative alt, int maxDepth)
Generate a lookahead test expression for an alternate.
protected String
getLookaheadTestExpression(Lookahead[] look, int k)
protected String
getLookaheadTestTerm(int k, BitSet p)
Generate a depth==1 lookahead test expression given the BitSet.
String
getRangeExpression(int k, int[] elems)
Return an expression for testing a contiguous renage of elements
protected boolean
lookaheadIsEmpty(Alternative alt, int maxDepth)
Is the lookahead for this alt empty?
String
mapTreeId(String idParam, ActionTransInfo transInfo)
Map an identifier to it's corresponding tree-node variable.
void
printAction(Token t)
Print an action stored in a token surrounded by #line stuff
void
printHeaderAction(String name)
Print a header action by #line stuff also process any tree construction
protected void
println(String s)
Output tab indent followed by a String followed by newline, to the currentOutput stream.
protected String
processActionForSpecialSymbols(String actionStr, int line, RuleBlock currentRule, ActionTransInfo tInfo)
Lexically process tree-specifiers in the action.
String
processStringForASTConstructor(String str)
Process a string for an simple expression for use in xx/action.g it is used to cast simple tokens/references to the right type for the generated language.

Methods inherited from class antlr.CodeGenerator

_print, _printAction, _println, decodeLexerRuleName, elementsAreRange, encodeLexerRuleName, extractIdOfAction, extractIdOfAction, extractTypeOfAction, extractTypeOfAction, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, gen, genTokenInterchange, getASTCreateString, getASTCreateString, getBitsetName, getFIRSTBitSet, getFOLLOWBitSet, mapTreeId, markBitsetForGen, print, printAction, printTabs, println, processActionForSpecialSymbols, processStringForASTConstructor, removeAssignmentFromDeclaration, reverseLexerRuleName, setAnalyzer, setBehavior, setGrammar, setTool

Field Details

DEBUG_CPP_CODE_GENERATOR

(package private)  boolean DEBUG_CPP_CODE_GENERATOR

NONUNIQUE

protected static final String NONUNIQUE

astVarNumber

(package private)  int astVarNumber

caseSizeThreshold

public static final int caseSizeThreshold
Field Value:
127

commonExtraArgs

(package private)  String commonExtraArgs

commonExtraParams

(package private)  String commonExtraParams

commonLocalVars

(package private)  String commonLocalVars

currentASTResult

(package private)  String currentASTResult

currentRule

(package private)  RuleBlock currentRule

declaredASTVariables

(package private)  Hashtable declaredASTVariables
Used to keep track of which AST variables have been defined in a rule (except for the #rule_name and #rule_name_in var's

exceptionThrown

(package private)  String exceptionThrown

genAST

protected boolean genAST

genHashLines

protected boolean genHashLines

labeledElementASTInit

(package private)  String labeledElementASTInit

labeledElementASTType

(package private)  String labeledElementASTType

labeledElementInit

(package private)  String labeledElementInit

labeledElementType

(package private)  String labeledElementType

lt1Value

(package private)  String lt1Value

noConstructors

protected boolean noConstructors

outputFile

protected String outputFile

outputLine

protected int outputLine

saveText

protected boolean saveText

syntacticPredLevel

protected int syntacticPredLevel

throwNoViable

(package private)  String throwNoViable

treeVariableMap

(package private)  Hashtable treeVariableMap

usingCustomAST

(package private)  boolean usingCustomAST

Constructor Details

CppCodeGenerator

public CppCodeGenerator()
Create a C++ code-generator using the given Grammar. The caller must still call setTool, setBehavior, and setAnalyzer before generating code.

Method Details

_print

protected void _print(String s)
Output a String to the currentOutput stream. Ignored if string is null.
Overrides:
_print in interface CodeGenerator
Parameters:
s - The string to output

_printAction

protected void _printAction(String s)
Print an action without leading tabs, attempting to preserve the current indentation level for multi-line actions Ignored if string is null.
Overrides:
_printAction in interface CodeGenerator
Parameters:
s - The action string to output

_println

protected void _println(String s)
Output a String followed by newline, to the currentOutput stream. Ignored if string is null.
Overrides:
_println in interface CodeGenerator
Parameters:
s - The string to output

addSemPred

protected int addSemPred(String predicate)
Adds a semantic predicate string to the sem pred vector These strings will be used to build an array of sem pred names when building a debugging parser. This method should only be called when the debug option is specified

countLines

protected int countLines(String s)

exitIfError

public void exitIfError()

gen

public void gen()
Generate the parser, lexer, treeparser, and token types in C++
Overrides:
gen in interface CodeGenerator

gen

public void gen(ActionElement action)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(AlternativeBlock blk)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:
blk - The "x|y|z|..." block to generate

gen

public void gen(BlockEndElement end)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(CharLiteralElement atom)
Generate code for the given grammar element. Only called from lexer grammars.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(CharRangeElement r)
Generate code for the given grammar element. Only called from lexer grammars.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(LexerGrammar g)
            throws IOException
Generate the lexer C++ files
Overrides:
gen in interface CodeGenerator

gen

public void gen(OneOrMoreBlock blk)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:
blk - The (...)+ block to generate

gen

public void gen(ParserGrammar g)
            throws IOException
Generate the parser C++ file
Overrides:
gen in interface CodeGenerator

gen

public void gen(RuleRefElement rr)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(StringLiteralElement atom)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(TokenRangeElement r)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(TokenRefElement atom)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(TreeElement t)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:

gen

public void gen(TreeWalkerGrammar g)
            throws IOException
Generate the tree-parser C++ files
Overrides:
gen in interface CodeGenerator

gen

public void gen(WildcardElement wc)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:
wc - The wildcard element to generate

gen

public void gen(ZeroOrMoreBlock blk)
Generate code for the given grammar element.
Overrides:
gen in interface CodeGenerator
Parameters:
blk - The (...)* block to generate

genASTDeclaration

protected void genASTDeclaration(AlternativeElement el)

genASTDeclaration

protected void genASTDeclaration(AlternativeElement el,
                                 String node_type)

genASTDeclaration

protected void genASTDeclaration(AlternativeElement el,
                                 String var_name,
                                 String node_type)

genAlt

protected void genAlt(Alternative alt,
                      AlternativeBlock blk)
Generate an alternative.
Parameters:
alt - The alternative to generate
blk - The block to which the alternative belongs

genBitsets

protected void genBitsets(Vector bitsetList,
                          int maxVocabulary,
                          String prefix)
Generate all the bitsets to be used in the parser or lexer Generate the raw bitset data like "long _tokenSet1_data[] = {...};" and the BitSet object declarations like "BitSet _tokenSet1 = new BitSet(_tokenSet1_data);" Note that most languages do not support object initialization inside a class definition, so other code-generators may have to separate the bitset declarations from the initializations (e.g., put the initializations in the generated constructor instead).
Parameters:
bitsetList - The list of bitsets to generate.
maxVocabulary - Ensure that each generated bitset can contain at least this value.
prefix - string glued in from of bitset names used for namespace qualifications.

genBitsetsHeader

protected void genBitsetsHeader(Vector bitsetList,
                                int maxVocabulary)

genBlockInitAction

protected void genBlockInitAction(AlternativeBlock blk)
Generate the initaction for a block, which may be a RuleBlock or a plain AlternativeBLock.

genBlockPreamble

protected void genBlockPreamble(AlternativeBlock blk)
Generate the header for a block, which may be a RuleBlock or a plain AlternativeBlock. This generates any variable declarations and syntactic-predicate-testing variables.

genBody

public void genBody(LexerGrammar g)
            throws IOException

genBody

public void genBody(ParserGrammar g)
            throws IOException

genBody

public void genBody(TreeWalkerGrammar g)
            throws IOException

genCases

protected void genCases(BitSet p)
Generate a series of case statements that implement a BitSet test.
Parameters:
p - The Bitset for which cases are to be generated

genCommonBlock

public CppBlockFinishingInfo genCommonBlock(AlternativeBlock blk,
                                            boolean noTestForSingle)
Generate common code for a block of alternatives; return a postscript that needs to be generated at the end of the block. Other routines may append else-clauses and such for error checking before the postfix is generated. If the grammar is a lexer, then generate alternatives in an order where alternatives requiring deeper lookahead are generated first, and EOF in the lookahead set reduces the depth of the lookahead.
Parameters:
blk - The block to generate
noTestForSingle - If true, then it does not generate a test for a single alternative.

genHeader

protected void genHeader(String fileName)
Generate a header that is common to all C++ files

genInclude

public void genInclude(LexerGrammar g)
            throws IOException

genInclude

public void genInclude(ParserGrammar g)
            throws IOException

genInclude

public void genInclude(TreeWalkerGrammar g)
            throws IOException

genInitFactory

public void genInitFactory(Grammar g)

genLineNo

public void genLineNo(GrammarElement el)
Generate a #line or // line depending on options

genLineNo

public void genLineNo(Token t)
Generate a #line or // line depending on options

genLineNo

public void genLineNo(int line)
Generate a #line or // line depending on options

genLineNo2

public void genLineNo2()
Generate a #line or // line depending on options

genMatch

protected void genMatch(GrammarAtom atom)

genMatch

protected void genMatch(BitSet b)

genMatchUsingAtomText

protected void genMatchUsingAtomText(GrammarAtom atom)

genMatchUsingAtomTokenType

protected void genMatchUsingAtomTokenType(GrammarAtom atom)

genNextToken

public void genNextToken()
Generate the nextToken() rule. nextToken() is a synthetic lexer rule that is the implicit OR of all user-defined lexer rules.
Parameters:

genRule

public void genRule(RuleSymbol s,
                    boolean startSymbol,
                    int ruleNum,
                    String prefix)
Gen a named rule block. ASTs are generated for each element of an alternative unless the rule or the alternative have a '!' modifier. If an alternative defeats the default tree construction, it must set _AST to the root of the returned AST. Each alternative that does automatic tree construction, builds up root and child list pointers in an ASTPair structure. A rule finishes by setting the returnAST variable from the ASTPair.
Parameters:
startSymbol - true if the rule is a start symbol (i.e., not referenced elsewhere)

genRuleHeader

public void genRuleHeader(RuleSymbol s,
                          boolean startSymbol)

genSemPred

protected void genSemPred(String pred,
                          int line)

genSemPredMap

protected void genSemPredMap(String prefix)
Write an array of Strings which are the semantic predicate expressions. The debugger will reference them by number only

genSynPred

protected void genSynPred(SynPredBlock blk,
                          String lookaheadExpr)

genTokenStrings

public void genTokenStrings(String prefix)
Generate a static array containing the names of the tokens, indexed by the token type values. This static array is used to format error messages so that the token identifers or literal strings are displayed instead of the token numbers. If a lexical rule has a paraphrase, use it rather than the token label.

genTokenTypes

protected void genTokenTypes(TokenManager tm)
            throws IOException
Generate the token types C++ file

getASTCreateString

public String getASTCreateString(String str)
Get a string for an expression to generate creating of an AST node
Parameters:
str - The arguments to the AST constructor

getASTCreateString

public String getASTCreateString(GrammarAtom atom,
                                 String str)
Get a string for an expression to generate creating of an AST node
Overrides:
getASTCreateString in interface CodeGenerator
Parameters:
str - The arguments to the AST constructor

getASTCreateString

public String getASTCreateString(Vector v)
Get a string for an expression to generate creation of an AST subtree.
Overrides:
getASTCreateString in interface CodeGenerator
Parameters:
v - A Vector of String, where each element is an expression in the target language yielding an AST node.

getLookaheadTestExpression

protected String getLookaheadTestExpression(Alternative alt,
                                            int maxDepth)
Generate a lookahead test expression for an alternate. This will be a series of tests joined by '&&' and enclosed by '()', the number of such tests being determined by the depth of the lookahead.

getLookaheadTestExpression

protected String getLookaheadTestExpression(Lookahead[] look,
                                            int k)

getLookaheadTestTerm

protected String getLookaheadTestTerm(int k,
                                      BitSet p)
Generate a depth==1 lookahead test expression given the BitSet. This may be one of: 1) a series of 'x==X||' tests 2) a range test using >= && <= where possible, 3) a bitset membership test for complex comparisons
Parameters:
k - The lookahead level
p - The lookahead set for level k

getRangeExpression

public String getRangeExpression(int k,
                                 int[] elems)
Return an expression for testing a contiguous renage of elements
Parameters:
k - The lookahead level
elems - The elements representing the set, usually from BitSet.toArray().
Returns:
String containing test expression.

lookaheadIsEmpty

protected boolean lookaheadIsEmpty(Alternative alt,
                                   int maxDepth)
Is the lookahead for this alt empty?

mapTreeId

public String mapTreeId(String idParam,
                        ActionTransInfo transInfo)
Map an identifier to it's corresponding tree-node variable. This is context-sensitive, depending on the rule and alternative being generated
Overrides:
mapTreeId in interface CodeGenerator
Parameters:
idParam - The identifier name to map
Returns:
The mapped id (which may be the same as the input), or null if the mapping is invalid due to duplicates

printAction

public void printAction(Token t)
Print an action stored in a token surrounded by #line stuff

printHeaderAction

public void printHeaderAction(String name)
Print a header action by #line stuff also process any tree construction
Parameters:
name - The name of the header part

println

protected void println(String s)
Output tab indent followed by a String followed by newline, to the currentOutput stream. Ignored if string is null.
Overrides:
println in interface CodeGenerator
Parameters:
s - The string to output

processActionForSpecialSymbols

protected String processActionForSpecialSymbols(String actionStr,
                                                int line,
                                                RuleBlock currentRule,
                                                ActionTransInfo tInfo)
Lexically process tree-specifiers in the action. This will replace #id and #(...) with the appropriate function calls and/or variables.
Overrides:
processActionForSpecialSymbols in interface CodeGenerator

processStringForASTConstructor

public String processStringForASTConstructor(String str)
Process a string for an simple expression for use in xx/action.g it is used to cast simple tokens/references to the right type for the generated language. Basically called for every element in the vector to getASTCreateString(vector V)
Overrides:
processStringForASTConstructor in interface CodeGenerator
Parameters:
str - A String.