apache > cocoon
 
Font size:      

Lexical Transformer (2.1 legacy document)

Warning
This document was copied as is from the Cocoon 2.1 documentation, but has not yet been fully reviewed or moved to its new home.

Lexical Transformer

The lexical transformer tokenizes the content of special marked elements of a SAX stream, by using a lexicon file.

  • Name: lexer
  • Class: org.apache.cocoon.transformation.LexicalTransformer
  • Cacheable: yes - uses the last modification date of the lexicon document for validation.

The lexer parses the following elements from the SAX stream, and replaces them through generated documents.

<text xmlns="http://chaperon.sourceforge.net/schema/text/1.0">
 [Text, which should be parsed]
</text>

The lexical transformer will replace these elements by a list of lexemes (tokens).

<lexemes xmlns="http://chaperon.sourceforge.net/schema/lexemes/1.0">
 <lexeme symbol="..." text="..."/>
 <lexeme symbol="..." text="..."/>
 <lexeme symbol="..." text="..."/>
</lexemes>

A detailed explanation of function and the lexicon format can be found at Chaperon.

Comments

add your comments