Package net.sf.saxon.codenorm

This package contains the code to implement Unicode normalization.

See:
          Description

Class Summary
Normalizer Implements Unicode Normalization Forms C, D, KC, KD.
NormalizerData Accesses the Normalization Data used for Forms C and D.
UnicodeData  
UnicodeDataGenerator This class reads the Unicode character database, extracts information needed to perform unicode normalization, and writes this information out in the form of the Java "source" module UnicodeData.java.
UnicodeDataParser This class reads the data compiled into class UnicodeData, and builds hash tables that can be used by the Unicode normalization routines.
 

Package net.sf.saxon.codenorm Description

This package contains the code to implement Unicode normalization.

Much of the code is derived directly from the reference implementation published by the Unicode Consortium. However, the code for constructing the data tables used as input to the algorithm has been redesigned and rewritten for efficiency.