net.sf.saxon.codenorm
class UnicodeDataGenerator extends Object
This class is derived from the sample program NormalizerData.java published by the Unicode Consortium. The code has been modified so that, instead of building the run-time data structures directly, it writes them to a Java "source" module, which is then compiled. The ability to construct a condensed version of the data tables has also been removed.
Copyright (c) 1991-2005 Unicode, Inc.
For terms of use, see http://www.unicode.org/terms_of_use.html
For documentation, see UAX#15.
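The approach described above (writing the tables out as Java source rather than building them at run time) can be illustrated with a minimal sketch. This is not the actual generator: the class name `SourceEmitterSketch`, the method `emit`, and the generated field names `CODE_POINTS` and `CLASSES` are all hypothetical, and the real output format of UnicodeDataGenerator may differ considerably.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch only: shows the general idea of emitting a data table
// as Java source text. None of these names come from Saxon.
final class SourceEmitterSketch {

    // Renders a (code point -> canonical combining class) table as the text
    // of a Java class holding two parallel int arrays.
    static String emit(String className, Map<Integer, Integer> table) {
        StringBuilder keys = new StringBuilder();
        StringBuilder values = new StringBuilder();
        for (Map.Entry<Integer, Integer> e : table.entrySet()) {
            if (keys.length() > 0) {
                keys.append(", ");
                values.append(", ");
            }
            keys.append("0x").append(Integer.toHexString(e.getKey()).toUpperCase());
            values.append(e.getValue());
        }
        StringBuilder out = new StringBuilder();
        out.append("public class ").append(className).append(" {\n");
        out.append("    public static final int[] CODE_POINTS = {").append(keys).append("};\n");
        out.append("    public static final int[] CLASSES = {").append(values).append("};\n");
        out.append("}\n");
        return out.toString();
    }

    public static void main(String[] args) {
        Map<Integer, Integer> sample = new LinkedHashMap<>();
        sample.put(0x0300, 230); // COMBINING GRAVE ACCENT
        sample.put(0x0327, 202); // COMBINING CEDILLA
        // The generated text would be redirected to a .java file and compiled.
        System.out.print(emit("GeneratedTableSketch", sample));
    }
}
```

Emitting source has the advantage that the tables are compiled into the class file once, so the Unicode data files need not be parsed, or even be present, at run time.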
Field Summary | |
---|---|
static String | copyright |
Method Summary | |
---|---|
static void | build()
Called exactly once by NormalizerData to build the static data |
static String | fromHex(String source)
Utility: Parses a sequence of hex Unicode characters separated by spaces |
static String | hex(char i)
Utility: Supplies a zero-padded hex representation of a Unicode character (without 0x, \u) |
static String | hex(String s, String sep)
Utility: Supplies a zero-padded hex representation of each character in a string, with the representations separated by sep (without 0x, \u) |
static void | main(String[] args)
Main program. |
Usage: java UnicodeDataGenerator dir >UnicodeData.java
where dir is the directory containing the files UnicodeData.txt and CompositionExclusions.txt from the Unicode character database.
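The hex utilities summarized above might behave roughly as in the following sketch. It is written from the method summaries only, not from the Saxon source: the class name `HexSketch` is hypothetical, four-digit padding and whitespace-separated input are assumptions, and only characters that fit in a single Java `char` are handled.

```java
// Hypothetical sketch of the three utilities, inferred from their summaries.
final class HexSketch {

    // Zero-padded, four-digit, upper-case hex form of one char, no prefix.
    static String hex(char c) {
        String s = Integer.toString(c, 16).toUpperCase();
        return "0000".substring(s.length()) + s;
    }

    // Hex form of every char in s, joined with sep.
    static String hex(String s, String sep) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            if (i != 0) {
                sb.append(sep);
            }
            sb.append(hex(s.charAt(i)));
        }
        return sb.toString();
    }

    // Parses whitespace-separated hex values into a string, one char each.
    // Assumption: every value is at most FFFF.
    static String fromHex(String source) {
        String trimmed = source.trim();
        if (trimmed.isEmpty()) {
            return "";
        }
        StringBuilder sb = new StringBuilder();
        for (String token : trimmed.split("\\s+")) {
            sb.append((char) Integer.parseInt(token, 16));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(hex('A'));                       // prints 0041
        System.out.println(hex("e\u0301", " "));            // prints 0065 0301
        System.out.println(hex(fromHex("0041 030A"), " ")); // prints 0041 030A
    }
}
```

In this sketch, `fromHex` and the two-argument `hex` are inverses of each other when the separator is a single space, which is convenient for round-tripping entries read from the Unicode data files.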