org.apache.lucene.analysis.ru
Class RussianCharsets
java.lang.Object
  org.apache.lucene.analysis.ru.RussianCharsets
public class RussianCharsets
extends java.lang.Object
The RussianCharsets class contains encoding schemes (charsets) and a toLowerCase() method implementation
for Russian characters in Unicode, KOI8 and CP1251.
Each encoding scheme contains lowercase (positions 0-31) and uppercase (positions 32-63) characters.
Other encoding schemes (such as ISO-8859-5 or a customized one) can be supported by adding a new charset
and adding logic for that charset to the toLowerCase() method.
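The table layout described above can be illustrated with a minimal, self-contained sketch. It does not reproduce the actual RussianCharsets source: the class name RussianCharsetSketch, the table contents, and the generic table-scan lookup are all assumptions made for illustration. The ISO_8859_5 table is a hypothetical added charset, built from the standard ISO-8859-5 positions of the 32 basic Cyrillic letters.

```java
// Hypothetical sketch; not the Lucene RussianCharsets implementation.
final class RussianCharsetSketch {

    // Assumed layout: lowercase letters at positions 0-31,
    // the matching uppercase letters at positions 32-63.
    static final char[] UNICODE_RUSSIAN = buildTable('\u0430', '\u0410');

    // Hypothetical extra charset: in ISO-8859-5 the lowercase letters
    // occupy 0xD0-0xEF and the uppercase letters occupy 0xB0-0xCF.
    static final char[] ISO_8859_5 = buildTable((char) 0xD0, (char) 0xB0);

    // Builds a 64-entry table from the first lowercase and first uppercase code.
    static char[] buildTable(char firstLower, char firstUpper) {
        char[] table = new char[64];
        for (int i = 0; i < 32; i++) {
            table[i] = (char) (firstLower + i);
            table[i + 32] = (char) (firstUpper + i);
        }
        return table;
    }

    // Scans the uppercase half of the table; on a match, returns the
    // lowercase character stored 32 positions earlier. Any other
    // character is returned unchanged.
    static char toLowerCase(char letter, char[] charset) {
        for (int i = 32; i < 64; i++) {
            if (charset[i] == letter) {
                return charset[i - 32];
            }
        }
        return letter;
    }
}
```

With this layout, supporting another encoding in the sketch only requires a new 64-entry table, because the lookup is driven entirely by position. The real class may instead branch on the charset inside toLowerCase(), as the description suggests.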
Version:
- Boris Okner, b.okner@rogers.com
CP1251
public static char[] CP1251
KOI8
public static char[] KOI8
UnicodeRussian
public static char[] UnicodeRussian
toLowerCase
public static char toLowerCase(char letter,
char[] charset)
Copyright © 2000-2005 Apache Software Foundation. All Rights Reserved.