Package org.jcodings.unicode
Class UnicodeEncoding
java.lang.Object
  org.jcodings.Encoding
    org.jcodings.AbstractEncoding
      org.jcodings.MultiByteEncoding
        org.jcodings.unicode.UnicodeEncoding
All Implemented Interfaces: Cloneable
Direct Known Subclasses: BaseUTF8Encoding, CESU8Encoding, FixedWidthUnicodeEncoding, UTF16BEEncoding, UTF16LEEncoding
-
Nested Class Summary
Nested Classes
Modifier and Type    Class    Description
private static class
private static class
private static class
private static class
private static class
private static class
(package private) static class
-
Field Summary
Fields
Modifier and Type    Field    Description
(package private) static final int    CASE_MAPPING_SLACK
(package private) static final int    DOT_ABOVE
(package private) static final int    DOTLESS_i
(package private) static final int    I_WITH_DOT_ABOVE
private static final int    PROPERTY_NAME_MAX_SIZE
(package private) static final short[]    UNICODE_ISO_8859_1_CTypeTable
-
Constructor Summary
Constructors
Modifier    Constructor    Description
protected    UnicodeEncoding(String name, int minLength, int maxLength, int[] EncLen)
protected    UnicodeEncoding(String name, int minLength, int maxLength, int[] EncLen, int[][] Trans)
-
Method Summary
Modifier and Type, Method, Description
void applyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg)
    onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
caseFoldCodesByString(int flag, byte[] bytes, int p, int end)
    onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings
final int caseMap(IntHolder flagP, byte[] bytes, IntHolder pp, int end, byte[] to, int toP, int toEnd)
    Oniguruma equivalent: case_map
protected final int[] ctypeCodeRange(int ctype)
private static int extractCode(int packed)
private static int extractLength(int packed)
getCharsetName
    The name of the equivalent Java Charset for this encoding.
boolean isCodeCType(int code, int ctype)
    Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
static boolean isInCodeRange(UnicodeCodeRange range, int code)
int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold)
    onigenc_ascii_mbc_case_fold
int propertyNameToCType(byte[] name, int p, int end)
    onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings
private static Object[] readFoldN
Methods inherited from class org.jcodings.MultiByteEncoding
isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength
Methods inherited from class org.jcodings.AbstractEncoding
asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, isCodeCTypeInternal, isNewLine
Methods inherited from class org.jcodings.Encoding
asciiToLower, asciiToUpper, codeToMbc, codeToMbcLength, ctypeCodeRange, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isReverseMatchAllowed, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, leftAdjustCharHead, length, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, mbcToCode, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Details
-
PROPERTY_NAME_MAX_SIZE
private static final int PROPERTY_NAME_MAX_SIZE
-
I_WITH_DOT_ABOVE
static final int I_WITH_DOT_ABOVE
-
DOTLESS_i
static final int DOTLESS_i
-
DOT_ABOVE
static final int DOT_ABOVE
-
CASE_MAPPING_SLACK
static final int CASE_MAPPING_SLACK
-
UNICODE_ISO_8859_1_CTypeTable
static final short[] UNICODE_ISO_8859_1_CTypeTable
-
-
Constructor Details
-
UnicodeEncoding
-
UnicodeEncoding
-
-
Method Details
-
getCharsetName
Description copied from class: Encoding
The name of the equivalent Java Charset for this encoding. Defaults to the name of the encoding. Subclasses can override this to provide a different name.
Overrides: getCharsetName in class Encoding
Returns: the name of the equivalent Java Charset for this encoding
-
isCodeCType
public boolean isCodeCType(int code, int ctype)
Description copied from class: Encoding
Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
Specified by: isCodeCType in class Encoding
Parameters:
code - a code point of a character
ctype - a character type to check against
Oniguruma equivalent: is_code_ctype
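As a rough illustration of what a character-type check can involve, the sketch below tests a code point against a sorted table of inclusive ranges using a binary search. The table layout (a pair count followed by from/to pairs), the class name `CodeRangeSketch`, and the sample `DIGIT_RANGES` data are all assumptions made for this example; they are not taken from the library's tables or its implementation.

```java
/** Hypothetical sketch: membership test against a sorted code range table. */
final class CodeRangeSketch {
    // Assumed layout: [pairCount, from1, to1, from2, to2, ...], sorted, inclusive bounds.
    // Sample data only: ASCII digits and Arabic-Indic digits.
    static final int[] DIGIT_RANGES = {2, 0x30, 0x39, 0x0660, 0x0669};

    /** Returns true if code falls inside one of the table's ranges. */
    static boolean isInCodeRange(int[] table, int code) {
        int low = 0;
        int high = table[0];
        // Find the first pair whose upper bound is not below code.
        while (low < high) {
            int mid = (low + high) >>> 1;
            if (code > table[(mid << 1) + 2]) {
                low = mid + 1;
            } else {
                high = mid;
            }
        }
        // code is in range only if it also reaches that pair's lower bound.
        return low < table[0] && code >= table[(low << 1) + 1];
    }

    public static void main(String[] args) {
        System.out.println(isInCodeRange(DIGIT_RANGES, '7'));
        System.out.println(isInCodeRange(DIGIT_RANGES, 'a'));
    }
}
```

A ctype check in this style would pick the table for the requested character type and then run the membership test on it.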
-
isInCodeRange
-
ctypeCodeRange
protected final int[] ctypeCodeRange(int ctype)
-
propertyNameToCType
public int propertyNameToCType(byte[] name, int p, int end)
Description copied from class: AbstractEncoding
onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings
Overrides: propertyNameToCType in class AbstractEncoding
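To make the `(name, p, end)` calling convention concrete, here is a minimal sketch of a property-name lookup over a byte slice. It assumes a normalization step that skips spaces, hyphens, and underscores and lowercases ASCII letters before the lookup, which is how Oniguruma-style property matching is commonly described. The class name, the `CTYPES` map, the numeric ctype values, and the `-1` return for unknown names are hypothetical; the real method may signal errors differently.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch: resolve a property name in name[p, end) to a ctype number. */
final class PropertyNameSketch {
    // Made-up ctype numbers for illustration; real values are defined by the library.
    static final Map<String, Integer> CTYPES = new HashMap<>();
    static {
        CTYPES.put("alpha", 1);
        CTYPES.put("whitespace", 2);
    }

    /** Returns the ctype for the normalized name, or -1 if unknown or non-ASCII. */
    static int propertyNameToCType(byte[] name, int p, int end) {
        StringBuilder key = new StringBuilder();
        for (int i = p; i < end; i++) {
            int c = name[i] & 0xff;
            if (c == ' ' || c == '-' || c == '_') continue; // separators are ignored
            if (c >= 0x80) return -1;                       // only ASCII names accepted here
            key.append((char) Character.toLowerCase(c));
        }
        Integer ctype = CTYPES.get(key.toString());
        return ctype == null ? -1 : ctype;
    }

    public static void main(String[] args) {
        byte[] name = "White_Space".getBytes(StandardCharsets.US_ASCII);
        System.out.println(propertyNameToCType(name, 0, name.length));
    }
}
```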
-
mbcCaseFold
Description copied from class: AbstractEncoding
onigenc_ascii_mbc_case_fold
Overrides: mbcCaseFold in class AbstractEncoding
Parameters:
flag - case fold flag
pp - an IntHolder that points at character head
fold - a buffer where to extract case folded character
Oniguruma equivalent: mbc_case_fold
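The parameter roles above (a mutable position at the character head and an output buffer for the folded character) can be sketched for the single-byte ASCII case as follows. This is not the Unicode implementation: the `Pos` class is a hypothetical stand-in for `IntHolder`, the `flag` parameter is left out, and the sketch assumes the return value is the number of bytes written to `fold`.

```java
/** Hypothetical sketch: fold one ASCII character into a buffer and advance the position. */
final class CaseFoldSketch {
    /** Stand-in for a mutable int holder; the name and shape are assumptions. */
    static final class Pos {
        int value;
        Pos(int value) { this.value = value; }
    }

    /** Folds the character at pp into fold[0], advances pp, returns bytes written. */
    static int asciiCaseFold(byte[] bytes, Pos pp, int end, byte[] fold) {
        if (pp.value >= end) return 0;            // nothing left to read
        int c = bytes[pp.value] & 0xff;
        if (c >= 'A' && c <= 'Z') c += 'a' - 'A'; // map upper case to lower case
        fold[0] = (byte) c;
        pp.value++;                               // move to the next character head
        return 1;
    }

    public static void main(String[] args) {
        byte[] input = {'A', 'b'};
        byte[] fold = new byte[4];
        Pos pp = new Pos(0);
        int written = asciiCaseFold(input, pp, input.length, fold);
        System.out.println(written + " byte(s), next position " + pp.value);
    }
}
```

A multibyte Unicode fold would differ in that one input character may span several bytes and may fold to more than one output character, so both the position advance and the returned length can exceed one.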
-
applyAllCaseFold
Description copied from class: AbstractEncoding
onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
Overrides: applyAllCaseFold in class AbstractEncoding
Parameters:
flag - case fold flag
fun - case folding functor (look at: ApplyCaseFold)
arg - case folding functor argument (look at: ApplyCaseFoldArg)
Oniguruma equivalent: apply_all_case_fold
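The functor-plus-argument pattern described by `fun` and `arg` can be sketched as a callback that is invoked once per case-fold pair, with an opaque argument threaded through each call. The `FoldVisitor` interface, its parameter list, and the ASCII-only pair enumeration below are assumptions for illustration and should not be read as the signature of `ApplyAllCaseFoldFunction`.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: visit every ASCII case-fold pair with a callback and an argument. */
final class ApplyAllCaseFoldSketch {
    /** Assumed callback shape: source code point, target code points, target count, user argument. */
    @FunctionalInterface
    interface FoldVisitor {
        void apply(int from, int[] to, int length, Object arg);
    }

    /** Calls fun for each upper/lower pair in both directions. */
    static void applyAllAsciiCaseFold(FoldVisitor fun, Object arg) {
        int[] to = new int[1];
        for (int upper = 'A'; upper <= 'Z'; upper++) {
            int lower = upper + ('a' - 'A');
            to[0] = lower;
            fun.apply(upper, to, 1, arg);
            to[0] = upper;
            fun.apply(lower, to, 1, arg);
        }
    }

    /** Usage example: collect each pair as text into the list passed as the argument. */
    @SuppressWarnings("unchecked")
    static List<String> collectPairs() {
        List<String> out = new ArrayList<>();
        applyAllAsciiCaseFold(
            (from, to, length, arg) -> ((List<String>) arg).add((char) from + "->" + (char) to[0]),
            out);
        return out;
    }

    public static void main(String[] args) {
        System.out.println(collectPairs().size() + " fold pairs visited");
    }
}
```

Passing the accumulator through `arg` rather than capturing it in the callback mirrors the C-style origin of the API, where the callback is a plain function pointer with no closure state.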
-
caseFoldCodesByString
Description copied from class: AbstractEncoding
onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings
Overrides: caseFoldCodesByString in class AbstractEncoding
-
caseMap
public final int caseMap(IntHolder flagP, byte[] bytes, IntHolder pp, int end, byte[] to, int toP, int toEnd)
Description copied from class: Encoding
Oniguruma equivalent: case_map
Overrides: caseMap in class MultiByteEncoding
-
readFoldN
-
extractLength
private static int extractLength(int packed)
-
extractCode
private static int extractCode(int packed)
-