intlGetCharAttrs
Returns attributes describing a given character.
Synopsis
uint32 intlGetCharAttrs(Item locItem, unichar character);
Description
This function examines the provided UniCode character and returns general information about it.
Arguments
- locItem
- A locale item, as obtained from intlOpenLocale().
- character
- The character to get the attribute of.
Return Value
Returns a bit mask, with bit sets to indicate various characteristics as defined by the UniCode standard. The possible bits are:
- This character is uppercase.
- INTL_ATTRF_LOWERCASE
- This character is lowercase.
- INTL_ATTRF_PUNCTUATION
- This character is a punctuation mark.
- INTL_ATTRF_DECIMAL_DIGIT
- This character is a numeric digit.
- INTL_ATTRF_NUMBER
- This character represent a numerical value not representable as a single decimal digit.
For example, a character 0x00bc represents the constant 1/2.
- INTL_ATTRF_NONSPACING
- This character is a nonspacing mark.
- INTL_ATTRF_SPACE
- This character is a space character.
- INTL_ATTRF_HALF_WIDTH
- This character is HalfKana.
INTL_ATTRF_FULL_WIDTH
- This character is FullKana.
- INTL_ATTRF_KANA
- This character is Kana (Katakana).
INTL_ATTRF_HIRAGANA
This character is Hiragana.
- INTL_ATTRF_KANJI
- This character is Kanji.
Implementation
Folio call implemented in international folio V24.
Associated Files
intl.h
Caveats
This function currently does not report any attributes for many upper UniCode characters. Only the ECMA Latin-1 character page (0x0000 to
0x00ff) is handled correctly at this time. If the language bound to the Locale structure is Japanese, then this function
will also work correctly for Japanese characters.
See Also
intlOpenLocale
(), intlConvertString()