intlGetCharAttrs

Returns attributes describing a given character.

Synopsis

uint32 intlGetCharAttrs(Item locItem, unichar character);

Description

This function examines the provided Unicode character and returns a bit mask describing its general attributes.

Arguments

locItem
A locale item, as obtained from intlOpenLocale().
character
The character whose attributes are to be returned.

Return Value

Returns a bit mask, with bits set to indicate various characteristics as defined by the Unicode standard. The possible bits are:

INTL_ATTRF_UPPERCASE
This character is uppercase.
INTL_ATTRF_LOWERCASE
This character is lowercase.
INTL_ATTRF_PUNCTUATION
This character is a punctuation mark.
INTL_ATTRF_DECIMAL_DIGIT
This character is a numeric digit.
INTL_ATTRF_NUMBER
This character represents a numerical value not representable as a single decimal digit. For example, the character 0x00bc represents the fraction 1/4.
INTL_ATTRF_NONSPACING
This character is a nonspacing mark.
INTL_ATTRF_SPACE
This character is a space character.
INTL_ATTRF_HALF_WIDTH
This character is HalfKana.
INTL_ATTRF_FULL_WIDTH
This character is FullKana.
INTL_ATTRF_KANA
This character is Kana (Katakana).
INTL_ATTRF_HIRAGANA
This character is Hiragana.
INTL_ATTRF_KANJI
This character is Kanji.
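
Because the return value is a bit mask, a caller would normally test individual bits with a bitwise AND rather than compare the whole value. The following is a minimal sketch of that idea, not code taken from the folio. The helper names attrs_is_cased_letter() and attrs_is_digit() are hypothetical, and the bit values are placeholders chosen only so the sketch compiles without intl.h; the real values come from intl.h and may differ.

```c
#include <stdint.h>

/* Placeholder bit values so this sketch compiles on its own.
   The real definitions live in intl.h and may differ. */
#ifndef INTL_ATTRF_UPPERCASE
#define INTL_ATTRF_UPPERCASE     (1u << 0)
#define INTL_ATTRF_LOWERCASE     (1u << 1)
#define INTL_ATTRF_DECIMAL_DIGIT (1u << 2)
#define INTL_ATTRF_SPACE         (1u << 3)
#endif

/* Hypothetical helper: nonzero if the mask marks the character
   as an uppercase or lowercase letter. */
static int attrs_is_cased_letter(uint32_t attrs)
{
    return (attrs & (INTL_ATTRF_UPPERCASE | INTL_ATTRF_LOWERCASE)) != 0;
}

/* Hypothetical helper: nonzero if the mask marks the character
   as a decimal digit. */
static int attrs_is_digit(uint32_t attrs)
{
    return (attrs & INTL_ATTRF_DECIMAL_DIGIT) != 0;
}

/* In a real program the mask would come from the folio call:
       uint32 attrs = intlGetCharAttrs(locItem, character);
   and would then be passed to the helpers above. */
```

More than one bit may be set in the same mask, so each characteristic should be tested separately rather than assumed to be exclusive.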

Implementation

Folio call implemented in international folio V24.

Associated Files

intl.h

Caveats

This function currently does not report any attributes for many characters in the upper Unicode range. Only the ECMA Latin-1 character page (0x0000 to 0x00ff) is handled correctly at this time. If the language bound to the Locale structure is Japanese, this function also works correctly for Japanese characters.
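
Given this limitation, a caller might check whether a character falls inside the Latin-1 page before relying on the returned mask. The sketch below shows one possible guard. The helper in_latin1_page() and the LATIN1_LAST constant are hypothetical names, and the sketch assumes unichar is a 16-bit unsigned value. It does not cover the Japanese case, since the exact character ranges handled under a Japanese locale are not stated here.

```c
#include <stdint.h>

/* Last code of the Latin-1 page named in the caveat. */
#define LATIN1_LAST 0x00ffu

/* Hypothetical helper: nonzero if ch lies in 0x0000..0x00ff, the only
   range the caveat says is handled correctly for every locale.
   Assumes unichar is a 16-bit unsigned value. */
static int in_latin1_page(uint16_t ch)
{
    return ch <= LATIN1_LAST;
}
```

For characters outside this range, a returned mask of zero may mean that no attributes are known rather than that none apply.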

See Also

intlOpenLocale(), intlConvertString()