intlGetCharAttrs

Returns attributes describing a given character.

Synopsis

uint32 intlGetCharAttrs(Item locItem, unichar character);

Description

This function examines the provided Unicode character and returns general information about it.

Arguments

locItem: A locale item, as obtained from intlOpenLocale().

character: The character whose attributes are to be returned.

Return Value

Returns a bit mask, with bits set to indicate various characteristics as defined by the Unicode standard. The possible bits are:

INTL_ATTRF_UPPERCASE: This character is uppercase.

INTL_ATTRF_LOWERCASE: This character is lowercase.

INTL_ATTRF_PUNCTUATION: This character is a punctuation mark.

INTL_ATTRF_DECIMAL_DIGIT: This character is a numeric digit.

INTL_ATTRF_NUMBER: This character represents a numerical value not representable as a single decimal digit. For example, the character 0x00bc represents the fraction 1/4.

INTL_ATTRF_NONSPACING: This character is a nonspacing mark.

INTL_ATTRF_SPACE: This character is a space character.

INTL_ATTRF_HALF_WIDTH: This character is HalfKana.

INTL_ATTRF_FULL_WIDTH: This character is FullKana.

INTL_ATTRF_KANA: This character is Kana (Katakana).

INTL_ATTRF_HIRAGANA: This character is Hiragana.

INTL_ATTRF_KANJI: This character is Kanji.
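Because the return value is a bit mask, a character can carry several attributes at once, and callers test for them with bitwise AND. The following is a minimal sketch of two such tests. The helper names attrs_has_any() and attrs_has_all() are hypothetical, and the flag values shown are placeholders so the sketch compiles outside the SDK; real code should include intl.h and use the values defined there.

```c
#include <stdint.h>

/* Stand-in definitions so this sketch compiles without the SDK.
 * ASSUMPTION: these bit positions are placeholders, not the values
 * from intl.h. In real code, include intl.h instead. */
typedef uint32_t uint32;
#define INTL_ATTRF_LOWERCASE     (1u << 0)
#define INTL_ATTRF_PUNCTUATION   (1u << 1)
#define INTL_ATTRF_DECIMAL_DIGIT (1u << 2)
#define INTL_ATTRF_SPACE         (1u << 3)

/* Nonzero if at least one bit of `wanted` is set in `attrs`. */
static int attrs_has_any(uint32 attrs, uint32 wanted)
{
    return (attrs & wanted) != 0;
}

/* Nonzero only if every bit of `wanted` is set in `attrs`. */
static int attrs_has_all(uint32 attrs, uint32 wanted)
{
    return (attrs & wanted) == wanted;
}
```

With the SDK available, the mask passed to these helpers would come from a call such as intlGetCharAttrs(locItem, character), where locItem was obtained from intlOpenLocale().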

Implementation

Folio call implemented in international folio V24.

Associated Files

intl.h

Caveats

This function currently reports no attributes for many Unicode characters above the Latin-1 range. Only the ECMA Latin-1 character page (0x0000 to 0x00ff) is handled correctly at this time. If the language bound to the Locale structure is Japanese, this function also works correctly for Japanese characters.
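Given this caveat, a caller may want to check whether a character falls inside the reliably handled range before trusting an empty attribute mask. The following is a minimal sketch of such a check. The helper name char_in_latin1_page() is hypothetical, and the unichar typedef is a stand-in that assumes a 16-bit character type; real code should take the type from the SDK headers.

```c
#include <stdint.h>

/* Stand-in for the SDK type so this sketch compiles on its own.
 * ASSUMPTION: unichar is a 16-bit unsigned value. */
typedef uint16_t unichar;

/* Nonzero if ch lies in the Latin-1 page (0x0000 to 0x00ff), the range
 * the caveat describes as handled correctly for any locale. */
static int char_in_latin1_page(unichar ch)
{
    return ch <= 0x00ff;
}
```

For a character outside this range in a non-Japanese locale, a mask of zero from intlGetCharAttrs() may simply mean the character is not yet covered, not that it has no attributes.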