[ << ] | [ < ] | [ Up ] | [ > ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
3.3 Unicode Normalization
stringprep_ucs4_nfkc_normalize
- Function: uint32_t * stringprep_ucs4_nfkc_normalize (const uint32_t * str, ssize_t len)
str: a Unicode string.
len: length of
str
array, or -1 ifstr
is nul-terminated.Converts a UCS4 string into canonical form, see
stringprep_utf8_nfkc_normalize()
for more information.Return value: a newly allocated Unicode string, that is the NFKC normalized form of
str
.
stringprep_utf8_nfkc_normalize
- Function: char * stringprep_utf8_nfkc_normalize (const char * str, ssize_t len)
str: a UTF-8 encoded string.
len: length of
str
, in bytes, or -1 ifstr
is nul-terminated.Converts a string into canonical form, standardizing such issues as whether a character with an accent is represented as a base character and combining accent or as a single precomposed character.
The normalization mode is NFKC (ALL COMPOSE). It standardizes differences that do not affect the text content, such as the above-mentioned accent representation. It standardizes the "compatibility" characters in Unicode, such as SUPERSCRIPT THREE to the standard forms (in this case DIGIT THREE). Formatting information may be lost but for most text operations such characters should be considered the same. It returns a result with composed forms rather than a maximally decomposed form.
Return value: a newly allocated string, that is the NFKC normalized form of
str
.
[ << ] | [ < ] | [ Up ] | [ > ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
This document was generated on February 1, 2012 using texi2html 5.0.