4.2.2 Encoding Options
These options control the encoding the document is expected to be in and how it is displayed.
- encoding
(string) The encoding the input text is in. Valid values include, but are not limited to, ‘iso-8859-*’, ‘utf-8’, ‘ucs-2’, and ‘ucs-4’. When using the Aspell utility, the default encoding is based on the current locale. Thus, if your locale currently uses the ‘utf-8’ encoding, then everything will be in UTF-8. The ‘ucs-2’ and ‘ucs-4’ encodings are intended for other programs that use the Aspell library and are not supported by the Aspell utility.
- normalize
(boolean) Perform Unicode normalization. Enabled by default.
- norm-strict
(boolean) Avoid lossy conversions when normalizing. Lossy conversions include compatibility mappings, such as splitting the letter ‘Œ’ (U+0152) into ‘O’ and ‘E’ when the combined letter is not available, and mappings that remove accents. Disabled by default except when creating dictionaries.
- norm-form
(string) The normalization form the output should be in. This option primarily affects the normalization form of the suggestions because, when spell checking, the actual text is unchanged unless there is an error. Valid values are ‘none’, ‘nfd’ for full decomposition (Normalization Form D), ‘nfc’ for Normalization Form C, or ‘comp’ for full composition. ‘comp’ is like ‘nfc’ except that full composition is used rather than canonical composition. The ‘normalize’ option must be enabled for this option to take effect.
- norm-required
(boolean) Set to true when the current language requires Unicode normalization. This is generally the case when private use characters are used internally by Aspell or when Normalization Form C is not the same as full composition.
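The difference between the encodings and normalization forms named above can be seen outside of Aspell. The following is a minimal sketch in Python using only the standard `unicodedata` module; it does not call Aspell and is not how Aspell implements these options. The helper names `to_form` and `strip_accents` are hypothetical, and the ‘comp’ form is left out because it is specific to Aspell and has no direct counterpart in `unicodedata`.

```python
import unicodedata

# Hypothetical helper: maps the 'none', 'nfd' and 'nfc' names used by the
# norm-form option onto Python's own normalization forms. 'comp' is
# Aspell-specific and deliberately not handled here (it raises KeyError).
def to_form(text: str, form: str) -> str:
    if form == "none":
        return text
    forms = {"nfd": "NFD", "nfc": "NFC"}
    return unicodedata.normalize(forms[form], text)

# Hypothetical helper: an example of a lossy conversion of the kind that
# norm-strict is meant to avoid. It decomposes the text and then drops
# every combining mark, so accents are removed.
def strip_accents(text: str) -> str:
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

# "cafe" with a precomposed e-acute (U+00E9): four code points.
composed = "caf\u00e9"

# The same text occupies different bytes in different encodings.
utf8_bytes = composed.encode("utf-8")         # e-acute takes two bytes
latin1_bytes = composed.encode("iso-8859-1")  # e-acute takes one byte

# Full decomposition splits e-acute into 'e' plus U+0301 (combining acute).
nfd = to_form(composed, "nfd")

# Canonical composition puts the pair back together.
nfc = to_form(nfd, "nfc")

print(len(composed), len(nfd), len(nfc))
print(utf8_bytes, latin1_bytes)
print(strip_accents(composed))
```

The decomposed and composed strings look identical when displayed but compare unequal code point by code point, which is why a consistent normalization form matters when suggestions are compared against the original text.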