[Openmcl-devel] Unicode Composition
R. Matthew Emerson
rme at clozure.com
Tue Dec 18 15:13:15 UTC 2012
On Dec 18, 2012, at 9:33 AM, Martial B <martialhb at gmail.com> wrote:
> I would like to know if there is a plan to build Unicode NFC support into Clozure CL. Would that be useful to you? I am dealing with French and Vietnamese characters, and not being able to get a single char/codepoint (but instead an array of bytes for each character unit and its diacritic elements) annoys me a bit (I am new to CCL, switching from SBCL). I will try to normalize strings with an external call to something like icu4c, but I'd be glad to know if there's a move in this direction in the future.
There's an internal function ccl::precompose-simple-string that might help you. It might be worth creating a ticket requesting exported functions to produce the various Unicode normalization forms.
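For reference, here is a rough sketch of what NFC composition does to the kind of text described above. It uses Python's standard unicodedata module rather than CCL, purely to show the expected result. It does not show the behavior or signature of ccl::precompose-simple-string, and the helper name to_nfc is made up for this example.

```python
import unicodedata


def to_nfc(text: str) -> str:
    """Return text in Unicode Normalization Form C (canonical composition)."""
    return unicodedata.normalize("NFC", text)


# French: 'e' followed by COMBINING ACUTE ACCENT (U+0301).
french = "e\u0301"

# Vietnamese: 'e' followed by COMBINING CIRCUMFLEX ACCENT (U+0302)
# and COMBINING ACUTE ACCENT (U+0301).
vietnamese = "e\u0302\u0301"

for decomposed in (french, vietnamese):
    composed = to_nfc(decomposed)
    # Each decomposed sequence collapses to a single precomposed code point.
    print(len(decomposed), "->", len(composed), "U+%04X" % ord(composed))
```

The French sequence composes to U+00E9 and the Vietnamese one to U+1EBF, so each ends up as a single character.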