C library Ukrainian Cyrillic to Latin script transliteration
Install
Dependencies:
mkdir build
cd build && cmake ..
Usage
The library provides UTF-8 and UTF-16 API. The underlying ICU operates on UTF-16 strings.
/* UTF-8 */
uklatn_encode("Доброго вечора!", UklaltnTable_default, dest, destsize);
uklatn_decode("Paljanycja", UklaltnTable_default, dest, destsize);
/* UTF-16 */
uklatn_encodeu(u"Доброго вечора!", UklaltnTable_default, dest, destsize);
uklatn_decodeu(u"Paljanycja", UklaltnTable_default, dest, destsize);
Notes
Input is assumed to be in Ukrainian (Cyrillic or Latin script), and will be processed in full. If your data has mixed languages, do preprocessing to extract Ukrainian chunks.