Re: UTF-8 to CESU-8 conversion

Hi, colleagues

The following works well for me.

You can also try Python, which makes this easy to implement and test. If you use the unicode function and treat the CESU-8 encoded string as a byte stream already encoded in UTF-8, this works fine, because the two encodings are identical for code points up to U+FFFF. The problem with CESU-8 only arises for Unicode code points U+10000 and higher. For those code points you use a surrogate pair, which is described on Wikipedia. Here is the algorithm to get the UTF-16 representation for code points higher than U+FFFF.

v  = 0x64321

v′ = v - 0x10000
   = 0x54321
   = 0101 0100 0011 0010 0001

vh = v′ >> 10
   = 01 0101 0000             // higher 10 bits of v′

vl = v′ & 0x3FF
   = 11 0010 0001             // lower 10 bits of v′

w1 = 0xD800 + vh
   = 1101 1000 0000 0000
   +        01 0101 0000
   = 1101 1001 0101 0000
   = 0xD950                   // first code unit of UTF-16 encoding

w2 = 0xDC00 + vl
   = 1101 1100 0000 0000
   +        11 0010 0001
   = 1101 1111 0010 0001
   = 0xDF21                   // second code unit of UTF-16 encoding
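The same arithmetic can be sketched in a few lines of Python 3. This is only an illustration of the steps above; the function name to_surrogate_pair is my own and does not come from any library.

```python
def to_surrogate_pair(code_point):
    """Split a code point in U+10000..U+10FFFF into two UTF-16 code units.

    Hypothetical helper name, used here only for illustration.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point is outside the supplementary range")
    v = code_point - 0x10000      # 20-bit value v'
    w1 = 0xD800 + (v >> 10)       # higher 10 bits -> first code unit
    w2 = 0xDC00 + (v & 0x3FF)     # lower 10 bits  -> second code unit
    return w1, w2


w1, w2 = to_surrogate_pair(0x64321)
print(hex(w1), hex(w2))  # 0xd950 0xdf21
```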

In other words, if you then encode each of the two code units (w1 and w2) as a separate 3-byte UTF-8 sequence, you get a byte stream that HANA understands, so you can store the data correctly by using your own codec that is compliant with CESU-8.
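A minimal sketch of such a codec in Python 3 could look like the following. It assumes the input is an ordinary str without lone surrogates; cesu8_encode is a hypothetical name, and the "surrogatepass" error handler is the standard Python mechanism that lets a single surrogate be written as a 3-byte sequence.

```python
def cesu8_encode(text):
    """Encode a str as CESU-8 bytes (hypothetical helper, illustration only)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 0x10000:
            # Up to U+FFFF, CESU-8 and UTF-8 are byte-for-byte identical.
            out += ch.encode("utf-8")
        else:
            # Split into a surrogate pair, then write each half as 3 bytes.
            v = cp - 0x10000
            for unit in (0xD800 + (v >> 10), 0xDC00 + (v & 0x3FF)):
                out += chr(unit).encode("utf-8", "surrogatepass")
    return bytes(out)


print(cesu8_encode("\U00064321").hex())  # eda590edbca1
```

For the code point U+64321 from the example, plain UTF-8 would give 4 bytes, while the sketch gives 6 bytes: one 3-byte sequence for 0xD950 and one for 0xDF21.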

To learn more about UTF-8 encoding you can refer to the utfcpp library (utfcpp.sourceforge.net); the algorithm above can be used to extend it for CESU-8 compatibility.

You do not need to produce UTF-16 output in Python; a UTF-16 byte stream will not work for HANA. The surrogate values are only an intermediate step.

Regards,

Vasily Sukhanov
