Nitin Verma’s Blog

Posts Tagged ‘awk

UTF-8 using plain hands

Posted by: Neo on: January 1, 2009

Just open this rfc : RFC 3629 Open this table: Char. number range | UTF-8 octet sequence (hexadecimal) | (binary) ——————–+——————————————— 0000 0000-0000 007F | 0xxxxxxx 0000 0080-0000 07FF | 110xxxxx 10xxxxxx 0000 0800-0000 FFFF | 1110xxxx 10xxxxxx 10xxxxxx 0001 0000-0010 FFFF | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx Select a language with chars < U+10FFFF {It [...]


Follow

Get every new post delivered to your Inbox.