© Zühlke 2014
Tobias Kniep
Effective Internationalization for Embedded Systems
26. February 2014
Safety Usability Acceptance
Coded Character Set “A mapping from a set of abstract characters to a set of integers.”
Ken Lunde, CJKV Information Processing
ASCII 128
JIS X 0213:2004 4354
Unicode BMP 65536
Unicode 1114112
Number of characters
ASCII
H e a l t h 0x48 0x65 0x61 0x6C 0x74 0x68
[0x48,0x65,0x61,0x6C,0x74,0x68]
Encoding Form
„A character encoding form plus byte serialization “
Glossary of the Unicode Consortium
„Mapping from a character set definition to the actual code units used to represent the data.”
Encoding Scheme
Unicode
健 康 0x5065 0x5EB7
UTF-16LE:[0x65,0x50,0xB7,0x5E] ? UTF-8:[0xE5,0x81,0xA5,0xE5,0xBA,0xB7]
UTF-32BE:[0x00,0x00,0x50,0x65,0x00,0x00,0x5E,0xB7]
�康
Endianess
Little Endian
Big Endian [0x65,0x50,0xB7,0x5E]
[0x50,0x65,0x5E,0xB7]
健康 (0x6550,0xB75E)
敐띞
UTF-8 H 0x48 ¥ 0xC2,0xA5
健 0xE5,0x81,0xA5
𠀷 0xF0,0xA0,0x80,0xB7
0x48 01001000 0xC2 11000010 0xE5 11100101 0xF0 11110000
UTF-16 H 0x0048 ¥ 0x00A5
健 0x5065
𠀷 0xD840,0xDC37
UTF-32 H 0x00000048 ¥ 0x000000A5
健 0x00005065
𠀷 0x00020037
Font “A collection of glyphs used for the visual depiction of character data.”
Glossary of the Unicode Consortium
Pixel-Matrix Here:
3 bits/pixel
No scaling
Easy to display
Predictable output
Bitmap Font
Large
Outline Font Lines, Arcs,
Curves Needs Renderer
Scaling Output hard to predict
Anti-aliasing
Character Set Encoding Font
Ken Lunde, CJKV Information Processing O’Reilly 2009
Contact me
Tobias Kniep