DESCRIPTION
Blog post "Internationalization for embedded systems": http://blog.zuehlke.com/i18n/
Displaying text in the user’s language is taken for granted in the PC world. However, it is also becoming more common on embedded systems. Even small devices, which may have had only a segment display a couple of years ago, are now often capable of displaying high-quality characters on pixel displays.
© Zühlke 2014
Tobias Kniep
Effective Internationalization for Embedded Systems
26 February 2014
Safety Usability Acceptance
Coded Character Set
“A mapping from a set of abstract characters to a set of integers.”
Ken Lunde, CJKV Information Processing
Number of characters
ASCII: 128
JIS X 0213:2004: 4354
Unicode BMP: 65536
Unicode: 1114112
ASCII
H e a l t h
0x48 0x65 0x61 0x6C 0x74 0x68
[0x48,0x65,0x61,0x6C,0x74,0x68]
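The mapping from characters to integers, and from integers to bytes, can be reproduced in a few lines. This is only a minimal sketch in Python (chosen here for brevity, not taken from the slides), using built-ins only:

```python
# Look up each character of "Health" in the ASCII coded character set.
text = "Health"
code_points = [ord(ch) for ch in text]

# ASCII stores every code in a single byte, so the byte values
# are identical to the character codes.
ascii_bytes = text.encode("ascii")

print(" ".join(f"0x{cp:02X}" for cp in code_points))
print(list(ascii_bytes) == code_points)
```

For ASCII the coded character set and the byte sequence coincide, which is why the distinction between the two only becomes visible with larger character sets.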
Encoding Form
“Mapping from a character set definition to the actual code units used to represent the data.”
Encoding Scheme
“A character encoding form plus byte serialization.”
Glossary of the Unicode Consortium
Unicode
健 康
0x5065 0x5EB7
UTF-16LE: [0x65,0x50,0xB7,0x5E]
UTF-8: [0xE5,0x81,0xA5,0xE5,0xBA,0xB7]
UTF-32BE: [0x00,0x00,0x50,0x65,0x00,0x00,0x5E,0xB7]
�康
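The three byte sequences for 健康 can be checked with Python's standard codecs. This is a sketch for illustration; the helper name `hex_bytes` is made up for this example:

```python
# The two code points from the slide: U+5065 and U+5EB7.
text = "\u5065\u5EB7"

def hex_bytes(data):
    """Format a bytes object the way the slide writes byte lists."""
    return "[" + ",".join(f"0x{b:02X}" for b in data) + "]"

# Same characters, three different encoding schemes.
utf16le = text.encode("utf-16-le")
utf8 = text.encode("utf-8")
utf32be = text.encode("utf-32-be")

for name, data in (("UTF-16LE", utf16le), ("UTF-8", utf8), ("UTF-32BE", utf32be)):
    print(f"{name}: {hex_bytes(data)}")
```

The code points stay the same in all three cases; only the serialization into bytes differs, so a receiver has to know which scheme was used.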
Endianness
Little Endian: [0x65,0x50,0xB7,0x5E]
Big Endian: [0x50,0x65,0x5E,0xB7]
健康 (0x5065,0x5EB7)
Little-endian bytes read as big endian: 敐띞 (0x6550,0xB75E)
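The effect of reading UTF-16 bytes with the wrong byte order can be reproduced as follows. This is a minimal sketch using Python's built-in `utf-16-le` and `utf-16-be` codecs:

```python
# Bytes of the text as serialized in little-endian order.
le_bytes = bytes([0x65, 0x50, 0xB7, 0x5E])

# Decoding with the matching byte order recovers U+5065 U+5EB7.
right = le_bytes.decode("utf-16-le")

# Decoding with the opposite byte order swaps each byte pair and
# silently yields two unrelated characters, U+6550 and U+B75E.
wrong = le_bytes.decode("utf-16-be")

print([f"0x{ord(ch):04X}" for ch in right])
print([f"0x{ord(ch):04X}" for ch in wrong])
```

Neither decode raises an error, which is what makes a byte-order mismatch hard to spot: the result is valid text, just not the intended one.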
UTF-8
H: 0x48
¥: 0xC2,0xA5
健: 0xE5,0x81,0xA5
𠀷: 0xF0,0xA0,0x80,0xB7
Lead bytes: 0x48 = 01001000, 0xC2 = 11000010, 0xE5 = 11100101, 0xF0 = 11110000
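The lead-byte bit patterns above determine how long each UTF-8 sequence is. The sketch below is a hand-written illustration (the function names are hypothetical, and production code would normally use a tested library); it encodes a single code point and reads the sequence length from a lead byte:

```python
def utf8_encode(cp):
    """Encode one Unicode code point as a list of UTF-8 byte values."""
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x80:        # 0xxxxxxx
        return [cp]
    if cp < 0x800:       # 110xxxxx 10xxxxxx
        return [0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)]
    if cp < 0x10000:     # 1110xxxx 10xxxxxx 10xxxxxx
        return [0xE0 | (cp >> 12), 0x80 | ((cp >> 6) & 0x3F), 0x80 | (cp & 0x3F)]
    # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return [0xF0 | (cp >> 18), 0x80 | ((cp >> 12) & 0x3F),
            0x80 | ((cp >> 6) & 0x3F), 0x80 | (cp & 0x3F)]

def utf8_sequence_length(lead_byte):
    """Return the sequence length implied by the lead byte's high bits.

    Simplification: only the prefix bits are checked, so a few lead
    bytes that are invalid in strict UTF-8 (such as 0xC0) are accepted.
    """
    if lead_byte >> 7 == 0b0:
        return 1
    if lead_byte >> 5 == 0b110:
        return 2
    if lead_byte >> 4 == 0b1110:
        return 3
    if lead_byte >> 3 == 0b11110:
        return 4
    raise ValueError("continuation byte or invalid lead byte")

for cp in (0x48, 0xA5, 0x5065, 0x20037):
    encoded = utf8_encode(cp)
    print(f"U+{cp:04X}", [f"0x{b:02X}" for b in encoded],
          utf8_sequence_length(encoded[0]))
```

Because the length is visible in the first byte, a decoder on a small device can step through a UTF-8 string without a lookup table.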
UTF-16
H: 0x0048
¥: 0x00A5
健: 0x5065
𠀷: 0xD840,0xDC37
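The two code units for 𠀷 form a surrogate pair, which UTF-16 uses for every code point above the BMP. A minimal sketch of the calculation, with a hypothetical function name:

```python
def utf16_encode(cp):
    """Encode one Unicode code point as a list of 16-bit UTF-16 code units."""
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x10000:
        # BMP characters fit in a single code unit.
        return [cp]
    # Supplementary characters: subtract 0x10000, then split the
    # remaining 20 bits into a high and a low surrogate.
    v = cp - 0x10000
    return [0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF)]

for cp in (0x48, 0xA5, 0x5065, 0x20037):
    print(f"U+{cp:04X}", [f"0x{u:04X}" for u in utf16_encode(cp)])
```

Code that assumes one 16-bit unit per character works for the BMP but breaks on characters such as U+20037.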
UTF-32
H: 0x00000048
¥: 0x000000A5
健: 0x00005065
𠀷: 0x00020037
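To compare the three encoding forms, the sketch below counts the bytes each one needs for the four sample characters. It relies on Python's built-in codecs; the little-endian variants are used only so that no byte order mark is added:

```python
# The four sample characters: H, U+00A5, U+5065, U+20037.
samples = ["H", "\u00A5", "\u5065", "\U00020037"]
encodings = ("utf-8", "utf-16-le", "utf-32-le")

# Bytes needed per character in each encoding form.
sizes = {ch: tuple(len(ch.encode(enc)) for enc in encodings) for ch in samples}

for ch, (n8, n16, n32) in sizes.items():
    print(f"U+{ord(ch):04X}: UTF-8 {n8}, UTF-16 {n16}, UTF-32 {n32} bytes")
```

UTF-32 always needs four bytes per character, which makes indexing trivial but costs memory; UTF-8 and UTF-16 are more compact for most text at the price of variable length.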
Font
“A collection of glyphs used for the visual depiction of character data.”
Glossary of the Unicode Consortium
Bitmap Font
Pixel matrix (here: 3 bits/pixel)
No scaling
Easy to display
Predictable output
Large

Outline Font
Lines, arcs, curves
Needs renderer
Scaling
Output hard to predict
Anti-aliasing
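Why bitmap fonts are "large" becomes clear from a rough size estimate. The sketch below assumes an uncompressed font with a fixed 16x16 pixel cell per glyph; the cell size is an assumption made for this example, only the 3 bits/pixel and the glyph counts come from the slides:

```python
def bitmap_font_bytes(glyph_count, width, height, bits_per_pixel):
    """Estimate the storage of an uncompressed fixed-cell bitmap font."""
    bits_per_glyph = width * height * bits_per_pixel
    # Round each glyph up to whole bytes.
    bytes_per_glyph = (bits_per_glyph + 7) // 8
    return glyph_count * bytes_per_glyph

# Assumed 16x16 cell at 3 bits/pixel: 96 bytes per glyph.
ascii_size = bitmap_font_bytes(128, 16, 16, 3)
bmp_size = bitmap_font_bytes(65536, 16, 16, 3)

print(f"ASCII: {ascii_size} bytes")
print(f"Full BMP: {bmp_size} bytes ({bmp_size / 2**20:.0f} MiB)")
```

A real font covers only the glyphs a product needs, and each additional pixel size needs its own set of bitmaps, so the actual figure depends heavily on the supported languages and font sizes.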
Character Set, Encoding, Font
Ken Lunde, CJKV Information Processing, O’Reilly, 2009
Contact me
Tobias Kniep