Transcript
Page 1: Effective Internationalization for Embedded Systems (Embedded World 2014)

© Zühlke 2014

Tobias Kniep

Effective Internationalization for Embedded Systems

26. February 2014

Page 2: Effective Internationalization for Embedded Systems (Embedded World 2014)

Safety Usability Acceptance

Page 3: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 4: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 5: Effective Internationalization for Embedded Systems (Embedded World 2014)

Coded Character Set “A mapping from a set of abstract characters to a set of integers.”

Ken Lunde, CJKV Information Processing

Page 6: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 7: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 8: Effective Internationalization for Embedded Systems (Embedded World 2014)

ASCII 128

JIS X 0213:2004 4354

Unicode BMP 65536

Unicode 1114112

Number of characters

Page 9: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 10: Effective Internationalization for Embedded Systems (Embedded World 2014)

ASCII

H e a l t h 0x48 0x65 0x61 0x6C 0x74 0x68

[0x48,0x65,0x61,0x6C,0x74,0x68]

Page 11: Effective Internationalization for Embedded Systems (Embedded World 2014)

Encoding Form

„A character encoding form plus byte serialization “

Glossary of the Unicode Consortium

„Mapping from a character set definition to the actual code units used to represent the data.”

Encoding Scheme

Page 12: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 13: Effective Internationalization for Embedded Systems (Embedded World 2014)

Unicode

健 康 0x5065 0x5EB7

UTF-16LE:[0x65,0x50,0xB7,0x5E] ? UTF-8:[0xE5,0x81,0xA5,0xE5,0xBA,0xB7]

UTF-32BE:[0x00,0x00,0x50,0x65,0x00,0x00,0x5E,0xB7]

�康

Page 14: Effective Internationalization for Embedded Systems (Embedded World 2014)

Endianess

Page 15: Effective Internationalization for Embedded Systems (Embedded World 2014)

Little Endian

Big Endian [0x65,0x50,0xB7,0x5E]

[0x50,0x65,0x5E,0xB7]

健康 (0x6550,0xB75E)

敐띞

Page 16: Effective Internationalization for Embedded Systems (Embedded World 2014)

UTF-8 H 0x48 ¥ 0xC2,0xA5

健 0xE5,0x81,0xA5

𠀷 0xF0,0xA0,0x80,0xB7

0x48 01001000 0xC2 11000010 0xE5 11100101 0xF0 11110000

Page 17: Effective Internationalization for Embedded Systems (Embedded World 2014)

UTF-16 H 0x0048 ¥ 0x00A5

健 0x5065

𠀷 0xD840,0xDC37

Page 18: Effective Internationalization for Embedded Systems (Embedded World 2014)

UTF-32 H 0x00000048 ¥ 0x000000A5

健 0x00005065

𠀷 0x00020037

Page 19: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 20: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 21: Effective Internationalization for Embedded Systems (Embedded World 2014)

Font “A collection of glyphs used for the visual depiction of character data.”

Glossary of the Unicode Consortium

Page 22: Effective Internationalization for Embedded Systems (Embedded World 2014)

Pixel-Matrix Here:

3 bits/pixel

No scaling

Easy to display

Predictable output

Bitmap Font

Large

Page 23: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 24: Effective Internationalization for Embedded Systems (Embedded World 2014)

Outline Font Lines, Arcs,

Curves Needs Renderer

Scaling Output hard to predict

Page 25: Effective Internationalization for Embedded Systems (Embedded World 2014)
Page 26: Effective Internationalization for Embedded Systems (Embedded World 2014)

Anti-aliasing

Page 27: Effective Internationalization for Embedded Systems (Embedded World 2014)

Character Set Encoding Font

Page 28: Effective Internationalization for Embedded Systems (Embedded World 2014)

Ken Lunde, CJKV Information Processing O’Reilly 2009


Recommended