CEN/TC304 N932
Source: Secretariat
Date: 15 Dec 1999
Title: European Fallback Rules, ballot
Mailed: 15 Dec 1999
Status: TC-enquiry: DEADLINE 1st March 2000
Action required: Respond before 1 March 2000
Notes: This is a TC-enquiry, to establish the suitability of N932 to be sent for Formal Vote as prENV. National member body officers responsible for CEN/TC304 issues are asked to fill in this form and send it to the TC-secretariat before 1 March 2000.
Comments in any form will be forwarded to the CEN/TC 304 Project Team of European Fallback Rules. The PT will before the next plenary of TC304 in April 2000 produce a disposition of comments and a revised draft. The PT will produce a disposition of comments and plans to ask the TC304 plenary in November to approve a revised draft to be sent for Formal Vote. Comments from affiliated members of CEN and liaisons are welcome and will be considered.
Country:______________________________
Approves without comments ___
Approves with comments ___
Disapproves with comments ___
Date:_______________
Signature_____________________________________(National Member Body officer)
Name:__________________________________
EUROPEAN PRESTANDARD DRAFT
PRÉNORME EUROPÉENNE prENV_____
EUROPÄISCHE VORNORM
ICS: 35.040
Descriptors: Data processing, information interchange, text processing, text communication, graphic characters, character sets, representation of characters, coded character sets, conversion, fallback
English version
Information Technology
European fallback rules
This draft ENV is submitted to CEN members for Formal Vote. It has been drawn up by the Technical Committee CEN/TC 304.
This draft ENV was prepared by CEN in one official version (English). A version in another language made by translation under the responsibility of a CEN member into its own language and notified to the Central Secretariat has the same status as the official version.
CEN members are the national bodies of Austria, Belgium, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, and United Kingdom.
CEN
European Committee for Standardisation
Comité Européen de Normalisation
Europäisches Komitee für Normung
Central Secretariat: rue de Stassart 36, B-1050 Brussels
© CEN 1999 Copyright reserved to all CEN members
Ref. No. ENV_____:1999
Technologies de l'information - Character repertoire and coding transformations: European fallback rules - Nº 1
Informations technologies - Character repertoire and coding transformations: European fallback rules - Nº 1
Contents
Foreword
0. Introduction
0.1 Rationale for the standard provision of fallback rules
0.2 Basic concepts
0.3 Requirements
0.4 Satisfying the requirements
1. Scope and field of application
1.1 Scope
1.2 Field of application
2. Normative references
3. Definitions and abbreviations
3.1 Basic definitions
3.2 Other definitions
3.3 Abbreviations
4. Conformance
5. Fallback rules specification
5.1 Specification of the general fallback rules
ANNEX 1. (Normative) The list of fallback specification per character
Table I GENERAL FALLBACK TABLE
ANNEX II (Informative) EXAMPLES OF TEXT IN DIFFERENT LANGUAGES PRESENTED WITH FALLBACKS FROM TABLE I
ANNEX III (Informative) Notes on fallback for Latin, Greek and Cyrillic characters
ANNEX IV (Informative) Bibliography
Foreword
This ENV was prepared by CEN/TC304 European Localisation Requirements. It was approved in the month of _____ 1999. No other organisation than CEN took part in the preparation of this document.
This European Standard is intended to facilitate cross-border communications and data exchange and to ensure that European cultural requirements are safeguarded in the increasingly interconnected world of today. It provides rules for fallback for multilingual European texts into the invariant set of ISO/IEC 646. These rules come into effect if data from different languages must be represented by equipment and systems which do not support the presentation of all the characters in the different language repertoires.
This European Standard does not intend to influence, let alone substitute itself for, national standards or customs in this field. Nevertheless, national standards have the opportunity to adapt this European Standard by declaring a formalized set of deviation rules (»delta«) if they so wish.
This document does not cancel or replace another standard.
There is no known identical national standard in Europe.
According to the CEN/CENELEC Common Rules the following countries are bound to announce the existence of this ENV: Austria, Belgium, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, and United Kingdom.
0. Introduction
0.1 Rationale for the standard provision of fallback rules
Users who are trying to write text in a language which is not their mother language often have to write that text using a character repertoire which does not contain all the letters needed for that language, especially those with diacritical marks. A standard method of character substitution would be useful for such users.
Although computers are able to process larger repertoires of graphic characters than ever before, there are cases where it is not possible to render all the characters of a processing repertoire on an output device. In these cases, not all the characters in a processing repertoire are available in an output repertoire. In order to cater for these situations, a widely applicable standard method of character substitution (fallback) is required which will allow an approximate rendition to be made of the unsupported characters of the processing repertoire for output and rendition.
Examples of key applications are:
a) a multilingual information service offered across Europe, where personal or business documents come from different countries and are presented in a standardised rendition using MES-2 characters, some of which cannot be properly represented by the information service; and
b) search engines on the World Wide Web which use "fuzzy" search techniques based on search terms which have diacritical marks removed, and which apply common substitutions for less frequently used letters of the Latin alphabet. Examples of the latter are eth (ð), thorn (Þ), æ, œ and the German sharp s (ß).
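As an illustration of application b), the following is a minimal Python sketch of a search key that removes diacritical marks and applies substitutions for letters that have no mark to remove. The function name `search_key` and the values in `SPECIAL` are assumptions made for this example; they are not taken from Table 1.

```python
import unicodedata

# Hypothetical substitutions for letters that carry no diacritical mark to drop.
# These values are assumptions for illustration, not entries from Table 1.
SPECIAL = {"ð": "d", "þ": "th", "æ": "ae", "œ": "oe", "ß": "ss"}


def search_key(term: str) -> str:
    """Return a lower-case search key with diacritical marks removed."""
    out = []
    for ch in term.lower():
        if ch in SPECIAL:
            out.append(SPECIAL[ch])
            continue
        # NFD splits a precomposed letter into base letter + combining marks;
        # only the non-combining parts are kept.
        for part in unicodedata.normalize("NFD", ch):
            if not unicodedata.combining(part):
                out.append(part)
    return "".join(out)


if __name__ == "__main__":
    print(search_key("Créateur"), search_key("façon"), search_key("Straße"))
```

Two search terms that differ only in diacritical marks then produce the same key, which is the property a "fuzzy" search of this kind relies on.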
The provision of single fallback rules with a collection of fallback representations for MES-2 will enable the services to be improved and the applications to be made easier for the human end user. A standard set of substitutions would be useful for such applications in order to avoid confusion. The same applies for the other two scripts represented by MES-2, Cyrillic and Greek.
The justification for preparing an ENV for these purposes is that the concept of representing the characters without diacritical marks is not useful for scripts originating outside Europe. Furthermore, standardisation bodies of Europe that may wish to specify national schemes for fallback may modify the scheme given in this ENV for a limited set of characters and promulgate national standards for fallbacks. Greek and Cyrillic fallback representation specified in this ENV should be used with caution since transliteration into various Latin script languages depends on the target language. Local standards or local best practice should be referenced where they exist.
The European standard fallback specification can be used as a default in all relevant situations. It can be used as the basis for national standards, with local preferences being used for specific substitutions defined by a particular nation. It is expected that national standards for fallback will be registered in the international cultural registry as part of national locales. Well known local solutions will also be documented in addition to the default values.
0.2 Basic concepts
This standard specifies how a source stream of coded characters from a processing repertoire is represented in a target stream of an output repertoire. The worst case that is covered by the substitutions defined in this standard is where the processing repertoire is MES-2 and the output repertoire is the invariant repertoire of ISO/IEC 646. The coding of the processing repertoire and the coding of the output repertoire are outside the scope of this standard.
Characters in the source stream which occur in the output repertoire are transferred directly to the target stream without substitution. Characters in the source stream which do not occur in the output repertoire are subject to substitution.
There are two types of substitution. In the first type, the target characters are represented in a way that prevents the reverse transformation of the target stream to the source stream because of loss of information. A very common example of this type of presentation is when the Latin small letter e acute (é) is presented by Latin small letter e (e). This type of substitution, in which a letter with a diacritical mark is represented by the same letter without the diacritical mark, is known as accent dropping. The second type of presentation introduces special symbols that preserve the information about the original graphical symbol, enabling transformation of the character stream back to the original encoding. An example of this is the use of SGML entity references (e.g. &eacute; in the case above). This type of substitution is outside the scope of this standard.
The substitution with loss of information can take several forms, but two main classes are always recognised as basic:
- one-to-many: one graphical character of the source stream is substituted with more than one graphical character from the output repertoire in the target stream. An example of this type of presentation is Latin capital letter Æ presented as AE. This class is recommended for general use.
- one-to-one: one graphical character of the source stream is substituted with one graphical character of the output repertoire in the target stream. This type of presentation is required in applications where the number of characters in the data entries or fields (e.g. in data bases or application forms) is fixed. Accent dropping is a type of one-to-one substitution. It is anticipated that this class will have minority application; it should be discouraged and used only when there are strong technical reasons for doing so.
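The two classes can be sketched in Python as follows. This is an illustration under stated assumptions, not the normative table: the entries "AE" for Æ and "e" for é come from the text above, while the one-to-one value "A" for Æ, the reduced output repertoire and the name `substitute` are assumptions made for the example.

```python
import string

# Illustrative rows only. "AE" for Æ and "e" for é come from the text above;
# the one-to-one value "A" for Æ is an assumption.
ONE_TO_MANY = {"Æ": "AE", "é": "e"}
ONE_TO_ONE = {"Æ": "A", "é": "e"}

# A deliberately small output repertoire for the demonstration (assumption).
OUTPUT_REPERTOIRE = set(string.ascii_letters + " ")


def substitute(source: str, one_to_one: bool = False) -> str:
    """Copy characters found in the output repertoire; substitute the rest."""
    table = ONE_TO_ONE if one_to_one else ONE_TO_MANY
    target = []
    for ch in source:
        if ch in OUTPUT_REPERTOIRE:
            target.append(ch)          # transferred directly, no substitution
        else:
            target.append(table[ch])   # raises KeyError if no fallback is listed
    return "".join(target)


if __name__ == "__main__":
    print(substitute("Æther café"))
    print(substitute("Æther café", one_to_one=True))
```

The one-to-one result always has the same number of characters as the source, which is why that class suits fixed-length fields; the one-to-many result may be longer.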
0.3 Requirements
It is desirable that a standard fallback specification have a very large field of application so that it may be used across a wide range of platforms. To achieve this, a fallback specification is needed for a processing repertoire which is a superset of a large proportion of existing processing repertoires. Also, a fallback repertoire is needed which is a subset of a large proportion of existing output repertoires.
The characters contained in the collection Multilingual European Subset No. 2 (MES-2) specified in ISO/IEC 10646 have a wide usage across Europe. In many cases, MES-2 will become the processing repertoire of choice. MES-2 is a large repertoire for which there will be a need for a fallback specification.
0.4 Satisfying the requirements
MES-2 has been designed to be a superset of a wide range of processing repertoires for commercial and administrative applications in office environments across the EEA.
It is a valid assumption that the minimum output repertoire that is implemented in computer systems is the invariant repertoire of ISO/IEC 646. Therefore a substitution with one or more characters from the invariant repertoire of ISO/IEC 646 will always be able to be rendered on an output device. The invariant repertoire of ISO/IEC 646 does not have letters with diacritical marks nor letters from national alphabets used in Europe.
This standard satisfies the requirements by providing a fallback specification which can represent each character of MES-2 with one or more characters of the invariant repertoire of ISO/IEC 646.
1. Scope and field of application
1.1 Scope
This standard specifies the representation of the characters of the collection Multilingual European Subset No. 2 (MES-2) with the characters from the repertoire of the invariant part of ISO/IEC 646 (83 graphic characters). Where a character of MES-2 is not available in the invariant set of ISO/IEC 646, a fallback representation for rendering is specified.
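The target repertoire can be written down as a small Python set. The sketch below assumes that the figure of 83 counts SPACE together with the graphic characters, and that the twelve excluded code positions are the ones ISO/IEC 646 leaves open (two positions with alternative currency and number signs, and ten national-use positions). That reading is an assumption of this example and should be checked against ISO/IEC 646 itself.

```python
# Code positions that ISO/IEC 646 does not fix, shown by the characters that
# occupy them in the IRV/ASCII table (assumption: this list is complete).
VARIANT = set("#$@[\\]^`{|}~")

# SPACE (0x20) through position 0x7E, minus the variant positions.
INVARIANT = {chr(cp) for cp in range(0x20, 0x7F)} - VARIANT


def in_invariant(text: str) -> bool:
    """True if every character of text belongs to the invariant repertoire."""
    return all(ch in INVARIANT for ch in text)


if __name__ == "__main__":
    print(len(INVARIANT))
    print(in_invariant("Strassen"), in_invariant("Straßen"))
```

A check of this kind can be run over the output of any fallback operation to confirm that nothing outside the target repertoire remains.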
1.2 Field of application
The fallback rules given here are only intended for data in more than one European language, i.e. for use in pan-European applications. They are not meant to influence, let alone replace, existing national standards or practices.
2. Normative references
2.1 ISO/IEC 10646-1:1993, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane.
2.2 ISO/IEC 10646-1:1993/Cor.1:1996, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane, Technical Corrigendum 1.
2.3 ISO/IEC 646:1991, Information technology - ISO 7-bit coded character set for information interchange.
3. Definitions and abbreviations
3.1 Basic definitions
For the purposes of this standard the basic definitions of ISO/IEC 10646-1 section 4 apply. The following are reproduced here for ease of reference:
3.1.1 character: A member of a set of elements used for organisation, control, or representation of data.
3.1.2 coded character: A character together with its coded representation.
3.1.3 coded character set: A set of unambiguous rules that establishes a character set and the relationship between the characters of the set and their coded representation.
3.1.4 repertoire: A specified set of characters that are represented in a coded character set.
3.1.5 presentation: To present: the process of writing, printing, or displaying of a graphic symbol.
3.1.6 graphic symbol: The visual representation of a graphic character or of a composite sequence.
3.1.7 graphic character: A character, other than a control function, that has a visual representation,normally hand-written, printed, or displayed.
3.2 Other definitions
Also, the following definitions apply:
3.2.1 diacritical mark: Any of a number of recurring graphical structures placed over, under, next to, or through a basic letter which does not significantly modify the shape of the basic letter itself and which in combination with that basic letter is a graphic character of the identified repertoire of MES-2.
3.2.2 letter with diacritical mark: A letter which is constructed as the combination of a basic letter and a diacritical mark.
3.2.3 basic letter: A letter that is one of the letters of the repertoire of the IRV of ISO 646.
3.2.4 character stream: A series of coded characters in sequence, sometimes referenced as a stream.
3.2.5 character classification: The characters of the repertoire are classified into letters, digits and special characters.
3.2.6 processing repertoire: This is the repertoire used by an application for processing character based information.
3.2.7 output repertoire: This is the repertoire used by a computer system for external representation of character based information.
3.3 Abbreviations
The following abbreviations apply:
3.3.1 MES-2: Multilingual European Subset No. 2
4. Conformance
A claim of conformance to this standard during a fallback operation implies representing the characters of MES-2 (as specified in Table 1, columns 1-3) by the characters of the Invariant Subset of ISO/IEC 646 (as specified in Table 1, column 4).
The fallbacks used in Table 1, column 5 should only be used where no other fallback is possible.
5. Fallback rules specification
5.1 Specification of the general fallback rules
This standard provides two mappings for substitution. The first mapping is a one-to-many (1:many) fallback and the second is a one-to-one (1:1) fallback for characters outside of the ISO/IEC 646 Invariant Subset. The one-to-one mapping should be avoided, and used only when technical difficulties prevent the use of the one-to-many fallback.
Common practice is used as the basis for determining fallbacks, where possible. Both fallback methods drop accents (diacritical marks). Common practice in several European countries leads to fallbacks to digraphs for several accented and modified vowels used in Nordic countries and in German-speaking countries (aa, ae, oe, ue, etc.). One character may default to several characters, especially in the case of symbols, where fallbacks are sometimes based on the name of the character.
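One way to combine the two rules in the paragraph above is to look up the digraph cases first and drop diacritical marks for everything else. The Python sketch below shows that ordering. The digraphs aa, ae, oe and ue are named in the text and "ss" for sharp s appears in the Annex II sample, but the pairing of letters to digraphs in `DIGRAPHS` is an assumption based on common practice rather than a copy of Table 1, and `one_to_many` is a name chosen for this example.

```python
import unicodedata

# Assumed pairing of letters to the digraphs mentioned in the text.
DIGRAPHS = {"å": "aa", "ä": "ae", "ö": "oe", "ü": "ue", "ß": "ss"}


def one_to_many(text: str) -> str:
    """Apply digraph fallbacks first, then drop remaining diacritical marks."""
    out = []
    for ch in text:
        low = ch.lower()
        if low in DIGRAPHS:
            rep = DIGRAPHS[low]
            # Keep an initial capital when the source letter was upper-case.
            out.append(rep.capitalize() if ch != low else rep)
        else:
            # Accent dropping: decompose and keep only non-combining parts.
            out.append("".join(p for p in unicodedata.normalize("NFD", ch)
                               if not unicodedata.combining(p)))
    return "".join(out)


if __name__ == "__main__":
    print(one_to_many("schöne Plätze"))
    print(one_to_many("héritage"))
```

The order matters: if accent dropping ran first, ö would already have become o and the digraph rule could never apply.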
Fallbacks for the letters of the Greek and Cyrillic alphabets are based on the international romanisation standards ISO 843 and ISO 9 where possible, amended where necessary to allow fallbacks to be limited to the Invariant Subset of ISO/IEC 646.
ANNEX 1. (Normative) The list of fallback specification per character
Table I GENERAL FALLBACK TABLE
Column 1 contains the ISO/IEC 10646-1 code for the source character
Column 2 contains a glyph
Column 3 contains the ISO/IEC 10646-1 character name
Column 4 contains a glyph or glyphs for the one-to-many fallback
Column 5 contains a glyph for the one-to-one fallback
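The five columns map naturally onto a record type. The following Python sketch is one possible in-memory form of a table row, with a simple consistency check. The class and function names are hypothetical, the two sample rows are illustrative (the one-to-one value for Æ is an assumption), and the ASCII test is only a coarse stand-in for membership of the Invariant Subset.

```python
from dataclasses import dataclass
import unicodedata


@dataclass(frozen=True)
class FallbackRow:
    code: int          # column 1: ISO/IEC 10646-1 code of the source character
    glyph: str         # column 2: the source character itself
    name: str          # column 3: character name
    one_to_many: str   # column 4: one-to-many fallback
    one_to_one: str    # column 5: one-to-one fallback


def make_row(glyph: str, one_to_many: str, one_to_one: str) -> FallbackRow:
    """Fill columns 1 and 3 from the character itself."""
    return FallbackRow(ord(glyph), glyph, unicodedata.name(glyph),
                       one_to_many, one_to_one)


def check_row(row: FallbackRow) -> list:
    """Return a list of problems found in a row (empty if none)."""
    problems = []
    if not (row.one_to_many and row.one_to_many.isascii()):
        problems.append("column 4 must be non-empty ASCII")
    if len(row.one_to_one) != 1 or not row.one_to_one.isascii():
        problems.append("column 5 must be exactly one ASCII character")
    return problems


# Illustrative rows; the "A" in column 5 for Æ is an assumption.
ROWS = [make_row("Æ", "AE", "A"), make_row("é", "e", "e")]

if __name__ == "__main__":
    for row in ROWS:
        print(f"{row.code:04X} {row.glyph} {row.name}: "
              f"{row.one_to_many} / {row.one_to_one}", check_row(row))
```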
ANNEX II. (Informative)
Examples of fallback representation of text in different languages and scripts
Multilingual original text
Notre ancienne langue est le fondement de notre héritage culturel et spirituel sans lequel nous ne pourrions exister de la façon prévue par notre Créateur. L’éducation est la transmission de la culture et des valeurs; par conséquent, nous déclarons que l’éducation de la langue ancienne sera le moyen de transmission de l’héritage spirituel et culturel.
In alten, alten Zeiten, als die Menschen noch in ganz anderen Sprachen redeten, gab es in den warmen Ländern schon große und prächtige Städte. Da erhoben sich die Paläste der Könige und Kaiser, da gab es breite Straßen, enge Gassen und winkelige Gäßchen, da standen herrliche Tempel mit goldenen und marmornen Götterstatuen, da gab es bunte Märkte, wo Waren aus aller Herren Länder feilgeboten wurden, und weite schöne Plätze, wo die Leute sich versammelten, um Neuigkeiten zu besprechen und Reden zu halten oder anzuhören. Und vor allem gab es dort große Theater.
Internet je tehnološki in družbeni fenomen devedesetih let. O Internetu se povsod govori in piše. Nekateri opisujejo Internet kot pojav novega obdobja v razvoju družbe in jo imenujejo globalna informacijska družba ali bolj natančno informacijska infrastruktura te družbe. Poenostavljeno informacijska družba je globalna širokopasovna komunikacijska infrastruktura in storitve za občane, podjetja, ustanove in vse ostale subjekte svetovnega gospodarstva.
Kroz projekt Hrvatske akademske i istraživačke mreže CARnet i Klinike za bolesti srca i krvnih žila Medicinskog fakulteta po prvi put u kardiološkoj praksi biti će izvršeno udaljeno podešavanje elektrostimulatora srca na bolesniku putem Interneta. Uvođenje ovakve prakse u područje medicine znači niz prednosti za bolesnike sa ugrađenim elektrostimulatorom.
[Russian sample text in Cyrillic script: not legible in this copy.]
[Serbian sample text in Cyrillic script: not legible in this copy.]
[Greek sample text in Greek script: not legible in this copy.]
Multilingual text with fallbacks
Notre ancienne langue est le fondement de notre heritage culturel et spirituel sans lequel nous ne pourrions exister de la facon prevue par notre Createur. L'education est la transmission de la culture et des valeurs; par consequent, nous declarons que l'education de la langue ancienne sera le moyen de transmission de l'heritage spirituel et culturel.
In alten, alten Zeiten, als die Menschen noch in ganz anderen Sprachen redeten, gab es in den warmen Laendern schon grosse und praechtige Staedte. Da erhoben sich die Palaeste der Koenige und Kaiser, da gab es breite Strassen, enge Gassen und winkelige Gaesschen, da standen herrliche Tempel mit goldenen und marmornen Goetterstatuen, da gab es bunte Maerkte, wo Waren aus aller Herren Laender feilgeboten wurden, und weite schoene Plaetze, wo die Leute sich versammelten, um Neuigkeiten zu besprechen und Reden zu halten oder anzuhoeren. Und vor allem gab es dort grosse Theater.
Internet je tehnoloshki in druzhbeni fenomen devedesetih let. O Internetu se povsod govori in pishe. Nekateri opisujejo Internet kot pojav novega obdobja v razvoju druzhbe in jo imenujejo globalna informacijska druzhba ali bolj natanchno informacijska infrastruktura te druzhbe. Poenostavljeno informacijska druzhba je globalna shirokopasovna komunikacijska infrastruktura in storitve za obchane, podjetja, ustanove in vse ostale subjekte svetovnega gospodarstva.
Kroz projekt Hrvatske akademske i istrazhivachke mrezhe CARnet i Klinike za bolesti srca i krvnih zhila Medicinskog fakulteta po prvi put u kardioloshkoj praksi biti ce izvrsheno udaljeno podeshavanje elektrostimulatora srca na bolesniku putem Interneta. Uvodjenje ovakve prakse u podruchje medicine znachi niz prednosti za bolesnike sa ugradjenim elektrostimulatorom.
Na VSEKH virtualynih serverakh, razmeshchennyikh na baze kompanii Yelvis-Telekom bez vzimaniya dopolnitel"noj oplat' i vklyuchena vozmozhnost' ispol'zovaniya RNR3. Rasshirenie fajlov, ispol'zuemoe dlya obrabotki RNR. Dokumentatsiya po RNR3 i drugim sredstva WWW programmirovaniya dostupna tut. Uvazhaemye Polz'ovateli ! Nesmotrya na vvedenie v Moskve s 1 iyuliya 1999 goda 2%-nogo naloga s prodazh, kompaniya YELVIS-TELEKOM prinyala reshenie: ne povyshat' svoi tarif' i na vse vidyi uslug.
I kao shto svaka bajka jelte pochinje evo i mi c'emo pocheti a kako c'e se zavrshiti zavisi od nashe prljave mashte. Nekada davno zhiveli su Ivica i Marica u gustoj shumi. Medjutim u istoj shumi je zhivela stara veshtica.
Opo6- diapisto-sate to pio basiko apo ta endiaferonta moy einai ta atoma me ti6- legomene6- "eidekes anagke6-" i- oi anthro-poi me anapi-rie6-. Ascholoymai me ti-n ekdosi- toy periodikoy ANAPI-RIA TO-RA katho-6- kai me to sti-sipso ayto-n to-n selido-n.
ANNEX III. (Informative)
Notes on fallback for Latin, Greek and Cyrillic characters
III.0 Fallback from extended Latin characters
Diacritics as parts of pre-composed Latin characters are not recorded, although non-combining free-standing diacritics (rarely used in practice, compared to their precomposed forms in MES-2, or in combination with their combining forms if this fallback standard were to be extended to MES-3) are recorded as " . "
Note: sometimes extended Latin to basic Latin fallback is described as transliteration. This should more correctly be described as fallback, as transliteration really relates to the Conversion of Scripts: transliteration is a form of Romanization as noted below in section III.1.
III.1 Fallback from Greek characters to Latin characters
Fallback from Greek characters to the ISO/IEC 646 Invariant Subset is effectively the same as Romanization. There are two types of romanization: transliteration and transcription.
Transliteration is a simple character by character conversion, irrespective of pronunciation.
Transcription undertakes additional conversions to maximise compatibility with pronunciation, and may differ in detail from a simple transliteration. For some languages and scripts the two outcomes can be identical, but for some languages, including Greek, transcription is desirable due to the way the Greek language is written in Greek script.
In some cases, including Greek to Latin, transcription can also be undertaken through a multiple-pass (usually a two-pass) process, to achieve a more pronounceable fallback. Restoration of the original script is possible both from fallback using transliteration, and from fallback using transcription. Transcription is the preferred method used in Greece.
In this context transcription provides different conversions on some character pairs, before undertaking a simple transliteration fallback. The fallback repertoire in both cases is the Invariant Subset of ISO/IEC 646.
From the Greek characters in Table 1 it is possible to derive:
- one-to-many transliteration fallbacks,
- one-to-many transcription fallbacks, or
- one-to-one transliteration fallbacks.
In any body of Greek text that has been converted via a fallback conversion to the repertoire of the ISO/IEC 646 Invariant Subset, it should be possible to compare the text against Table 1 or Table 2, to determine whether the fallback conversion has used (in order of preference) a one-to-many transliteration fallback, a one-to-many transcription fallback, or a one-to-one transliteration fallback.
III.2 One-to-many transliteration fallbacks
Fallbacks that are listed in Table 1 for Greek characters provide a one-to-many transliteration fallback for Greek according to the provisions of the Romanization standards ISO 843 and ELOT 743, with minor modifications to ensure that the fallbacks are limited to the repertoire of the Invariant Subset of ISO/IEC 646.
The most obvious modification is use of " - " (HYPHEN-MINUS) instead of the macron specified in ISO 843 and ELOT 743. In one-to-many fallback, this is used with the Greek letters OMEGA and ETA, as is typically used in Greece. The use of " - " is also extended to the much more rarely used Greek symbols outside the range of basic Greek characters in order to maximise distinctiveness in characters, while maximising readability.
In Greek text there is little use of HYPHEN-MINUS within words, so ambiguous fallback text is unlikely as a result of this usage.
In addition, diacritics as parts of pre-composed Greek characters are not recorded, although non-combining free-standing diacritics (rarely used in practice, compared to Greek letters including precomposed diacritics) are recorded as " . " which is identical to the treatment of such characters in fallback from Latin script characters.
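The HYPHEN-MINUS convention and the dropping of precomposed diacritics can be sketched in Python as below. The table is deliberately partial: "i-" for ETA and "o-" for OMEGA follow the convention described above, while the remaining entries are assumptions chosen to agree with the Greek sample in Annex II and are not a copy of Table 1. The name `greek_fallback` is hypothetical.

```python
import unicodedata

# Partial, illustrative table (see the assumptions stated in the lead-in).
GREEK = {"α": "a", "ε": "e", "η": "i-", "θ": "th", "ι": "i", "ν": "n",
         "ο": "o", "π": "p", "ρ": "r", "τ": "t", "ω": "o-"}


def greek_fallback(text: str) -> str:
    """One-to-many transliteration sketch for a subset of Greek letters."""
    out = []
    # NFD separates precomposed letters from their diacritics, which are
    # then skipped ("not recorded").
    for ch in unicodedata.normalize("NFD", text):
        if unicodedata.combining(ch):
            continue
        low = ch.lower()
        rep = GREEK.get(low, ch)       # unlisted characters pass through
        out.append(rep.upper() if ch != low else rep)
    return "".join(out)


if __name__ == "__main__":
    print(greek_fallback("την"), greek_fallback("άνθρωποι"),
          greek_fallback("ΤΩΡΑ"))
```

The three sample words correspond to "ti-n", "anthro-poi" and "TO-RA" in the Annex II fallback text.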
III.3 One-to-many transcription fallbacks
Note: for Greek characters, a one-to-many transcription fallback is strongly preferred to a one-to-many transliteration fallback. It is the transcription tables from ISO 843 and ELOT 743 that are used in Greece and Cyprus, not simply the transliteration tables.
Transcription can also be achieved automatically, using a multiple stage fallback conversion, again using only the repertoire of the ISO/IEC 646 Invariant Subset.
Transcription provides a better equivalence to Greek pronunciation, by using a simple automatic multiple-pass conversion on some digraphs or trigraphs used in Greek before performing a simple fallback conversion on the remaining single characters.
Transcription can be applied either to the source data or the target data, after a one-to-many transcription fallback has been performed.
The transcription table appears as Table 2 in this standard.
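The multiple-pass idea can be sketched in Python as a digraph pass followed by a single-character pass. Table 2 is not reproduced in this text, so the digraph entry below ("ου" to "ou") is a hypothetical stand-in for a Table 2 rule, and the single-letter values are assumptions that cover only the sample words. The function names are chosen for this example.

```python
# Hypothetical entries for illustration; not copied from Table 1 or Table 2.
DIGRAPHS = {"ου": "ou"}
SINGLES = {"μ": "m", "ο": "o", "υ": "y", "τ": "t"}


def transcribe(text: str) -> str:
    """Two-pass sketch: digraphs first, then remaining single characters."""
    # Pass 1: replace digraphs and mark them as finished.
    tokens = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in DIGRAPHS:
            tokens.append((DIGRAPHS[pair], True))
            i += 2
        else:
            tokens.append((text[i], False))
            i += 1
    # Pass 2: simple character-by-character fallback on what is left.
    return "".join(tok if done else SINGLES.get(tok, tok)
                   for tok, done in tokens)


def transliterate(text: str) -> str:
    """Single-pass character-by-character fallback, for comparison."""
    return "".join(SINGLES.get(ch, ch) for ch in text)


if __name__ == "__main__":
    print(transliterate("μου"), transcribe("μου"))
```

Marking digraph results as finished keeps the second pass from touching them again, which is what makes the passes safe to run in sequence.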
III.4 Restoring Greek text from Latin script fallback text
A reversible transformation from the Invariant Subset of ISO/IEC 646 back to the equivalent Greek characters is also possible, whether this is done using a one-to-many transcription fallback or a one-to-many transliteration fallback, although information on restoration of relevant Greek diacritics is outside the scope of this standard.
III.5 One-to-one transliteration fallbacks
Note that the use of a one-to-one transliteration fallback would result in incorrect data if a similar Latin to Greek conversion were to be applied. Use of one-to-one fallback is generally discouraged in this standard, and should be particularly avoided when dealing with Greek text.
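The reason a reverse conversion goes wrong can be shown by listing the target characters that more than one source letter maps to. The Python sketch below uses assumed one-to-one values in which the HYPHEN-MINUS marker is dropped so that each letter yields exactly one character; these values and the name `collisions` are illustrative and are not taken from Table 1.

```python
from collections import defaultdict

# Assumed one-to-one values (hyphen-minus marker dropped); illustration only.
ONE_TO_ONE = {"η": "i", "ι": "i", "ο": "o", "ω": "o", "α": "a"}


def collisions(table: dict) -> dict:
    """Return target characters that are shared by several source letters."""
    by_target = defaultdict(list)
    for src, dst in table.items():
        by_target[dst].append(src)
    return {dst: srcs for dst, srcs in by_target.items() if len(srcs) > 1}


if __name__ == "__main__":
    for dst, srcs in collisions(ONE_TO_ONE).items():
        print(dst, "<-", ", ".join(srcs))
```

Any target character that appears in the result cannot be restored to a unique source letter without further information.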
III.6 Fallback from Cyrillic characters to Latin characters
From the Cyrillic characters in Table 1 it is possible to derive one-to-many transliteration fallbacks or one-to-one transliteration fallbacks.
In any body of Cyrillic script text that has been converted via a fallback conversion to the repertoire of the ISO/IEC 646 Invariant Subset, it may be possible to use statistical analysis of the fallback text to determine the likely languages used.
Specification of such analyses is outside the scope of this standard: however it may be useful to note that the characters in the Cyrillic section of Table 1 can be loosely grouped as:
(a) 0410-044F: Basic Slavonic Cyrillic characters (in Russian/Bulgarian)
(b) 0490-0491: Additional Ukrainian/Belarusian characters
(c) 0401-040F and 0451-045F: Additional modern Slavonic Cyrillic characters (in the above languages and in Serbian and Macedonian)
(d) 0492-04F9, + 048C-048F, + 04EC-04ED: Additional modern non-Slavonic Cyrillic characters
(e) 0460-0486: Additional archaic Slavonic Cyrillic characters
+ indicates new to ISO/IEC 10646-1:2000.
Languages using ranges (b), (c), (d) and (e) also tend to use most of the characters from the range (a). Russian (except for using 0401 CYRILLIC CAPITAL LETTER IO and 0451 CYRILLIC SMALL LETTER IO) and Bulgarian generally use only characters from the range (a).
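The grouping can be expressed as a range lookup. The Python sketch below classifies Cyrillic source characters into groups (a) to (e) and counts them, which is one simple input for the kind of analysis mentioned above; it works on source text, and applying the same idea to fallback text would need the fallback table. The function names are hypothetical. The range 04EC-04ED is not listed separately because it lies inside 0492-04F9.

```python
from collections import Counter

# (group, first code, last code), taken from the list above.
GROUPS = [
    ("a", 0x0410, 0x044F),
    ("b", 0x0490, 0x0491),
    ("c", 0x0401, 0x040F),
    ("c", 0x0451, 0x045F),
    ("d", 0x0492, 0x04F9),   # also covers 04EC-04ED
    ("d", 0x048C, 0x048F),
    ("e", 0x0460, 0x0486),
]


def group_of(ch: str):
    """Return the group letter for a character, or None if it is in no group."""
    cp = ord(ch)
    for group, first, last in GROUPS:
        if first <= cp <= last:
            return group
    return None


def profile(text: str) -> Counter:
    """Count how many characters of text fall into each group."""
    return Counter(g for g in map(group_of, text) if g is not None)


if __name__ == "__main__":
    print(profile("живела шума"))
    print(profile("ёж"))
```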
Diacritics as parts of pre-composed Cyrillic script characters are noted in the same way as other modifications to basic characters, mainly through use of " ' " (APOSTROPHE).
[Combining diacritics, which are rarely used in practice compared to their pre-composed forms in MES-2 (or in combination with their combining forms if this fallback standard were to be extended to MES-3), are recorded as " . "]
III.7 One-to-many transliteration fallbacks
Fallbacks that are listed in the section of Table 1 listing Cyrillic characters provide a one-to-many transliteration fallback for Cyrillic script, based where possible on tables in the Cyrillic Romanization standard ISO 9, with modifications to ensure that the fallbacks are limited to the repertoire of the Invariant Subset of ISO/IEC 646, and to allow the possibility of conversion of fallback text from the repertoire of ISO/IEC 646 back to Cyrillic characters, if required.
As ISO 9 is currently under revision, the fallbacks in the Cyrillic section of Table 1 also take into account approaches being considered during this revision. These include the use of "fallback-like" transliterations available as an option in ISO R/9, the predecessor of ISO 9, for the following characters:
0416; 0419; 0425; 0426; 0427; 0428; 0429; 042E and 042F (upper-case)
0436; 0439; 0445; 0446; 0447; 0448; 0449; 044E and 044F (lower-case).
It is also planned to document language-specific romanizations both in the revised ISO 9 and in this European fallback standard.
The main modification of ISO 9 is use of " ' " (APOSTROPHE) instead of the various diacritics specified in ISO 9. APOSTROPHE is mainly used for the fallback of less common characters, in order to maximise distinctiveness of characters, while maximising readability. In Cyrillic text there is very little use of APOSTROPHE within words, so ambiguous fallback text is unlikely as a result of this usage.
III.8 Restoring Cyrillic text from Latin script fallback text
Unequivocal reversible transformation from the Invariant Subset of ISO/IEC 646 back to the equivalent Cyrillic text is not possible, due to the number of Cyrillic characters with identical fallbacks.
However, it may still be possible to produce readable Cyrillic text when transforming back from fallback text, which may contain errors due to information loss, and to use other comparison methods, including the use of spelling checkers, to improve the text. Any such transformation back to Cyrillic script text would need to be a multiple-pass conversion, transforming the multiple-character fallbacks first.
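The multiple-pass conversion described above can be sketched as follows: one pass per fallback length, longest fallbacks first, so that a four-character fallback is not broken up by shorter ones. The mapping `EXAMPLE_TABLE` and the function `restore_multipass` are hypothetical; the values are illustrative placeholders only and are not the fallbacks of Table 1.

```python
# Illustrative placeholder values only (NOT taken from Table 1):
# Latin fallback -> Cyrillic character, lower case only.
EXAMPLE_TABLE = {
    "shch": "\u0449",  # SHCHA
    "zh": "\u0436",    # ZHE
    "ch": "\u0447",    # CHE
    "sh": "\u0448",    # SHA
    "a": "\u0430",     # A
    "b": "\u0431",     # BE
    "o": "\u043e",     # O
    "r": "\u0440",     # ER
    "s": "\u0441",     # ES
}


def restore_multipass(text: str, table: dict) -> str:
    """Convert fallback text back, one pass per fallback length.

    Longer fallbacks are handled in earlier passes, so "shch" is
    replaced as a unit before "sh" or "ch" can match inside it.
    """
    lengths = sorted({len(latin) for latin in table}, reverse=True)
    for length in lengths:
        for latin, cyrillic in table.items():
            if len(latin) == length:
                text = text.replace(latin, cyrillic)
    return text
```

With these placeholder values, `restore_multipass("borshch", EXAMPLE_TABLE)` gives the four letters BE, O, ER, SHCHA. Because several Cyrillic characters can share a fallback, the result of such a conversion may still need correction by other means.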
Specification of such comparison methods to improve the accuracy of text converted back to Cyrillic script is outside the scope of this standard.
III.9 One-to-one transliteration fallbacks
One-to-one transliteration fallbacks are discouraged because the resulting text is ambiguous. It is likely to be much more difficult to read, and to use for reconstruction of the original text, than is the case for Latin or Greek text, as there are more basic Cyrillic script letters in many languages than is the case for basic letters in Greek script or Latin script.
Note that the use of a one-to-one transliteration fallback would result in incorrect data if a similar Latin script to Cyrillic script conversion were to be applied. Use of one-to-one fallback is generally discouraged in this standard, and should be particularly avoided when dealing with Cyrillic text.
Table 2
ISO 843 Transcription modified for ISO/IEC 646 Invariant Subset fallback
Transliteration, and Transcription possibilities for character pairs
(transcription shown only where different from transliteration)
-------------------------------------------------------------------------
UCS ID      1st and 2nd letters   Transliterated   Transcription
-------------------------------------------------------------------------
0391 03B1   ALPHA                 A    a
            ALPHA + UPSILON       Ay   ay          AV   av    [1]
            ALPHA + UPSILON       Ay   ay          AF   af    [2]
-------------------------------------------------------------------------
0392 03B2   BETA                  V    v
-------------------------------------------------------------------------
0393 03B3   GAMMA                 G    g
            GAMMA + GAMMA         Gg   gg          NG   ng
            GAMMA + XI            Gx   gx          NX   nx
            GAMMA + CHI           Gch  gch         NCH  nch
-------------------------------------------------------------------------
0394 03B4   DELTA                 D    d
-------------------------------------------------------------------------
0395 03B5   EPSILON               E    e
            EPSILON + UPSILON     Ey   ey          EV   ev    [1]
            EPSILON + UPSILON     Ey   ey          EF   ef    [2]
-------------------------------------------------------------------------
0396 03B6   ZETA                  Z    z
-------------------------------------------------------------------------
0397 03B7   ETA                   I-   i-
            ETA + UPSILON         I-y  i-y         IV   iv    [1]
            ETA + UPSILON         I-y  i-y         IF   if    [2]
-------------------------------------------------------------------------
0398 03B8   THETA                 Th   th
-------------------------------------------------------------------------
0399 03B9   IOTA                  I    i
-------------------------------------------------------------------------
039A 03BA   KAPPA                 K    k
-------------------------------------------------------------------------
039B 03BB   LAMDA                 L    l
-------------------------------------------------------------------------
039C 03BC   MU                    M    m
            MU + PI               Mp   mp          B    b     [3]
-------------------------------------------------------------------------
039D 03BD   NU                    N    n
-------------------------------------------------------------------------
039E 03BE   XI                    X    x
-------------------------------------------------------------------------
039F 03BF   OMICRON               O    o
            OMICRON + UPSILON     Oy   oy          OU   ou    [4]
-------------------------------------------------------------------------
03A0 03C0   PI                    P    p
-------------------------------------------------------------------------
03A1 03C1   RHO                   R    r
-------------------------------------------------------------------------
03A3 03C3   SIGMA                 S    s                      [5]
-------------------------------------------------------------------------
03A4 03C4   TAU                   T    t
-------------------------------------------------------------------------
03A5 03C5   UPSILON               Y    y
-------------------------------------------------------------------------
03A6 03C6   PHI                   F    f
-------------------------------------------------------------------------
03A7 03C7   CHI                   Ch   ch
-------------------------------------------------------------------------
03A8 03C8   PSI                   Ps   ps
-------------------------------------------------------------------------
03A9 03C9   OMEGA                 O-   o-
-------------------------------------------------------------------------
[1] Before BETA, GAMMA, DELTA, ZETA, LAMDA, MU, NU, RHO and before all vowels, unless the fallback is from an accented UPSILON, when a simple transliteration is used.
[2] Before THETA, KAPPA, XI, PI, SIGMA, TAU, CHI, PSI and at the end of words, unless the fallback is from an accented UPSILON, when a simple transliteration is used.
[3] "B" at the beginning or end of words; "mp" in the middle of words. Note that 03D0 GREEK BETA SYMBOL is sometimes used as an explicit B, in cases where users want to make clear that it sounds like a B, in contrast to BETA (romanized as V). Unlike MU with PI, note that NU with TAU is not transcribed to LATIN LETTER D in ISO 843 or ELOT 743.
[4] If a vowel in the pair is accented, then a simple transliteration is used.
[5] 03C2 GREEK SMALL LETTER FINAL SIGMA only occurs at the end of words, so it is straightforward to distinguish 03C3 from 03C2 in reconversion, even though in Latin fallback text both default to a single character s.
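Notes [1] and [2] amount to a lookup on the letter that follows the UPSILON. The following is a minimal sketch for lower-case text, assuming the letter lists exactly as given in those notes; the names `PAIRS`, `BEFORE_V`, `BEFORE_F` and `transcribe_upsilon_pair` are hypothetical. Letters that appear in neither note (PHI, for example) are given the simple transliteration here, which is a choice of this sketch and not a rule stated in Table 2.

```python
import unicodedata
from typing import Optional

# first letter -> (simple transliteration, form per note [1], form per note [2])
PAIRS = {
    "\u03b1": ("ay", "av", "af"),   # ALPHA + UPSILON
    "\u03b5": ("ey", "ev", "ef"),   # EPSILON + UPSILON
    "\u03b7": ("i-y", "iv", "if"),  # ETA + UPSILON
}

# Note [1]: BETA, GAMMA, DELTA, ZETA, LAMDA, MU, NU, RHO and all vowels.
BEFORE_V = set("βγδζλμνρ") | set("αεηιουω")
# Note [2]: THETA, KAPPA, XI, PI, SIGMA (both forms), TAU, CHI, PSI.
BEFORE_F = set("θκξπσςτχψ")


def transcribe_upsilon_pair(first: str, upsilon: str,
                            following: Optional[str] = None) -> str:
    """Transcribe a vowel + UPSILON pair given the following letter.

    `following` is None when the pair stands at the end of a word.
    """
    translit, with_v, with_f = PAIRS[first]
    if upsilon != "\u03c5":
        # Accented UPSILON: simple transliteration (notes [1] and [2]).
        return translit
    if following is None:
        return with_f  # end of word, note [2]
    # Strip any accent from the following letter before the lookup.
    base = unicodedata.normalize("NFD", following)[0]
    if base in BEFORE_V:
        return with_v
    if base in BEFORE_F:
        return with_f
    return translit  # letter not covered by either note
```

For instance, ALPHA + UPSILON before TAU gives "af", EPSILON + UPSILON before RHO gives "ev", and ALPHA followed by an accented UPSILON gives "ay".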
ANNEX IV (Informative) Bibliography
1. ISO/IEC 8879, 1986 (SGML), Standard Generalized Markup Language
2. Satz- und Korrekturanweisungen mit ausführlicher Beispielsammlung, Publisher: Dudenverlag, Mannheim, Wien, Zürich, 1986
3. EWOS EG/ODA/91/Technical Guidance in Fallbacks for ENV 41 510: 1992
4. CWA TC304/MES-2:1999
5. Information Technology - Character Repertoire and Coding for Interworking with Telex services, EN 1922:1997
CHARACTERS AND FALLBACKS
MES-2 SOURCE CHARACTER | FALLBACK CHARACTERS | FALLBACK CHARACTER
Code | Glyph | Character name | One-to-many glyph(s) | One-to-one glyph
0020
SPACE
0021
EXCLAMATION MARK
0022
QUOTATION MARK
0023
NUMBER SIGN
0024
DOLLAR SIGN
0025
PERCENT SIGN
0026
AMPERSAND
0027
APOSTROPHE
0028
LEFT PARENTHESIS
0029
RIGHT PARENTHESIS
002A
ASTERISK
002B
PLUS SIGN
002C
COMMA
002D
HYPHEN-MINUS
002E
FULL STOP
002F
SOLIDUS
0030
DIGIT ZERO
0031
DIGIT ONE
0032
DIGIT TWO
0033
DIGIT THREE
0034
DIGIT FOUR
0035
DIGIT FIVE
0036
DIGIT SIX
0037
DIGIT SEVEN
0038
DIGIT EIGHT
0039
DIGIT NINE
003A
COLON
003B
SEMICOLON
003C
LESS-THAN SIGN
003D
EQUALS SIGN
003E
GREATER-THAN SIGN
003F
QUESTION MARK
0040
COMMERCIAL AT
0041
LATIN CAPITAL LETTER A
0042
LATIN CAPITAL LETTER B
0043
LATIN CAPITAL LETTER C
0044
LATIN CAPITAL LETTER D
0045
LATIN CAPITAL LETTER E
0046
LATIN CAPITAL LETTER F
0047
LATIN CAPITAL LETTER G
0048
LATIN CAPITAL LETTER H
0049
LATIN CAPITAL LETTER I
004A
LATIN CAPITAL LETTER J
004B
LATIN CAPITAL LETTER K
004C
LATIN CAPITAL LETTER L
004D
LATIN CAPITAL LETTER M
004E
LATIN CAPITAL LETTER N
004F
LATIN CAPITAL LETTER O
0050
LATIN CAPITAL LETTER P
0051
LATIN CAPITAL LETTER Q
0052
LATIN CAPITAL LETTER R
0053
LATIN CAPITAL LETTER S
0054
LATIN CAPITAL LETTER T
0055
LATIN CAPITAL LETTER U
0056
LATIN CAPITAL LETTER V
0057
LATIN CAPITAL LETTER W
0058
LATIN CAPITAL LETTER X
0059
LATIN CAPITAL LETTER Y
005A
LATIN CAPITAL LETTER Z
005B
LEFT SQUARE BRACKET
005C
REVERSE SOLIDUS
005D
RIGHT SQUARE BRACKET
005E
CIRCUMFLEX ACCENT
005F
LOW LINE
0060
GRAVE ACCENT
0061
LATIN SMALL LETTER A
0062
LATIN SMALL LETTER B
0063
LATIN SMALL LETTER C
0064
LATIN SMALL LETTER D
0065
LATIN SMALL LETTER E
0066
LATIN SMALL LETTER F
0067
LATIN SMALL LETTER G
0068
LATIN SMALL LETTER H
0069
LATIN SMALL LETTER I
006A
LATIN SMALL LETTER J
006B
LATIN SMALL LETTER K
006C
LATIN SMALL LETTER L
006D
LATIN SMALL LETTER M
006E
LATIN SMALL LETTER N
006F
LATIN SMALL LETTER O
0070
LATIN SMALL LETTER P
0071
LATIN SMALL LETTER Q
0072
LATIN SMALL LETTER R
0073
LATIN SMALL LETTER S
0074
LATIN SMALL LETTER T
0075
LATIN SMALL LETTER U
0076
LATIN SMALL LETTER V
0077
LATIN SMALL LETTER W
0078
LATIN SMALL LETTER X
0079
LATIN SMALL LETTER Y
007A
LATIN SMALL LETTER Z
007B
LEFT CURLY BRACKET
007C
VERTICAL LINE
007D
RIGHT CURLY BRACKET
007E
TILDE
00A0
NO-BREAK SPACE
00A1
INVERTED EXCLAMATION MARK
00A2
CENT SIGN
00A3
POUND SIGN
00A4
CURRENCY SIGN
00A5
YEN SIGN
00A6
BROKEN BAR
00A7
SECTION SIGN
00A8
DIAERESIS
00A9
COPYRIGHT SIGN
00AA
FEMININE ORDINAL INDICATOR
00AB
LEFT-POINTING DOUBLE ANGLE
QUOTATION MARK
00AC
NOT SIGN
00AD
SOFT HYPHEN
00AE
REGISTERED SIGN
00AF
MACRON
00B0
DEGREE SIGN
00B1
PLUS-MINUS SIGN
00B2
SUPERSCRIPT TWO
00B3
SUPERSCRIPT THREE
00B4
ACUTE ACCENT
00B5
MICRO SIGN
00B6
PILCROW SIGN
00B7
MIDDLE DOT
00B8
CEDILLA
00B9
SUPERSCRIPT ONE
00BA
MASCULINE ORDINAL INDICATOR
00BB
RIGHT-POINTING DOUBLE ANGLE
QUOTATION MARK
00BC
VULGAR FRACTION ONE QUARTER
00BD
VULGAR FRACTION ONE HALF
00BE
VULGAR FRACTION THREE
QUARTERS
00BF
INVERTED QUESTION MARK
00C0
LATIN CAPITAL LETTER A WITH
GRAVE
00C1
LATIN CAPITAL LETTER A WITH
ACUTE
00C2
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX
00C3
LATIN CAPITAL LETTER A WITH
TILDE
00C4
LATIN CAPITAL LETTER A WITH
DIAERESIS
00C5
LATIN CAPITAL LETTER A WITH
RING ABOVE
00C6
LATIN CAPITAL LETTER AE
00C7
LATIN CAPITAL LETTER C WITH
CEDILLA
00C8
LATIN CAPITAL LETTER E WITH
GRAVE
00C9
LATIN CAPITAL LETTER E WITH
ACUTE
00CA
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX
00CB
LATIN CAPITAL LETTER E WITH
DIAERESIS
00CC
LATIN CAPITAL LETTER I WITH
GRAVE
00CD
LATIN CAPITAL LETTER I WITH
ACUTE
00CE
LATIN CAPITAL LETTER I WITH
CIRCUMFLEX
00CF
LATIN CAPITAL LETTER I WITH
DIAERESIS
00D0
LATIN CAPITAL LETTER ETH
00D1
LATIN CAPITAL LETTER N WITH
TILDE
00D2
LATIN CAPITAL LETTER O WITH
GRAVE
00D3
LATIN CAPITAL LETTER O WITH
ACUTE
00D4
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX
00D5
LATIN CAPITAL LETTER O WITH
TILDE
00D6
LATIN CAPITAL LETTER O WITH
DIAERESIS
00D7
MULTIPLICATION SIGN
00D8
LATIN CAPITAL LETTER O WITH
STROKE
00D9
LATIN CAPITAL LETTER U WITH
GRAVE
00DA
LATIN CAPITAL LETTER U WITH
ACUTE
00DB
LATIN CAPITAL LETTER U WITH
CIRCUMFLEX
00DC
LATIN CAPITAL LETTER U WITH
DIAERESIS
00DD
LATIN CAPITAL LETTER Y WITH
ACUTE
00DE
LATIN CAPITAL LETTER THORN
00DF
LATIN SMALL LETTER SHARP S
00E0
LATIN SMALL LETTER A WITH
GRAVE
00E1
LATIN SMALL LETTER A WITH
ACUTE
00E2
LATIN SMALL LETTER A WITH
CIRCUMFLEX
00E3
LATIN SMALL LETTER A WITH
TILDE
00E4
LATIN SMALL LETTER A WITH
DIAERESIS
00E5
LATIN SMALL LETTER A WITH
RING ABOVE
00E6
LATIN SMALL LETTER AE
00E7
LATIN SMALL LETTER C WITH
CEDILLA
00E8
LATIN SMALL LETTER E WITH
GRAVE
00E9
LATIN SMALL LETTER E WITH
ACUTE
00EA
LATIN SMALL LETTER E WITH
CIRCUMFLEX
00EB
LATIN SMALL LETTER E WITH
DIAERESIS
00EC
LATIN SMALL LETTER I WITH
GRAVE
00ED
LATIN SMALL LETTER I WITH
ACUTE
00EE
LATIN SMALL LETTER I WITH
CIRCUMFLEX
00EF
LATIN SMALL LETTER I WITH
DIAERESIS
00F0
LATIN SMALL LETTER ETH
00F1
LATIN SMALL LETTER N WITH
TILDE
00F2
LATIN SMALL LETTER O WITH
GRAVE
00F3
LATIN SMALL LETTER O WITH
ACUTE
00F4
LATIN SMALL LETTER O WITH
CIRCUMFLEX
00F5
LATIN SMALL LETTER O WITH
TILDE
00F6
LATIN SMALL LETTER O WITH
DIAERESIS
00F7
DIVISION SIGN
00F8
LATIN SMALL LETTER O WITH
STROKE
00F9
LATIN SMALL LETTER U WITH
GRAVE
00FA
LATIN SMALL LETTER U WITH
ACUTE
00FB
LATIN SMALL LETTER U WITH
CIRCUMFLEX
00FC
LATIN SMALL LETTER U WITH
DIAERESIS
00FD
LATIN SMALL LETTER Y WITH
ACUTE
00FE
LATIN SMALL LETTER THORN
00FF
LATIN SMALL LETTER Y WITH
DIAERESIS
0100
LATIN CAPITAL LETTER A WITH
MACRON
0101
LATIN SMALL LETTER A WITH
MACRON
0102
LATIN CAPITAL LETTER A WITH
BREVE
0103
LATIN SMALL LETTER A WITH
BREVE
0104
LATIN CAPITAL LETTER A WITH
OGONEK
0105
LATIN SMALL LETTER A WITH
OGONEK
0106
LATIN CAPITAL LETTER C WITH
ACUTE
0107
LATIN SMALL LETTER C WITH
ACUTE
0108
LATIN CAPITAL LETTER C WITH
CIRCUMFLEX
0109
LATIN SMALL LETTER C WITH
CIRCUMFLEX
010A
LATIN CAPITAL LETTER C WITH
DOT ABOVE
010B
LATIN SMALL LETTER C WITH
DOT ABOVE
010C
LATIN CAPITAL LETTER C WITH
CARON
010D
LATIN SMALL LETTER C WITH
CARON
010E
LATIN CAPITAL LETTER D WITH
CARON
010F
LATIN SMALL LETTER D WITH
CARON
0110
LATIN CAPITAL LETTER D WITH
STROKE
0111
LATIN SMALL LETTER D WITH
STROKE
0112
LATIN CAPITAL LETTER E WITH
MACRON
0113
LATIN SMALL LETTER E WITH
MACRON
0114
LATIN CAPITAL LETTER E WITH
BREVE
0115
LATIN SMALL LETTER E WITH
BREVE
0116
LATIN CAPITAL LETTER E WITH
DOT ABOVE
0117
LATIN SMALL LETTER E WITH
DOT ABOVE
0118
LATIN CAPITAL LETTER E WITH
OGONEK
0119
LATIN SMALL LETTER E WITH
OGONEK
011A
LATIN CAPITAL LETTER E WITH
CARON
011B
LATIN SMALL LETTER E WITH
CARON
011C
LATIN CAPITAL LETTER G WITH
CIRCUMFLEX
011D
LATIN SMALL LETTER G WITH
CIRCUMFLEX
011E
LATIN CAPITAL LETTER G WITH
BREVE
011F
LATIN SMALL LETTER G WITH
BREVE
0120
LATIN CAPITAL LETTER G WITH
DOT ABOVE
0121
LATIN SMALL LETTER G WITH
DOT ABOVE
0122
LATIN CAPITAL LETTER G WITH
CEDILLA
0123
LATIN SMALL LETTER G WITH
CEDILLA
0124
LATIN CAPITAL LETTER H WITH
CIRCUMFLEX
0125
LATIN SMALL LETTER H WITH
CIRCUMFLEX
0126
LATIN CAPITAL LETTER H WITH
STROKE
0127
LATIN SMALL LETTER H WITH
STROKE
0128
LATIN CAPITAL LETTER I WITH
TILDE
0129
LATIN SMALL LETTER I WITH
TILDE
012A
LATIN CAPITAL LETTER I WITH
MACRON
012B
LATIN SMALL LETTER I WITH
MACRON
012C
LATIN CAPITAL LETTER I WITH
BREVE
012D
LATIN SMALL LETTER I WITH
BREVE
012E
LATIN CAPITAL LETTER I WITH
OGONEK
012F
LATIN SMALL LETTER I WITH
OGONEK
0130
LATIN CAPITAL LETTER I WITH
DOT ABOVE
0131
LATIN SMALL LETTER DOTLESS
I
0132
LATIN CAPITAL LIGATURE IJ
0133
LATIN SMALL LIGATURE IJ
0134
LATIN CAPITAL LETTER J WITH
CIRCUMFLEX
0135
LATIN SMALL LETTER J WITH
CIRCUMFLEX
0136
LATIN CAPITAL LETTER K WITH
CEDILLA
0137
LATIN SMALL LETTER K WITH
CEDILLA
0138
LATIN SMALL LETTER KRA
0139
LATIN CAPITAL LETTER L WITH
ACUTE
013A
LATIN SMALL LETTER L WITH
ACUTE
013B
LATIN CAPITAL LETTER L WITH
CEDILLA
013C
LATIN SMALL LETTER L WITH
CEDILLA
013D
LATIN CAPITAL LETTER L WITH
CARON
013E
LATIN SMALL LETTER L WITH
CARON
013F
LATIN CAPITAL LETTER L WITH
MIDDLE DOT
0140
LATIN SMALL LETTER L WITH
MIDDLE DOT
0141
LATIN CAPITAL LETTER L WITH
STROKE
0142
LATIN SMALL LETTER L WITH
STROKE
0143
LATIN CAPITAL LETTER N WITH
ACUTE
0144
LATIN SMALL LETTER N WITH
ACUTE
0145
LATIN CAPITAL LETTER N WITH
CEDILLA
0146
LATIN SMALL LETTER N WITH
CEDILLA
0147
LATIN CAPITAL LETTER N WITH
CARON
0148
LATIN SMALL LETTER N WITH
CARON
0149
LATIN SMALL LETTER N
PRECEDED BY APOSTROPHE
014A
LATIN CAPITAL LETTER ENG
014B
LATIN SMALL LETTER ENG
014C
LATIN CAPITAL LETTER O WITH
MACRON
014D
LATIN SMALL LETTER O WITH
MACRON
014E
LATIN CAPITAL LETTER O WITH
BREVE
014F
LATIN SMALL LETTER O WITH
BREVE
0150
LATIN CAPITAL LETTER O WITH
DOUBLE ACUTE
0151
LATIN SMALL LETTER O WITH
DOUBLE ACUTE
0152
LATIN CAPITAL LIGATURE OE
0153
LATIN SMALL LIGATURE OE
0154
LATIN CAPITAL LETTER R WITH
ACUTE
0155
LATIN SMALL LETTER R WITH
ACUTE
0156
LATIN CAPITAL LETTER R WITH
CEDILLA
0157
LATIN SMALL LETTER R WITH
CEDILLA
0158
LATIN CAPITAL LETTER R WITH
CARON
0159
LATIN SMALL LETTER R WITH
CARON
015A
LATIN CAPITAL LETTER S WITH
ACUTE
015B
LATIN SMALL LETTER S WITH
ACUTE
015C
LATIN CAPITAL LETTER S WITH
CIRCUMFLEX
015D
LATIN SMALL LETTER S WITH
CIRCUMFLEX
015E
LATIN CAPITAL LETTER S WITH
CEDILLA
015F
LATIN SMALL LETTER S WITH
CEDILLA
0160
LATIN CAPITAL LETTER S WITH
CARON
0161
LATIN SMALL LETTER S WITH
CARON
0162
LATIN CAPITAL LETTER T WITH
CEDILLA
0163
LATIN SMALL LETTER T WITH
CEDILLA
0164
LATIN CAPITAL LETTER T WITH
CARON
0165
LATIN SMALL LETTER T WITH
CARON
0166
LATIN CAPITAL LETTER T WITH
STROKE
0167
LATIN SMALL LETTER T WITH
STROKE
0168
LATIN CAPITAL LETTER U WITH
TILDE
0169
LATIN SMALL LETTER U WITH
TILDE
016A
LATIN CAPITAL LETTER U WITH
MACRON
016B
LATIN SMALL LETTER U WITH
MACRON
016C
LATIN CAPITAL LETTER U WITH
BREVE
016D
LATIN SMALL LETTER U WITH
BREVE
016E
LATIN CAPITAL LETTER U WITH
RING ABOVE
016F
LATIN SMALL LETTER U WITH
RING ABOVE
0170
LATIN CAPITAL LETTER U WITH
DOUBLE ACUTE
0171
LATIN SMALL LETTER U WITH
DOUBLE ACUTE
0172
LATIN CAPITAL LETTER U WITH
OGONEK
0173
LATIN SMALL LETTER U WITH
OGONEK
0174
LATIN CAPITAL LETTER W WITH
CIRCUMFLEX
0175
LATIN SMALL LETTER W WITH
CIRCUMFLEX
0176
LATIN CAPITAL LETTER Y WITH
CIRCUMFLEX
0177
LATIN SMALL LETTER Y WITH
CIRCUMFLEX
0178
LATIN CAPITAL LETTER Y WITH
DIAERESIS
0179
LATIN CAPITAL LETTER Z WITH
ACUTE
017A
LATIN SMALL LETTER Z WITH
ACUTE
017B
LATIN CAPITAL LETTER Z WITH
DOT ABOVE
017C
LATIN SMALL LETTER Z WITH
DOT ABOVE
017D
LATIN CAPITAL LETTER Z WITH
CARON
017E
LATIN SMALL LETTER Z WITH
CARON
017F
LATIN SMALL LETTER LONG S
018F
LATIN CAPITAL LETTER SCHWA
0192
LATIN SMALL LETTER F WITH
HOOK
01B7
LATIN CAPITAL LETTER EZH
01DE
LATIN CAPITAL LETTER A WITH
DIAERESIS AND MACRON
01DF
LATIN SMALL LETTER A WITH
DIAERESIS AND MACRON
01E0
LATIN CAPITAL LETTER A WITH
DOT ABOVE AND MACRON
01E1
LATIN SMALL LETTER A WITH
DOT ABOVE AND MACRON
01E2
LATIN CAPITAL LETTER AE
WITH MACRON
01E3
LATIN SMALL LETTER AE WITH
MACRON
01E4
LATIN CAPITAL LETTER G WITH
STROKE
01E5
LATIN SMALL LETTER G WITH
STROKE
01E6
LATIN CAPITAL LETTER G WITH
CARON
01E7
LATIN SMALL LETTER G WITH
CARON
01E8
LATIN CAPITAL LETTER K WITH
CARON
01E9
LATIN SMALL LETTER K WITH
CARON
01EA
LATIN CAPITAL LETTER O WITH
OGONEK
01EB
LATIN SMALL LETTER O WITH
OGONEK
01EC
LATIN CAPITAL LETTER O WITH
OGONEK AND MACRON
01ED
LATIN SMALL LETTER O WITH
OGONEK AND MACRON
01EE
LATIN CAPITAL LETTER EZH
WITH CARON
01EF
LATIN SMALL LETTER EZH WITH
CARON
01FA
LATIN CAPITAL LETTER A WITH
RING ABOVE AND ACUTE
01FB
LATIN SMALL LETTER A WITH
RING ABOVE AND ACUTE
01FC
LATIN CAPITAL LETTER AE
WITH ACUTE
01FD
LATIN SMALL LETTER AE WITH
ACUTE
01FE
LATIN CAPITAL LETTER O WITH
STROKE AND ACUTE
01FF
LATIN SMALL LETTER O WITH
STROKE AND ACUTE
0218
LATIN CAPITAL LETTER S WITH COMMA BELOW
0219
LATIN SMALL LETTER S WITH COMMA BELOW
021A
LATIN CAPITAL LETTER T WITH COMMA BELOW
021B
LATIN SMALL LETTER T WITH COMMA BELOW
021E
LATIN CAPITAL LETTER H WITH CARON
021F
LATIN SMALL LETTER H WITH CARON
0259
LATIN SMALL LETTER SCHWA
027C
LATIN SMALL LETTER R WITH
LONG LEG
0292
LATIN SMALL LETTER EZH
02BB
MODIFIER LETTER TURNED
COMMA
02BC
MODIFIER LETTER APOSTROPHE
02BD
MODIFIER LETTER REVERSED
COMMA
02C6
MODIFIER LETTER CIRCUMFLEX
ACCENT
02C7
CARON
02C9
MODIFIER LETTER MACRON
02D8
BREVE
02D9
DOT ABOVE
02DA
RING ABOVE
02DB
OGONEK
02DC
SMALL TILDE
02DD
DOUBLE ACUTE ACCENT
02EE
MODIFIER LETTER DOUBLE
APOSTROPHE
0374
GREEK NUMERAL SIGN
0375
GREEK LOWER NUMERAL SIGN
037A
GREEK YPOGEGRAMMENI
037E
GREEK QUESTION MARK
0384
GREEK TONOS
0385
GREEK DIALYTIKA TONOS
0386
GREEK CAPITAL LETTER ALPHA
WITH TONOS
0387
GREEK ANO TELEIA
0388
GREEK CAPITAL LETTER
EPSILON WITH TONOS
0389
GREEK CAPITAL LETTER ETA
WITH TONOS
038A
GREEK CAPITAL LETTER IOTA
WITH TONOS
038C
GREEK CAPITAL LETTER
OMICRON WITH TONOS
038E
GREEK CAPITAL LETTER
UPSILON WITH TONOS
038F
GREEK CAPITAL LETTER OMEGA
WITH TONOS
0390
GREEK SMALL LETTER IOTA
WITH DIALYTIKA AND TONOS
0391
GREEK CAPITAL LETTER ALPHA
0392
GREEK CAPITAL LETTER BETA
0393
GREEK CAPITAL LETTER GAMMA
0394
GREEK CAPITAL LETTER DELTA
0395
GREEK CAPITAL LETTER
EPSILON
0396
GREEK CAPITAL LETTER ZETA
0397
GREEK CAPITAL LETTER ETA
0398
GREEK CAPITAL LETTER THETA
0399
GREEK CAPITAL LETTER IOTA
039A
GREEK CAPITAL LETTER KAPPA
039B
GREEK CAPITAL LETTER LAMDA
039C
GREEK CAPITAL LETTER MU
039D
GREEK CAPITAL LETTER NU
039E
GREEK CAPITAL LETTER XI
039F
GREEK CAPITAL LETTER
OMICRON
03A0
GREEK CAPITAL LETTER PI
03A1
GREEK CAPITAL LETTER RHO
03A3
GREEK CAPITAL LETTER SIGMA
03A4
GREEK CAPITAL LETTER TAU
03A5
GREEK CAPITAL LETTER
UPSILON
03A6
GREEK CAPITAL LETTER PHI
03A7
GREEK CAPITAL LETTER CHI
03A8
GREEK CAPITAL LETTER PSI
03A9
GREEK CAPITAL LETTER OMEGA
03AA
GREEK CAPITAL LETTER IOTA
WITH DIALYTIKA
03AB
GREEK CAPITAL LETTER
UPSILON WITH DIALYTIKA
03AC
GREEK SMALL LETTER ALPHA
WITH TONOS
03AD
GREEK SMALL LETTER EPSILON
WITH TONOS
03AE
GREEK SMALL LETTER ETA WITH
TONOS
03AF
GREEK SMALL LETTER IOTA
WITH TONOS
03B0
GREEK SMALL LETTER UPSILON
WITH DIALYTIKA AND TONOS
03B1
GREEK SMALL LETTER ALPHA
03B2
GREEK SMALL LETTER BETA
03B3
GREEK SMALL LETTER GAMMA
03B4
GREEK SMALL LETTER DELTA
03B5
GREEK SMALL LETTER EPSILON
03B6
GREEK SMALL LETTER ZETA
03B7
GREEK SMALL LETTER ETA
03B8
GREEK SMALL LETTER THETA
03B9
GREEK SMALL LETTER IOTA
03BA
GREEK SMALL LETTER KAPPA
03BB
GREEK SMALL LETTER LAMDA
03BC
GREEK SMALL LETTER MU
03BD
GREEK SMALL LETTER NU
03BE
GREEK SMALL LETTER XI
03BF
GREEK SMALL LETTER OMICRON
03C0
GREEK SMALL LETTER PI
03C1
GREEK SMALL LETTER RHO
03C2
GREEK SMALL LETTER FINAL
SIGMA
03C3
GREEK SMALL LETTER SIGMA
03C4
GREEK SMALL LETTER TAU
03C5
GREEK SMALL LETTER UPSILON
03C6
GREEK SMALL LETTER PHI
03C7
GREEK SMALL LETTER CHI
03C8
GREEK SMALL LETTER PSI
03C9
GREEK SMALL LETTER OMEGA
03CA
GREEK SMALL LETTER IOTA
WITH DIALYTIKA
03CB
GREEK SMALL LETTER UPSILON
WITH DIALYTIKA
03CC
GREEK SMALL LETTER OMICRON
WITH TONOS
03CD
GREEK SMALL LETTER UPSILON
WITH TONOS
03CE
GREEK SMALL LETTER OMEGA
WITH TONOS
03D7
GREEK KAI SYMBOL
03DA
GREEK LETTER STIGMA
03DB
GREEK SMALL LETTER STIGMA
03DC
GREEK LETTER DIGAMMA
03DD
GREEK SMALL LETTER DIGAMMA
03DE
GREEK LETTER KOPPA
03DF
GREEK SMALL LETTER KOPPA
03E0
GREEK LETTER SAMPI
03E1
GREEK SMALL LETTER SAMPI
0400
CYRILLIC CAPITAL LETTER IE
WITH GRAVE
0401
CYRILLIC CAPITAL LETTER IO
0402
CYRILLIC CAPITAL LETTER DJE
0403
CYRILLIC CAPITAL LETTER GJE
0404
CYRILLIC CAPITAL LETTER
UKRAINIAN IE
0405
CYRILLIC CAPITAL LETTER DZE
0406
CYRILLIC CAPITAL LETTER
BYELORUSSIAN-UKRAINIAN I
0407
CYRILLIC CAPITAL LETTER YI
0408
CYRILLIC CAPITAL LETTER JE
0409
CYRILLIC CAPITAL LETTER LJE
040A
CYRILLIC CAPITAL LETTER NJE
040B
CYRILLIC CAPITAL LETTER
TSHE
040C
CYRILLIC CAPITAL LETTER KJE
040D
CYRILLIC CAPITAL LETTER I
WITH GRAVE
040E
CYRILLIC CAPITAL LETTER
SHORT U
040F
CYRILLIC CAPITAL LETTER
DZHE
0410
CYRILLIC CAPITAL LETTER A
0411
CYRILLIC CAPITAL LETTER BE
0412
CYRILLIC CAPITAL LETTER VE
0413
CYRILLIC CAPITAL LETTER GHE
0414
CYRILLIC CAPITAL LETTER DE
0415
CYRILLIC CAPITAL LETTER IE
0416
CYRILLIC CAPITAL LETTER ZHE
0417
CYRILLIC CAPITAL LETTER ZE
0418
CYRILLIC CAPITAL LETTER I
0419
CYRILLIC CAPITAL LETTER
SHORT I
041A
CYRILLIC CAPITAL LETTER KA
041B
CYRILLIC CAPITAL LETTER EL
041C
CYRILLIC CAPITAL LETTER EM
041D
CYRILLIC CAPITAL LETTER EN
041E
CYRILLIC CAPITAL LETTER O
041F
CYRILLIC CAPITAL LETTER PE
0420
CYRILLIC CAPITAL LETTER ER
0421
CYRILLIC CAPITAL LETTER ES
0422
CYRILLIC CAPITAL LETTER TE
0423
CYRILLIC CAPITAL LETTER U
0424
CYRILLIC CAPITAL LETTER EF
0425
CYRILLIC CAPITAL LETTER HA
0426
CYRILLIC CAPITAL LETTER TSE
0427
CYRILLIC CAPITAL LETTER CHE
0428
CYRILLIC CAPITAL LETTER SHA
0429
CYRILLIC CAPITAL LETTER
SHCHA
042A
CYRILLIC CAPITAL LETTER
HARD SIGN
042B
CYRILLIC CAPITAL LETTER
YERU
042C
CYRILLIC CAPITAL LETTER
SOFT SIGN
042D
CYRILLIC CAPITAL LETTER E
042E
CYRILLIC CAPITAL LETTER YU
042F
CYRILLIC CAPITAL LETTER YA
0430
CYRILLIC SMALL LETTER A
0431
CYRILLIC SMALL LETTER BE
0432
CYRILLIC SMALL LETTER VE
0433
CYRILLIC SMALL LETTER GHE
0434
CYRILLIC SMALL LETTER DE
0435
CYRILLIC SMALL LETTER IE
0436
CYRILLIC SMALL LETTER ZHE
0437
CYRILLIC SMALL LETTER ZE
0438
CYRILLIC SMALL LETTER I
0439
CYRILLIC SMALL LETTER SHORT
I
043A
CYRILLIC SMALL LETTER KA
043B
CYRILLIC SMALL LETTER EL
043C
CYRILLIC SMALL LETTER EM
043D
CYRILLIC SMALL LETTER EN
043E
CYRILLIC SMALL LETTER O
043F
CYRILLIC SMALL LETTER PE
0440
CYRILLIC SMALL LETTER ER
0441
CYRILLIC SMALL LETTER ES
0442
CYRILLIC SMALL LETTER TE
0443
CYRILLIC SMALL LETTER U
0444
CYRILLIC SMALL LETTER EF
0445
CYRILLIC SMALL LETTER HA
0446
CYRILLIC SMALL LETTER TSE
0447
CYRILLIC SMALL LETTER CHE
0448
CYRILLIC SMALL LETTER SHA
0449
CYRILLIC SMALL LETTER SHCHA
044A
CYRILLIC SMALL LETTER HARD
SIGN
044B
CYRILLIC SMALL LETTER YERU
044C
CYRILLIC SMALL LETTER SOFT
SIGN
044D
CYRILLIC SMALL LETTER E
044E
CYRILLIC SMALL LETTER YU
044F
CYRILLIC SMALL LETTER YA
0450
CYRILLIC SMALL LETTER IE
WITH GRAVE
0451
CYRILLIC SMALL LETTER IO
0452
CYRILLIC SMALL LETTER DJE
0453
CYRILLIC SMALL LETTER GJE
0454
CYRILLIC SMALL LETTER
UKRAINIAN IE
0455
CYRILLIC SMALL LETTER DZE
0456
CYRILLIC SMALL LETTER
BYELORUSSIAN-UKRAINIAN I
0457
CYRILLIC SMALL LETTER YI
0458
CYRILLIC SMALL LETTER JE
0459
CYRILLIC SMALL LETTER LJE
045A
CYRILLIC SMALL LETTER NJE
045B
CYRILLIC SMALL LETTER TSHE
045C
CYRILLIC SMALL LETTER KJE
045D
CYRILLIC SMALL LETTER I WITH GRAVE
045E
CYRILLIC SMALL LETTER SHORT
U
045F
CYRILLIC SMALL LETTER DZHE
048C
CYRILLIC CAPITAL LETTER SEMISOFT SIGN
048D
CYRILLIC SMALL LETTER SEMISOFT SIGN
048E
CYRILLIC CAPITAL LETTER ER WITH TICK
048F
CYRILLIC SMALL LETTER ER WITH TICK
0490
CYRILLIC CAPITAL LETTER GHE
WITH UPTURN
0491
CYRILLIC SMALL LETTER GHE
WITH UPTURN
0492
CYRILLIC CAPITAL LETTER GHE
WITH STROKE
0493
CYRILLIC SMALL LETTER GHE
WITH STROKE
0494
CYRILLIC CAPITAL LETTER GHE
WITH MIDDLE HOOK
0495
CYRILLIC SMALL LETTER GHE
WITH MIDDLE HOOK
0496
CYRILLIC CAPITAL LETTER ZHE
WITH DESCENDER
0497
CYRILLIC SMALL LETTER ZHE
WITH DESCENDER
0498
CYRILLIC CAPITAL LETTER ZE
WITH DESCENDER
0499
CYRILLIC SMALL LETTER ZE
WITH DESCENDER
049A
CYRILLIC CAPITAL LETTER KA
WITH DESCENDER
049B
CYRILLIC SMALL LETTER KA
WITH DESCENDER
049C
CYRILLIC CAPITAL LETTER KA
WITH VERTICAL STROKE
049D
CYRILLIC SMALL LETTER KA
WITH VERTICAL STROKE
049E
CYRILLIC CAPITAL LETTER KA
WITH STROKE
049F
CYRILLIC SMALL LETTER KA
WITH STROKE
04A0
CYRILLIC CAPITAL LETTER
BASHKIR KA
04A1
CYRILLIC SMALL LETTER
BASHKIR KA
04A2
CYRILLIC CAPITAL LETTER EN
WITH DESCENDER
04A3
CYRILLIC SMALL LETTER EN
WITH DESCENDER
04A4
CYRILLIC CAPITAL LIGATURE
EN GHE
04A5
CYRILLIC SMALL LIGATURE EN
GHE
04A6
CYRILLIC CAPITAL LETTER PE
WITH MIDDLE HOOK
04A7
CYRILLIC SMALL LETTER PE
WITH MIDDLE HOOK
04A8
CYRILLIC CAPITAL LETTER
ABKHASIAN HA
04A9
CYRILLIC SMALL LETTER
ABKHASIAN HA
04AA
CYRILLIC CAPITAL LETTER ES
WITH DESCENDER
04AB
CYRILLIC SMALL LETTER ES
WITH DESCENDER
04AC
CYRILLIC CAPITAL LETTER TE
WITH DESCENDER
04AD
CYRILLIC SMALL LETTER TE
WITH DESCENDER
04AE
CYRILLIC CAPITAL LETTER
STRAIGHT U
04AF
CYRILLIC SMALL LETTER
STRAIGHT U
04B0
CYRILLIC CAPITAL LETTER
STRAIGHT U WITH STROKE
04B1
CYRILLIC SMALL LETTER
STRAIGHT U WITH STROKE
04B2
CYRILLIC CAPITAL LETTER HA
WITH DESCENDER
04B3
CYRILLIC SMALL LETTER HA
WITH DESCENDER
04B4
CYRILLIC CAPITAL LIGATURE
TE TSE
04B5
CYRILLIC SMALL LIGATURE TE
TSE
04B6
CYRILLIC CAPITAL LETTER CHE
WITH DESCENDER
04B7
CYRILLIC SMALL LETTER CHE
WITH DESCENDER
04B8
CYRILLIC CAPITAL LETTER CHE
WITH VERTICAL STROKE
04B9
CYRILLIC SMALL LETTER CHE
WITH VERTICAL STROKE
04BA
CYRILLIC CAPITAL LETTER
SHHA
04BB
CYRILLIC SMALL LETTER SHHA
04BC
CYRILLIC CAPITAL LETTER
ABKHASIAN CHE
04BD
CYRILLIC SMALL LETTER
ABKHASIAN CHE
04BE
CYRILLIC CAPITAL LETTER
ABKHASIAN CHE WITH
DESCENDER
04BF
CYRILLIC SMALL LETTER
ABKHASIAN CHE WITH
DESCENDER
04C0
CYRILLIC LETTER PALOCHKA
04C1
CYRILLIC CAPITAL LETTER ZHE
WITH BREVE
04C2
CYRILLIC SMALL LETTER ZHE
WITH BREVE
04C3
CYRILLIC CAPITAL LETTER KA
WITH HOOK
04C4
CYRILLIC SMALL LETTER KA
WITH HOOK
04C7
CYRILLIC CAPITAL LETTER EN
WITH HOOK
04C8
CYRILLIC SMALL LETTER EN
WITH HOOK
04CB
CYRILLIC CAPITAL LETTER
KHAKASSIAN CHE
04CC
CYRILLIC SMALL LETTER
KHAKASSIAN CHE
04D0
CYRILLIC CAPITAL LETTER A
WITH BREVE
04D1
CYRILLIC SMALL LETTER A
WITH BREVE
04D2
CYRILLIC CAPITAL LETTER A
WITH DIAERESIS
04D3
CYRILLIC SMALL LETTER A
WITH DIAERESIS
04D4
CYRILLIC CAPITAL LIGATURE A
IE
04D5
CYRILLIC SMALL LIGATURE A
IE
04D6
CYRILLIC CAPITAL LETTER IE
WITH BREVE
04D7
CYRILLIC SMALL LETTER IE
WITH BREVE
04D8
CYRILLIC CAPITAL LETTER
SCHWA
04D9
CYRILLIC SMALL LETTER SCHWA
04DA
CYRILLIC CAPITAL LETTER
SCHWA WITH DIAERESIS
04DB
CYRILLIC SMALL LETTER SCHWA
WITH DIAERESIS
04DC
CYRILLIC CAPITAL LETTER ZHE
WITH DIAERESIS
04DD
CYRILLIC SMALL LETTER ZHE
WITH DIAERESIS
04DE
CYRILLIC CAPITAL LETTER ZE
WITH DIAERESIS
04DF
CYRILLIC SMALL LETTER ZE
WITH DIAERESIS
04E0
CYRILLIC CAPITAL LETTER
ABKHASIAN DZE
04E1
CYRILLIC SMALL LETTER
ABKHASIAN DZE
04E2
CYRILLIC CAPITAL LETTER I
WITH MACRON
04E3
CYRILLIC SMALL LETTER I
WITH MACRON
04E4
CYRILLIC CAPITAL LETTER I
WITH DIAERESIS
04E5
CYRILLIC SMALL LETTER I
WITH DIAERESIS
04E6
CYRILLIC CAPITAL LETTER O
WITH DIAERESIS
04E7
CYRILLIC SMALL LETTER O
WITH DIAERESIS
04E8
CYRILLIC CAPITAL LETTER
BARRED O
04E9
CYRILLIC SMALL LETTER
BARRED O
04EA
CYRILLIC CAPITAL LETTER
BARRED O WITH DIAERESIS
04EB
CYRILLIC SMALL LETTER
BARRED O WITH DIAERESIS
04EC
CYRILLIC CAPITAL LETTER E
WITH DIAERESIS
04ED
CYRILLIC SMALL LETTER E
WITH DIAERESIS
04EE
CYRILLIC CAPITAL LETTER U
WITH MACRON
04EF
CYRILLIC SMALL LETTER U
WITH MACRON
04F0
CYRILLIC CAPITAL LETTER U
WITH DIAERESIS
04F1
CYRILLIC SMALL LETTER U
WITH DIAERESIS
04F2
CYRILLIC CAPITAL LETTER U
WITH DOUBLE ACUTE
04F3
CYRILLIC SMALL LETTER U
WITH DOUBLE ACUTE
04F4
CYRILLIC CAPITAL LETTER CHE
WITH DIAERESIS
04F5
CYRILLIC SMALL LETTER CHE
WITH DIAERESIS
04F8
CYRILLIC CAPITAL LETTER
YERU WITH DIAERESIS
04F9
CYRILLIC SMALL LETTER YERU
WITH DIAERESIS
1E02
LATIN CAPITAL LETTER B WITH
DOT ABOVE
1E03
LATIN SMALL LETTER B WITH
DOT ABOVE
1E0A
LATIN CAPITAL LETTER D WITH
DOT ABOVE
1E0B
LATIN SMALL LETTER D WITH
DOT ABOVE
1E1E
LATIN CAPITAL LETTER F WITH
DOT ABOVE
1E1F
LATIN SMALL LETTER F WITH
DOT ABOVE
1E40
LATIN CAPITAL LETTER M WITH
DOT ABOVE
1E41
LATIN SMALL LETTER M WITH
DOT ABOVE
1E56
LATIN CAPITAL LETTER P WITH
DOT ABOVE
1E57
LATIN SMALL LETTER P WITH
DOT ABOVE
1E60
LATIN CAPITAL LETTER S WITH
DOT ABOVE
1E61
LATIN SMALL LETTER S WITH
DOT ABOVE
1E6A
LATIN CAPITAL LETTER T WITH
DOT ABOVE
1E6B
LATIN SMALL LETTER T WITH
DOT ABOVE
1E80
LATIN CAPITAL LETTER W WITH
GRAVE
1E81
LATIN SMALL LETTER W WITH
GRAVE
1E82
LATIN CAPITAL LETTER W WITH
ACUTE
1E83
LATIN SMALL LETTER W WITH
ACUTE
1E84
LATIN CAPITAL LETTER W WITH
DIAERESIS
1E85
LATIN SMALL LETTER W WITH
DIAERESIS
1E9B
LATIN SMALL LETTER LONG S
WITH DOT ABOVE
1EF2
LATIN CAPITAL LETTER Y WITH
GRAVE
1EF3
LATIN SMALL LETTER Y WITH
GRAVE
1F00
GREEK SMALL LETTER ALPHA
WITH PSILI
1F01
GREEK SMALL LETTER ALPHA
WITH DASIA
1F02
GREEK SMALL LETTER ALPHA
WITH PSILI AND VARIA
1F03
GREEK SMALL LETTER ALPHA
WITH DASIA AND VARIA
1F04
GREEK SMALL LETTER ALPHA
WITH PSILI AND OXIA
1F05
GREEK SMALL LETTER ALPHA
WITH DASIA AND OXIA
1F06
GREEK SMALL LETTER ALPHA
WITH PSILI AND PERISPOMENI
1F07
GREEK SMALL LETTER ALPHA
WITH DASIA AND PERISPOMENI
1F08
GREEK CAPITAL LETTER ALPHA
WITH PSILI
1F09
GREEK CAPITAL LETTER ALPHA
WITH DASIA
1F0A
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND VARIA
1F0B
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND VARIA
1F0C
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND OXIA
1F0D
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND OXIA
1F0E
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND PERISPOMENI
1F0F
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND PERISPOMENI
1F10
GREEK SMALL LETTER EPSILON
WITH PSILI
1F11
GREEK SMALL LETTER EPSILON
WITH DASIA
1F12
GREEK SMALL LETTER EPSILON
WITH PSILI AND VARIA
1F13
GREEK SMALL LETTER EPSILON
WITH DASIA AND VARIA
1F14
GREEK SMALL LETTER EPSILON
WITH PSILI AND OXIA
1F15
GREEK SMALL LETTER EPSILON
WITH DASIA AND OXIA
1F18
GREEK CAPITAL LETTER
EPSILON WITH PSILI
1F19
GREEK CAPITAL LETTER
EPSILON WITH DASIA
1F1A
GREEK CAPITAL LETTER
EPSILON WITH PSILI AND
VARIA
1F1B
GREEK CAPITAL LETTER
EPSILON WITH DASIA AND
VARIA
1F1C
GREEK CAPITAL LETTER
EPSILON WITH PSILI AND OXIA
1F1D
GREEK CAPITAL LETTER
EPSILON WITH DASIA AND OXIA
1F20
GREEK SMALL LETTER ETA WITH
PSILI
1F21
GREEK SMALL LETTER ETA WITH
DASIA
1F22
GREEK SMALL LETTER ETA WITH
PSILI AND VARIA
1F23
GREEK SMALL LETTER ETA WITH
DASIA AND VARIA
1F24
GREEK SMALL LETTER ETA WITH
PSILI AND OXIA
1F25
GREEK SMALL LETTER ETA WITH
DASIA AND OXIA
1F26
GREEK SMALL LETTER ETA WITH
PSILI AND PERISPOMENI
1F27
GREEK SMALL LETTER ETA WITH
DASIA AND PERISPOMENI
1F28
GREEK CAPITAL LETTER ETA
WITH PSILI
1F29
GREEK CAPITAL LETTER ETA
WITH DASIA
1F2A
GREEK CAPITAL LETTER ETA
WITH PSILI AND VARIA
1F2B
GREEK CAPITAL LETTER ETA
WITH DASIA AND VARIA
1F2C
GREEK CAPITAL LETTER ETA
WITH PSILI AND OXIA
1F2D
GREEK CAPITAL LETTER ETA
WITH DASIA AND OXIA
1F2E
GREEK CAPITAL LETTER ETA
WITH PSILI AND PERISPOMENI
1F2F
GREEK CAPITAL LETTER ETA
WITH DASIA AND PERISPOMENI
1F30
GREEK SMALL LETTER IOTA
WITH PSILI
1F31
GREEK SMALL LETTER IOTA
WITH DASIA
1F32
GREEK SMALL LETTER IOTA
WITH PSILI AND VARIA
1F33
GREEK SMALL LETTER IOTA
WITH DASIA AND VARIA
1F34
GREEK SMALL LETTER IOTA
WITH PSILI AND OXIA
1F35
GREEK SMALL LETTER IOTA
WITH DASIA AND OXIA
1F36
GREEK SMALL LETTER IOTA
WITH PSILI AND PERISPOMENI
1F37
GREEK SMALL LETTER IOTA
WITH DASIA AND PERISPOMENI
1F38
GREEK CAPITAL LETTER IOTA
WITH PSILI
1F39
GREEK CAPITAL LETTER IOTA
WITH DASIA
1F3A
GREEK CAPITAL LETTER IOTA
WITH PSILI AND VARIA
1F3B
GREEK CAPITAL LETTER IOTA
WITH DASIA AND VARIA
1F3C
GREEK CAPITAL LETTER IOTA
WITH PSILI AND OXIA
1F3D
GREEK CAPITAL LETTER IOTA
WITH DASIA AND OXIA
1F3E
GREEK CAPITAL LETTER IOTA
WITH PSILI AND PERISPOMENI
1F3F
GREEK CAPITAL LETTER IOTA
WITH DASIA AND PERISPOMENI
1F40
GREEK SMALL LETTER OMICRON
WITH PSILI
1F41
GREEK SMALL LETTER OMICRON
WITH DASIA
1F42
GREEK SMALL LETTER OMICRON
WITH PSILI AND VARIA
1F43
GREEK SMALL LETTER OMICRON
WITH DASIA AND VARIA
1F44
GREEK SMALL LETTER OMICRON
WITH PSILI AND OXIA
1F45
GREEK SMALL LETTER OMICRON
WITH DASIA AND OXIA
1F48
GREEK CAPITAL LETTER
OMICRON WITH PSILI
1F49
GREEK CAPITAL LETTER
OMICRON WITH DASIA
1F4A
GREEK CAPITAL LETTER
OMICRON WITH PSILI AND
VARIA
1F4B
GREEK CAPITAL LETTER
OMICRON WITH DASIA AND
VARIA
1F4C
GREEK CAPITAL LETTER
OMICRON WITH PSILI AND OXIA
1F4D
GREEK CAPITAL LETTER
OMICRON WITH DASIA AND OXIA
1F50
GREEK SMALL LETTER UPSILON
WITH PSILI
1F51
GREEK SMALL LETTER UPSILON
WITH DASIA
1F52
GREEK SMALL LETTER UPSILON
WITH PSILI AND VARIA
1F53
GREEK SMALL LETTER UPSILON
WITH DASIA AND VARIA
1F54
GREEK SMALL LETTER UPSILON
WITH PSILI AND OXIA
1F55
GREEK SMALL LETTER UPSILON
WITH DASIA AND OXIA
1F56
GREEK SMALL LETTER UPSILON
WITH PSILI AND PERISPOMENI
1F57
GREEK SMALL LETTER UPSILON
WITH DASIA AND PERISPOMENI
1F59
GREEK CAPITAL LETTER
UPSILON WITH DASIA
1F5B
GREEK CAPITAL LETTER
UPSILON WITH DASIA AND
VARIA
1F5D
GREEK CAPITAL LETTER
UPSILON WITH DASIA AND OXIA
1F5F
GREEK CAPITAL LETTER
UPSILON WITH DASIA AND
PERISPOMENI
1F60
GREEK SMALL LETTER OMEGA
WITH PSILI
1F61
GREEK SMALL LETTER OMEGA
WITH DASIA
1F62
GREEK SMALL LETTER OMEGA
WITH PSILI AND VARIA
1F63
GREEK SMALL LETTER OMEGA
WITH DASIA AND VARIA
1F64
GREEK SMALL LETTER OMEGA
WITH PSILI AND OXIA
1F65
GREEK SMALL LETTER OMEGA
WITH DASIA AND OXIA
1F66
GREEK SMALL LETTER OMEGA
WITH PSILI AND PERISPOMENI
1F67
GREEK SMALL LETTER OMEGA
WITH DASIA AND PERISPOMENI
1F68
GREEK CAPITAL LETTER OMEGA
WITH PSILI
1F69
GREEK CAPITAL LETTER OMEGA
WITH DASIA
1F6A
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND VARIA
1F6B
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND VARIA
1F6C
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND OXIA
1F6D
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND OXIA
1F6E
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND PERISPOMENI
1F6F
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND PERISPOMENI
1F70
GREEK SMALL LETTER ALPHA
WITH VARIA
1F71
GREEK SMALL LETTER ALPHA
WITH OXIA
1F72
GREEK SMALL LETTER EPSILON
WITH VARIA
1F73
GREEK SMALL LETTER EPSILON
WITH OXIA
1F74
GREEK SMALL LETTER ETA WITH
VARIA
1F75
GREEK SMALL LETTER ETA WITH
OXIA
1F76
GREEK SMALL LETTER IOTA
WITH VARIA
1F77
GREEK SMALL LETTER IOTA
WITH OXIA
1F78
GREEK SMALL LETTER OMICRON
WITH VARIA
1F79
GREEK SMALL LETTER OMICRON
WITH OXIA
1F7A
GREEK SMALL LETTER UPSILON
WITH VARIA
1F7B
GREEK SMALL LETTER UPSILON
WITH OXIA
1F7C
GREEK SMALL LETTER OMEGA
WITH VARIA
1F7D
GREEK SMALL LETTER OMEGA
WITH OXIA
1F80
GREEK SMALL LETTER ALPHA
WITH PSILI AND
YPOGEGRAMMENI
1F81
GREEK SMALL LETTER ALPHA
WITH DASIA AND
YPOGEGRAMMENI
1F82
GREEK SMALL LETTER ALPHA
WITH PSILI AND VARIA AND
YPOGEGRAMMENI
1F83
GREEK SMALL LETTER ALPHA
WITH DASIA AND VARIA AND
YPOGEGRAMMENI
1F84
GREEK SMALL LETTER ALPHA
WITH PSILI AND OXIA AND
YPOGEGRAMMENI
1F85
GREEK SMALL LETTER ALPHA
WITH DASIA AND OXIA AND
YPOGEGRAMMENI
1F86
GREEK SMALL LETTER ALPHA
WITH PSILI AND PERISPOMENI
AND YPOGEGRAMMENI
1F87
GREEK SMALL LETTER ALPHA
WITH DASIA AND PERISPOMENI
AND YPOGEGRAMMENI
1F88
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND
PROSGEGRAMMENI
1F89
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND
PROSGEGRAMMENI
1F8A
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND VARIA AND
PROSGEGRAMMENI
1F8B
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND VARIA AND
PROSGEGRAMMENI
1F8C
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND OXIA AND
PROSGEGRAMMENI
1F8D
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND OXIA AND
PROSGEGRAMMENI
1F8E
GREEK CAPITAL LETTER ALPHA
WITH PSILI AND PERISPOMENI
AND PROSGEGRAMMENI
1F8F
GREEK CAPITAL LETTER ALPHA
WITH DASIA AND PERISPOMENI
AND PROSGEGRAMMENI
1F90
GREEK SMALL LETTER ETA WITH
PSILI AND YPOGEGRAMMENI
1F91
GREEK SMALL LETTER ETA WITH
DASIA AND YPOGEGRAMMENI
1F92
GREEK SMALL LETTER ETA WITH
PSILI AND VARIA AND
YPOGEGRAMMENI
1F93
GREEK SMALL LETTER ETA WITH
DASIA AND VARIA AND
YPOGEGRAMMENI
1F94
GREEK SMALL LETTER ETA WITH
PSILI AND OXIA AND
YPOGEGRAMMENI
1F95
GREEK SMALL LETTER ETA WITH
DASIA AND OXIA AND
YPOGEGRAMMENI
1F96
GREEK SMALL LETTER ETA WITH
PSILI AND PERISPOMENI AND
YPOGEGRAMMENI
1F97
GREEK SMALL LETTER ETA WITH
DASIA AND PERISPOMENI AND
YPOGEGRAMMENI
1F98
GREEK CAPITAL LETTER ETA
WITH PSILI AND
PROSGEGRAMMENI
1F99
GREEK CAPITAL LETTER ETA
WITH DASIA AND
PROSGEGRAMMENI
1F9A
GREEK CAPITAL LETTER ETA
WITH PSILI AND VARIA AND
PROSGEGRAMMENI
1F9B
GREEK CAPITAL LETTER ETA
WITH DASIA AND VARIA AND
PROSGEGRAMMENI
1F9C
GREEK CAPITAL LETTER ETA
WITH PSILI AND OXIA AND
PROSGEGRAMMENI
1F9D
GREEK CAPITAL LETTER ETA
WITH DASIA AND OXIA AND
PROSGEGRAMMENI
1F9E
GREEK CAPITAL LETTER ETA
WITH PSILI AND PERISPOMENI
AND PROSGEGRAMMENI
1F9F
GREEK CAPITAL LETTER ETA
WITH DASIA AND PERISPOMENI
AND PROSGEGRAMMENI
1FA0
GREEK SMALL LETTER OMEGA
WITH PSILI AND
YPOGEGRAMMENI
1FA1
GREEK SMALL LETTER OMEGA
WITH DASIA AND
YPOGEGRAMMENI
1FA2
GREEK SMALL LETTER OMEGA
WITH PSILI AND VARIA AND
YPOGEGRAMMENI
1FA3
GREEK SMALL LETTER OMEGA
WITH DASIA AND VARIA AND
YPOGEGRAMMENI
1FA4
GREEK SMALL LETTER OMEGA
WITH PSILI AND OXIA AND
YPOGEGRAMMENI
1FA5
GREEK SMALL LETTER OMEGA
WITH DASIA AND OXIA AND
YPOGEGRAMMENI
1FA6
GREEK SMALL LETTER OMEGA
WITH PSILI AND PERISPOMENI
AND YPOGEGRAMMENI
1FA7
GREEK SMALL LETTER OMEGA
WITH DASIA AND PERISPOMENI
AND YPOGEGRAMMENI
1FA8
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND
PROSGEGRAMMENI
1FA9
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND
PROSGEGRAMMENI
1FAA
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND VARIA AND
PROSGEGRAMMENI
1FAB
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND VARIA AND
PROSGEGRAMMENI
1FAC
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND OXIA AND
PROSGEGRAMMENI
1FAD
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND OXIA AND
PROSGEGRAMMENI
1FAE
GREEK CAPITAL LETTER OMEGA
WITH PSILI AND PERISPOMENI
AND PROSGEGRAMMENI
1FAF
GREEK CAPITAL LETTER OMEGA
WITH DASIA AND PERISPOMENI
AND PROSGEGRAMMENI
1FB0
GREEK SMALL LETTER ALPHA
WITH VRACHY
1FB1
GREEK SMALL LETTER ALPHA
WITH MACRON
1FB2
GREEK SMALL LETTER ALPHA
WITH VARIA AND
YPOGEGRAMMENI
1FB3
GREEK SMALL LETTER ALPHA
WITH YPOGEGRAMMENI
1FB4
GREEK SMALL LETTER ALPHA
WITH OXIA AND YPOGEGRAMMENI
1FB6
GREEK SMALL LETTER ALPHA
WITH PERISPOMENI
1FB7
GREEK SMALL LETTER ALPHA
WITH PERISPOMENI AND
YPOGEGRAMMENI
1FB8
GREEK CAPITAL LETTER ALPHA
WITH VRACHY
1FB9
GREEK CAPITAL LETTER ALPHA
WITH MACRON
1FBA
GREEK CAPITAL LETTER ALPHA
WITH VARIA
1FBB
GREEK CAPITAL LETTER ALPHA
WITH OXIA
1FBC
GREEK CAPITAL LETTER ALPHA
WITH PROSGEGRAMMENI
1FBD
GREEK KORONIS
1FBE
GREEK PROSGEGRAMMENI
1FBF
GREEK PSILI
1FC0
GREEK PERISPOMENI
1FC1
GREEK DIALYTIKA AND
PERISPOMENI
1FC2
GREEK SMALL LETTER ETA WITH
VARIA AND YPOGEGRAMMENI
1FC3
GREEK SMALL LETTER ETA WITH
YPOGEGRAMMENI
1FC4
GREEK SMALL LETTER ETA WITH
OXIA AND YPOGEGRAMMENI
1FC6
GREEK SMALL LETTER ETA WITH
PERISPOMENI
1FC7
GREEK SMALL LETTER ETA WITH
PERISPOMENI AND
YPOGEGRAMMENI
1FC8
GREEK CAPITAL LETTER
EPSILON WITH VARIA
1FC9
GREEK CAPITAL LETTER
EPSILON WITH OXIA
1FCA
GREEK CAPITAL LETTER ETA
WITH VARIA
1FCB
GREEK CAPITAL LETTER ETA
WITH OXIA
1FCC
GREEK CAPITAL LETTER ETA
WITH PROSGEGRAMMENI
1FCD
GREEK PSILI AND VARIA
1FCE
GREEK PSILI AND OXIA
1FCF
GREEK PSILI AND PERISPOMENI
1FD0
GREEK SMALL LETTER IOTA
WITH VRACHY
1FD1
GREEK SMALL LETTER IOTA
WITH MACRON
1FD2
GREEK SMALL LETTER IOTA
WITH DIALYTIKA AND VARIA
1FD3
GREEK SMALL LETTER IOTA
WITH DIALYTIKA AND OXIA
1FD6
GREEK SMALL LETTER IOTA
WITH PERISPOMENI
1FD7
GREEK SMALL LETTER IOTA
WITH DIALYTIKA AND
PERISPOMENI
1FD8
GREEK CAPITAL LETTER IOTA
WITH VRACHY
1FD9
GREEK CAPITAL LETTER IOTA
WITH MACRON
1FDA
GREEK CAPITAL LETTER IOTA
WITH VARIA
1FDB
GREEK CAPITAL LETTER IOTA
WITH OXIA
1FDD
GREEK DASIA AND VARIA
1FDE
GREEK DASIA AND OXIA
1FDF
GREEK DASIA AND PERISPOMENI
1FE0
GREEK SMALL LETTER UPSILON
WITH VRACHY
1FE1
GREEK SMALL LETTER UPSILON
WITH MACRON
1FE2
GREEK SMALL LETTER UPSILON
WITH DIALYTIKA AND VARIA
1FE3
GREEK SMALL LETTER UPSILON
WITH DIALYTIKA AND OXIA
1FE4
GREEK SMALL LETTER RHO WITH
PSILI
1FE5
GREEK SMALL LETTER RHO WITH
DASIA
1FE6
GREEK SMALL LETTER UPSILON
WITH PERISPOMENI
1FE7
GREEK SMALL LETTER UPSILON
WITH DIALYTIKA AND
PERISPOMENI
1FE8
GREEK CAPITAL LETTER
UPSILON WITH VRACHY
1FE9
GREEK CAPITAL LETTER
UPSILON WITH MACRON
1FEA
GREEK CAPITAL LETTER
UPSILON WITH VARIA
1FEB
GREEK CAPITAL LETTER
UPSILON WITH OXIA
1FEC
GREEK CAPITAL LETTER RHO
WITH DASIA
1FED
GREEK DIALYTIKA AND VARIA
1FEE
GREEK DIALYTIKA AND OXIA
1FEF
GREEK VARIA
1FF2
GREEK SMALL LETTER OMEGA
WITH VARIA AND
YPOGEGRAMMENI
1FF3
GREEK SMALL LETTER OMEGA
WITH YPOGEGRAMMENI
1FF4
GREEK SMALL LETTER OMEGA
WITH OXIA AND YPOGEGRAMMENI
1FF6
GREEK SMALL LETTER OMEGA
WITH PERISPOMENI
1FF7
GREEK SMALL LETTER OMEGA
WITH PERISPOMENI AND
YPOGEGRAMMENI
1FF8
GREEK CAPITAL LETTER
OMICRON WITH VARIA
1FF9
GREEK CAPITAL LETTER
OMICRON WITH OXIA
1FFA
GREEK CAPITAL LETTER OMEGA
WITH VARIA
1FFB
GREEK CAPITAL LETTER OMEGA
WITH OXIA
1FFC
GREEK CAPITAL LETTER OMEGA
WITH PROSGEGRAMMENI
1FFD
GREEK OXIA
1FFE
GREEK DASIA
2013
EN DASH
2014
EM DASH
2015
HORIZONTAL BAR
2017
DOUBLE LOW LINE
2018
LEFT SINGLE QUOTATION MARK
2019
RIGHT SINGLE QUOTATION MARK
201A
SINGLE LOW-9 QUOTATION MARK
201B
SINGLE HIGH-REVERSED-9
QUOTATION MARK
201C
LEFT DOUBLE QUOTATION MARK
201D
RIGHT DOUBLE QUOTATION MARK
201E
DOUBLE LOW-9 QUOTATION MARK
2020
DAGGER
2021
DOUBLE DAGGER
2022
BULLET
2026
HORIZONTAL ELLIPSIS
2030
PER MILLE SIGN
2032
PRIME
2033
DOUBLE PRIME
2039
SINGLE LEFT-POINTING ANGLE
QUOTATION MARK
203A
SINGLE RIGHT-POINTING ANGLE
QUOTATION MARK
203C
DOUBLE EXCLAMATION MARK
203E
OVERLINE
2044
FRACTION SLASH
204A
TIRONIAN SIGN ET
207F
SUPERSCRIPT LATIN SMALL
LETTER N
2082
SUBSCRIPT TWO
20A3
FRENCH FRANC SIGN
20A4
LIRA SIGN
20A7
PESETA SIGN
20AC
EURO SIGN
20AF
DRACHMA SIGN
2105
CARE OF
2116
NUMERO SIGN
2122
TRADE MARK SIGN
2126
OHM SIGN
215B
VULGAR FRACTION ONE EIGHTH
215C
VULGAR FRACTION THREE
EIGHTHS
215D
VULGAR FRACTION FIVE
EIGHTHS
215E
VULGAR FRACTION SEVEN
EIGHTHS
2190
LEFTWARDS ARROW
2191
UPWARDS ARROW
2192
RIGHTWARDS ARROW
2193
DOWNWARDS ARROW
2194
LEFT RIGHT ARROW
2195
UP DOWN ARROW
21A8
UP DOWN ARROW WITH BASE
2200
FOR ALL
2202
PARTIAL DIFFERENTIAL
2203
THERE EXISTS
2206
INCREMENT
2208
ELEMENT OF
2209
NOT AN ELEMENT OF
220F
N-ARY PRODUCT
2211
N-ARY SUMMATION
2212
MINUS SIGN
2219
BULLET OPERATOR
221A
SQUARE ROOT
221E
INFINITY
221F
RIGHT ANGLE
2227
LOGICAL AND
2228
LOGICAL OR
2229
INTERSECTION
222A
UNION
222B
INTEGRAL
2248
ALMOST EQUAL TO
2259
ESTIMATES
2260
NOT EQUAL TO
2261
IDENTICAL TO
2264
LESS-THAN OR EQUAL TO
2265
GREATER-THAN OR EQUAL TO
2282
SUBSET OF
2283
SUPERSET OF
2295
CIRCLED PLUS
2297
CIRCLED TIMES
2302
HOUSE
2310
REVERSED NOT SIGN
2320
TOP HALF INTEGRAL
2321
BOTTOM HALF INTEGRAL
2329
LEFT-POINTING ANGLE BRACKET
232A
RIGHT-POINTING ANGLE
BRACKET
2500
BOX DRAWINGS LIGHT
HORIZONTAL
2502
BOX DRAWINGS LIGHT VERTICAL
250C
BOX DRAWINGS LIGHT DOWN AND
RIGHT
2510
BOX DRAWINGS LIGHT DOWN AND
LEFT
2514
BOX DRAWINGS LIGHT UP AND
RIGHT
2518
BOX DRAWINGS LIGHT UP AND
LEFT
251C
BOX DRAWINGS LIGHT VERTICAL
AND RIGHT
2524
BOX DRAWINGS LIGHT VERTICAL
AND LEFT
252C
BOX DRAWINGS LIGHT DOWN AND
HORIZONTAL
2534
BOX DRAWINGS LIGHT UP AND
HORIZONTAL
253C
BOX DRAWINGS LIGHT VERTICAL
AND HORIZONTAL
2550
BOX DRAWINGS DOUBLE
HORIZONTAL
2551
BOX DRAWINGS DOUBLE
VERTICAL
2552
BOX DRAWINGS DOWN SINGLE
AND RIGHT DOUBLE
2553
BOX DRAWINGS DOWN DOUBLE
AND RIGHT SINGLE
2554
BOX DRAWINGS DOUBLE DOWN
AND RIGHT
2555
BOX DRAWINGS DOWN SINGLE
AND LEFT DOUBLE
2556
BOX DRAWINGS DOWN DOUBLE
AND LEFT SINGLE
2557
BOX DRAWINGS DOUBLE DOWN
AND LEFT
2558
BOX DRAWINGS UP SINGLE AND
RIGHT DOUBLE
2559
BOX DRAWINGS UP DOUBLE AND
RIGHT SINGLE
255A
BOX DRAWINGS DOUBLE UP AND
RIGHT
255B
BOX DRAWINGS UP SINGLE AND
LEFT DOUBLE
255C
BOX DRAWINGS UP DOUBLE AND
LEFT SINGLE
255D
BOX DRAWINGS DOUBLE UP AND
LEFT
255E
BOX DRAWINGS VERTICAL
SINGLE AND RIGHT DOUBLE
255F
BOX DRAWINGS VERTICAL
DOUBLE AND RIGHT SINGLE
2560
BOX DRAWINGS DOUBLE
VERTICAL AND RIGHT
2561
BOX DRAWINGS VERTICAL
SINGLE AND LEFT DOUBLE
2562
BOX DRAWINGS VERTICAL
DOUBLE AND LEFT SINGLE
2563
BOX DRAWINGS DOUBLE
VERTICAL AND LEFT
2564
BOX DRAWINGS DOWN SINGLE
AND HORIZONTAL DOUBLE
2565
BOX DRAWINGS DOWN DOUBLE
AND HORIZONTAL SINGLE
2566
BOX DRAWINGS DOUBLE DOWN
AND HORIZONTAL
2567
BOX DRAWINGS UP SINGLE AND
HORIZONTAL DOUBLE
2568
BOX DRAWINGS UP DOUBLE AND
HORIZONTAL SINGLE
2569
BOX DRAWINGS DOUBLE UP AND
HORIZONTAL
256A
BOX DRAWINGS VERTICAL
SINGLE AND HORIZONTAL
DOUBLE
256B
BOX DRAWINGS VERTICAL
DOUBLE AND HORIZONTAL
SINGLE
256C
BOX DRAWINGS DOUBLE
VERTICAL AND HORIZONTAL
2580
UPPER HALF BLOCK
2584
LOWER HALF BLOCK
2588
FULL BLOCK
258C
LEFT HALF BLOCK
2590
RIGHT HALF BLOCK
2591
LIGHT SHADE
2592
MEDIUM SHADE
2593
DARK SHADE
25A0
BLACK SQUARE
25AC
BLACK RECTANGLE
25B2
BLACK UP-POINTING TRIANGLE
25BA
BLACK RIGHT-POINTING
POINTER
25BC
BLACK DOWN-POINTING
TRIANGLE
25C4
BLACK LEFT-POINTING POINTER
25CA
LOZENGE
25CB
WHITE CIRCLE
25D8
INVERSE BULLET
25D9
INVERSE WHITE CIRCLE
263A
WHITE SMILING FACE
263B
BLACK SMILING FACE
263C
WHITE SUN WITH RAYS
2640
FEMALE SIGN
2642
MALE SIGN
2660
BLACK SPADE SUIT
2663
BLACK CLUB SUIT
2665
BLACK HEART SUIT
2666
BLACK DIAMOND SUIT
266A
EIGHTH NOTE
266B
BEAMED EIGHTH NOTES
FB01
LATIN SMALL LIGATURE FI
FB02
LATIN SMALL LIGATURE FL
FFFD
REPLACEMENT CHARACTER
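
NOTE (informative sketch, not part of the repertoire): The listing above gives each code position on one line, followed by the character name wrapped over one to three lines. An implementer who wishes to load the listing into a program could rejoin the wrapped names roughly as follows. The function names parse_repertoire and mismatches are hypothetical and are introduced here for illustration only. The sketch assumes that a line consisting of exactly four hexadecimal digits always starts a new entry, and it compares names against whatever Unicode database the local Python installation ships, which may differ from the edition of ISO/IEC 10646 this draft refers to.

```python
import re
import unicodedata

# Assumption: a line of exactly four hex digits is a code position,
# and every other non-empty line continues the current character name.
HEX_LINE = re.compile(r"^[0-9A-F]{4}$")


def parse_repertoire(lines):
    """Rejoin wrapped name lines into (code point, name) pairs."""
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if HEX_LINE.match(line):
            entries.append([int(line, 16), ""])
        elif entries:
            # Continuation line: append to the name of the latest entry.
            entries[-1][1] = (entries[-1][1] + " " + line).strip()
    return [(code, name) for code, name in entries]


def mismatches(entries):
    """List entries whose name differs from the local Unicode database."""
    result = []
    for code, name in entries:
        known = unicodedata.name(chr(code), "")
        if known != name:
            result.append((code, name, known))
    return result


sample = """1F1A
GREEK CAPITAL LETTER
EPSILON WITH PSILI AND
VARIA
2013
EN DASH
FB01
LATIN SMALL LIGATURE FI""".splitlines()

entries = parse_repertoire(sample)
for code, name in entries:
    print(f"{code:04X}  {name}")
```

A non-empty result from mismatches(entries) does not by itself indicate an error in the listing; it only flags entries worth checking by hand against the referenced edition of the character set standard.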