Chinese Domain Name Consortium (CDNC) Status Update for IDN CDNC Vincent WS Chen, TWNIC 2003.03.25

Preview:

Citation preview

Chinese Domain Name Consortium (CDNC)

Status Update for IDN

CDNCVincent WS Chen, TWNIC

2003.03.25

CDNC Status update CDNC Introduction Major Activities on IDN/CDN Status of Chinese Variant Table Issues Next Step

The Chinese Domain Name Consortium (CDNC)

Since May, 2000 CDNC was founded by CNNIC, TWNIC, HKNIC, MO

NIC in Beijing. Currently co-chairs

Prof. Hua-lin Qian (CNNIC) Dr. Shian-shyong Tseng (TWNIC)

Members CNNIC, TWNIC, HKNIC, MONIC Other organizations with DN technology and kno

wledge, which are interested in CDN ( Chinese domain name ) .

Other related Organizations JET

Joint Engineer Task force for IDN effort Members: TWNIC, CNNIC, JPNIC, KRNIC

TWNIC IDN Task force Since May, 2000 Mission

Study CDN technology and implementation solutions TW Variant Table Working Group

Since Feb. 2002 Mission

Edit Traditional Chinese Variant Table and coordinate Simplified Chinese Variant Table with CNNIC for CDN

Major Activities on IDN/CDN CDNC announcement on current IDN solution

in Feb, and June 2002 No solution for Chinese TC/SC equivalence in IETF

architecture No complementary documents for corresponding I

DN registration and administration Aug. 2002, 8th CDNC meeting in Yokohama

Joint development for CDN client software Modification on algorithm of IDN Admin. guideline

draft Consider to make Chinese variant table

Oct. 2002, 9th CDNC meeting in shanghai Discussion on Chinese character variant tables

Plan to merge CN/TW tables into one table in Dec. 2002 Development of client and its supporting system Suggestions on options of the algorithm of

guideline draft Feb. 2003, 10th CDNC meeting in Taipei

Merge CN/TW tables CDN demonstration system to get user experience

for draft table usage

Major Activities on IDN/CDN

IETF draft Internationalized Domain Names and Unique Identifiers/Names(u

name) Traditional and Simplified Chinese Conversion(TSCONV) Requirements of Chinese Domain Name(CDNREQ) Phased Implementation for Internationalized Domain Names in A

pplications (PIIDNA) And other drafts

IDN Admin Guideline http://www1.ietf.org/mail-archive/ietf-announce/Current/msg18

308.html http://www1.ietf.org/mail-archive/ietf-announce/Current/msg20

812.html http://www.ietf.org/internet-drafts/draft-jseng-idn-admin-02.txt

Major Activities on IDN/CDN

TC/SC interchangeable is the must Chinese variant table for CDN

registration and administration TW variant table CN variant table Chinese variant table (CVT)

One Table includes TW and CN variant table

Status of Chinese Variant Table

Why TC/SC In 1956 and 1964, mainland China pub

lish “A Complete Set of Simplified Chinese Characters” It convert some Tradition Chinese to Sim

plified Chinese Ex:

釘 钉許 许

TC/SC mapping TC SC : 1 to 1 mapping TC SC : many to 1 mapping

Ex:發 发髮 发

TC SC : 1 to many mapping Ex:

餘 馀餘 余

TW variant Table Working Group The 16th TWVT WG meeting has held at

February 20, 2003 Members

Linguist from Academia SINICA Computer expert from National Central University

(NCU) Linguist from Taipei Computer Association (TCA) Linguist from CMEX (Foundation) Linguist from Directorate General of Budget

Accounting and Statistics Executive Yuan R.O.C. (DGBAS)

Linguist from IBM Taiwan

Status of TW Variant Table Submit revised draft table to the Bureau

of Standard, Taiwan June, 2002 October, 2002 December, 2002

Next revised draft table Expected to submit in this March After reviewed by the Bureau of Standard,

Taiwan the Chinese Variant Table would become CNS standard

Status of CN Variant Table Invite linguists as the advisers to revie

w the variant table created by TW variant Table Working Group

CN variant table 1st version has been finished in last month

Based on table of TWNIC, minor modification will be adopted Will be completed soon

CVT Structure and Definition

Character for registration (CR) All the Chinese character code point that could

be registered as CDN TW Corresponding character

T-source Chinese character code point which correspond to CR

CN Corresponding character G-source Chinese character code point which

correspond to CR Relevant character

All the variant code point related to CR

CVT Structure and Definition

Character for registration CJK Han character

20902 CJK Unified Ideographs (4E00-9FA5) 52 Characters in Extension A

LDH RFC 1035 Alphabet ‘a-z’ Numeric ‘0-9’ Symbol ‘-’

21017 code points will be included in this table

Sample of CVTCharacter for registration

(CR)

TW Corresponding character

(TWCC)

CNCorresponding

character(CNCC)

Relevant character(s)

(RC)Remarks

公 公 公 公Mapping to oneself司 司 司 司

會 會 会 會会

1-n mappingOnly CR CC 2 RCConsistent RCs

会 會 会 會会

議 議 议 議议

议 議 议 議议

Sample of TCVTCharacter for registration

(CR)

TW Corresponding character

(TWCC)

CN Corresponding

character(CNCC)

Relevant character(s)

(RC)Remarks

風 風 风 風风凨1-1 mappingConsistent RCs

风 風 风 風风凨

凨 風 风 風风凨

台 台臺颱檯 台 台臺颱檯

1-n mappingInconsistent RCs

臺 台臺 台 台臺

颱 颱 台 台颱

檯 檯 台 台檯

CVT Structure and Definition TW Corresponding character

Adopt the corresponding character selected by TW

1 to 1: 19,029 1 to many: 1,925

CN Corresponding character Adopt the corresponding character selected

by CN 1 to 1: 20943 1 to many: 11

CVT Structure and Definition Relevant character

All the variant code points related to Character for Registration

CVT Operation algorithm Character for registration (CR)

Let user register one CDN Put the registered CDN in zone file

TW Corresponding character (TWCC) If only one TWCC

Put the TWCC DN in zone file If more than one TWCC

Let user choose one TWCC DN then put in zone file

CVT Operation algorithm CN Corresponding character (CNCC)

If only one CNCC Put the CNCC DN in zone file

If more than one CNCC Let user choose one CNCC DN then put in zone

file

Relevant character (RC) Reserve RC DN in registration database If Active RC DN

Put the ARCDN in zone file

CVT Operation algorithm CDN registration package

CDN-CR (character for registration) TWCC DN CNCC DN RC DN ARC DN

Demonstration( 公司 .TW) Register CDN: 公司 .TW TWCC DN: 公司 .TW CNCC DN: 公司 .TW RC DN: none Zone file only 1 CDN: 公司 .TW

Demonstration( 會議公司 .TW)

Register CDN: 會議公司 .TW TWCC DN: 會議公司 .TW CNCC DN: 会议公司 .TW RC DN

1. 會议公司 .TW2. 会議公司 .TW

Zone file 2 CDN1. 會議公司 .TW2. 会议公司 .TW

Demonstration( 會議公司 .TW) RC DN

1. 會议公司 .TW2. 会議公司 .TW

RC DN could be active by registrant1. 會议公司 .TW

Zone file 3 CDN1. 會議公司 .TW2. 会议公司 .TW3. 會议公司 .TW

Demonstration( 颱風 .TW) Register CDN: 颱風 .TW TWCC DN: 颱風 .TW CNCC DN: 台风 .TW RC DN

1. 台凨 .TW2. 台風 .TW3. 颱凨 .TW4. 颱风 .TW

Zone file: 2 CDN1. 颱風 .TW2. 台风 .TW

Demonstration( 台風 .TW) Register CDN: 台風 .TW

There are 4 TWCC DN1. 檯風 .TW2. 臺風 .TW3. 颱風 .TW4. 台風 .TW Option process:

Let user to choose one TWCC DN TWCC DN: 台風 .TW

Demonstration( 台風 .TW)

CNCC DN: 台风 .TW RC DN

1. 台凨 .TW2. 檯凨 .TW3. 檯風 .TW4. 檯风 .TW5. 臺凨 .TW

Zone file 2 CDN1. 台風 .TW2. 台风 .TW

6. 臺風 .TW7. 臺风 .TW8. 颱風 .TW9. 颱凨 .TW10. 颱风 .TW

Corresponding DN overlap CDN 颱風 .TW and 台風 .TW

Have the same CNCC DN: 台风 .TW FCFS principle The second registrant will not get the CNCC DN

4 overlapped RC DN 1. 台凨 .TW2. 台風 .TW3. 颱凨 .TW4. 颱风 .TW

Relevant Character DN overlap The result of RC DN will depending on regi

ster order Case 1

If register 颱風 .TW first Will get 4 RC DN

Then register 台風 .TW Will get 10-4=6 RC DN

Case 2 If register 台風 .TW first

Will get 10 RC DN Then register 颱風 .TW

Will not get RC DN

Case study: .tw CDN Statistics with CVT

CDN length Sample NumbersTotal Numbers of Relventcharacters

Themultiples

2 9349 25532 2.733 6102 28947 4.744 16964 148183 8.745 4131 56936 13.786 5196 137571 26.487 5074 247015 48.688 2218 132665 59.819 1445 191016 132.19

10 2199 476125 216.5211 1291 242827 188.09

12 (above) 2261 1418233 627.26Total 56230 3105050 55.22

Too many Reserved CDN Register CDN: 經濟部標準檢驗局

972 RCDN Register CDN: 經濟部標準檢驗局第一會議室 More than 19,000 RC DN

Issues Technical issue

Reserved CDN overlap increase the registration system more complex

Package register/delete/cancel/active/de-active need to relay on a sophisticated registration system to maintain the correct CDN package

The amount of Zone file will increase after zone delegation

Issues Policy issue

Package register/delete/cancel/active/de-active policies, especially for overlapped characters

Too many RCs and how to regulate by registration policy to decrease the database size and system workload

Backward compatibility before or after registration, including implementing variant table

new DRP for package concept if possible Variant table issue

Table version and its backward compatibility

Next Step To tune Chinese (CN/TW) variant table

Consider users expectation Reduce RC size as possible Reduce TW Corresponding character 1 to many case

1 to many: 1,925 --988 To set up the Rules for table use To implement table and develop API for CDN

Registration and resolution To Work out feasible Registration policy

Adopt IDN Admin Guideline Sunrise period

Q & A

Recommended