UnicodeIUC18
Unicode Standard Conference Board Past Conferences Call for Papers Sponsors Showcase
Registration Accommodation Travel Program Talks and Papers Next Conference
Abstract

Open Source Project for Unicode Locales for Linux using Unicode Databases, Collation Keys and a XML based Locale Data

Kentaro Noji - IBM Japan, Ltd.

Intended Audience: Software Engineer
Session Level: Intermediate

We have developed at least 140 Unicode POSIX locales for Linux glibc (the GNU C runtime library) using the Unicode online databases, collation keys, and XML-based locale data. They have been provided to the open source community by Li18nux[1] and IBM under the IBM Public License[2]. Some of them have already been packaged in glibc V2.2.

The functional objectives of this development are as follows:
- To generate character properties from the Unicode Character Database[3]
- To generate collation data from the Unicode Collation Keys database[4]
- To generate other locale data from an XML-based locale definition database
- To conform to the Linux 2000 specification[4]
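
As an illustration of the first objective, the following is a minimal sketch of how character classes could be derived from records in the UnicodeData.txt layout (semicolon-separated fields, with the code point in field 0 and the general category in field 2). The mapping from general category to class name is a simplifying assumption made for this example, and the function names are hypothetical; this is not the project's actual tool.

```python
from collections import defaultdict

# A few records in the UnicodeData.txt layout: semicolon-separated fields,
# field 0 = code point (hex), field 1 = name, field 2 = general category.
SAMPLE = """\
0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;
0061;LATIN SMALL LETTER A;Ll;0;L;;;;;N;;;0041;;0041
0031;DIGIT ONE;Nd;0;EN;;1;1;1;N;;;;;
0020;SPACE;Zs;0;WS;;;;;N;;;;;
"""

# Assumption: a deliberately simplified category-to-class mapping.
# A real LC_CTYPE generator needs many more rules and exceptions.
CATEGORY_TO_CLASS = {"Lu": "upper", "Ll": "lower", "Nd": "digit", "Zs": "space"}


def classify(lines):
    """Group code points by class name, based on the general category."""
    classes = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        code_point = int(fields[0], 16)
        class_name = CATEGORY_TO_CLASS.get(fields[2])
        if class_name is not None:
            classes[class_name].append(code_point)
    return dict(classes)


def format_class(name, code_points):
    """Render one class line using <Uxxxx> symbolic names (BMP only here)."""
    symbols = ";".join(f"<U{cp:04X}>" for cp in sorted(code_points))
    return f"{name} {symbols}"


if __name__ == "__main__":
    for name, cps in sorted(classify(SAMPLE.splitlines()).items()):
        print(format_class(name, cps))
```

Driving the classes from the database rather than from hand-written tables means a new Unicode version only requires rerunning the generator on the updated data file.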

Note that the XML-based locale definition database above is created from the ICU (International Components for Unicode) data. ICU, Java, and POSIX locale data at IBM will all be maintained through this XML locale format.
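
To make the idea of a single XML source feeding POSIX locale data more concrete, here is a small sketch that renders an LC_TIME fragment from an XML document. The XML element names (`locale`, `time`, `abday`) and the function names are hypothetical and chosen only for this example; the abstract does not describe the actual schema. The `<Uxxxx>` symbolic notation is the one used in glibc locale source files.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML locale fragment; element names are illustrative only.
SAMPLE_XML = """\
<locale name="en_US">
  <time>
    <abday>Sun</abday>
    <abday>Mon</abday>
    <abday>Tue</abday>
    <abday>Wed</abday>
    <abday>Thu</abday>
    <abday>Fri</abday>
    <abday>Sat</abday>
  </time>
</locale>
"""


def to_ucs_symbols(text):
    """Spell a string as <Uxxxx> symbols (this sketch assumes BMP characters)."""
    return "".join(f"<U{ord(ch):04X}>" for ch in text)


def render_lc_time(xml_text):
    """Build an LC_TIME section containing only the abday keyword."""
    root = ET.fromstring(xml_text)
    days = [element.text for element in root.findall("./time/abday")]
    value = ";".join(f'"{to_ucs_symbols(day)}"' for day in days)
    return "\n".join(["LC_TIME", f"abday {value}", "END LC_TIME"])


if __name__ == "__main__":
    print(render_lc_time(SAMPLE_XML))
```

The same parsed tree could feed other output formats, which is the appeal of keeping one XML source for several targets.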

In this paper, we give an overview of the project and describe the technical methodology used to develop this locale data. We also introduce tools that display the locale data for verification, and an XML-based locale editor that can be used to modify it.

[1] http://www.li18nux.net/
[2] http://oss.software.ibm.com/developerworks/opensource/locale/
[3] http://www.unicode.org/unicode/onlinedat/online.html
[4] http://www.unicode.org/unicode/reports/tr10/



International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS). GMS is pleased to be able to offer the International Unicode Conferences under an exclusive license granted by the Unicode Consortium. All responsibility for conference finances and operations is borne by GMS. The independent conference board serves solely at the pleasure of GMS and is composed of volunteers active in Unicode and in international software development. All inquiries regarding International Unicode Conferences should be addressed to info@global-conference.com.

Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission.

11 December 2000, Webmaster