[Blocks] |
Blocks data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/Blocks.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/Blocks.txt |
[Charts] |
Online Code Charts
http://www.unicode.org/charts/
An index to character names with links to the corresponding chart is
found at
http://www.unicode.org/charts/charindex.html
|
[Charts14] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/LineBreakTest.html
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/LineBreakTest.html |
[Charts15] |
Normalization Charts
http://www.unicode.org/charts/normalization/ |
[Charts29] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakTest.html
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakTest.html |
[CLDR] |
Unicode Locales Project (Unicode Common Locale Data Repository)
http://cldr.unicode.org/ |
[Code9] |
Reference implementations of the Unicode Bidirectional
Algorithm
For C reference code, see:
http://www.unicode.org/Public/PROGRAMS/BidiReferenceC/
For Java reference code, see:
http://www.unicode.org/Public/PROGRAMS/BidiReferenceJava/
|
[Code14] |
Sample implementation of the
Unicode Line Breaking Algorithm
http://www.unicode.org/Public/PROGRAMS/LineBreakSampleCpp/ |
[Corrections] |
Normalization Corrections data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NormalizationCorrections.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NormalizationCorrections.txt |
[Corrigendum1] |
Corrigendum #1: UTF-8 Shortest Form
http://www.unicode.org/versions/corrigendum1.html |
[Corrigendum2] |
Corrigendum #2: Yod with Hiriq Normalization
http://www.unicode.org/versions/corrigendum2.html |
[Corrigendum3] |
Corrigendum #3: U+F951 Normalization
http://www.unicode.org/versions/corrigendum3.html |
[Corrigendum4] |
Corrigendum #4: Five CJK Canonical Mapping Errors
http://www.unicode.org/versions/corrigendum4.html |
[Corrigendum5] |
Corrigendum #5: Normalization Idempotency
http://www.unicode.org/versions/corrigendum5.html |
[Corrigendum6] |
Corrigendum #6: Bidi Mirroring
http://www.unicode.org/versions/corrigendum6.html |
[Corrigendum7] |
Corrigendum #7: UAX #14, Unicode Line Breaking Algorithm, rule LB8
http://www.unicode.org/versions/corrigendum7.html |
[Corrigendum8] |
Corrigendum #8: Bidi_Class Fix for U+070F Syriac Abbreviation Mark
http://www.unicode.org/versions/corrigendum8.html |
[Corrigendum9] |
Corrigendum #9: Clarification About Noncharacters
http://www.unicode.org/versions/corrigendum9.html |
[Data9] |
Unicode Bidirectional Algorithm property data files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/BidiMirroring.txt
http://www.unicode.org/Public/UCD/latest/ucd/BidiBrackets.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/BidiMirroring.txt
http://www.unicode.org/Public/7.0.0/ucd/BidiBrackets.txt |
[Data11] |
East Asian Width property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/EastAsianWidth.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/EastAsianWidth.txt |
[Data14] |
Unicode Line Breaking Algorithm property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/LineBreak.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/LineBreak.txt |
[Data24] |
Unicode Script Property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/Scripts.txt |
[Data34] |
Unicode Named Character Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NamedSequences.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NamedSequences.txt |
[Data45] |
U-Source Ideographs data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/USourceData.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/USourceData.txt |
[DataProv] |
Provisional Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NamedSequencesProv.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NamedSequencesProv.txt |
[Demo9] |
Online demo of a reference implementation of the Unicode Bidirectional Algorithm
http://www.unicode.org/cldr/utility/bidi.jsp |
[DerivedBIDI] |
Derived Bidi Properties
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedBidiClass.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/extracted/DerivedBidiClass.txt |
[Errata] |
Updates and Errata
http://www.unicode.org/errata |
[Exclusions] |
Composition Exclusion Table
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/CompositionExclusions.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/CompositionExclusions.txt |
[FAQ] |
Unicode Frequently Asked Questions
http://www.unicode.org/faq/
For answers to common questions on technical issues. |
[Feedback] |
Reporting Form
http://www.unicode.org/reporting.html
For reporting errors and requesting information online. |
[Glossary] |
Unicode Glossary
http://www.unicode.org/glossary/
For explanations of terminology used in this and other documents. |
[Glyphs45] |
U-Source Ideographs glyph table
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/USourceGlyphs.pdf
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/USourceGlyphs.pdf
|
[HangulST] |
Hangul Syllable Types
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/HangulSyllableType.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/HangulSyllableType.txt |
[NormProps] |
Derived Normalization Properties
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/DerivedNormalizationProps.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/DerivedNormalizationProps.txt |
[Policies] |
Unicode Policies
http://www.unicode.org/policies/policies.html |
[Props] |
Unicode Text Segmentation property data files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakProperty.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakProperty.txt |
[PropValue] |
Property Value Aliases data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/PropertyValueAliases.txt |
[Reports] |
Unicode Technical Reports
http://www.unicode.org/reports/
For information on the status and development process for technical reports, and for a list of technical reports. |
[Stability] |
Unicode Consortium Stability Policies
http://www.unicode.org/policies/stability_policy.html |
[Tests9] |
Unicode Bidirectional Algorithm test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/BidiTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/BidiCharacterTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/BidiTest.txt
http://www.unicode.org/Public/7.0.0/ucd/BidiCharacterTest.txt |
[Tests14] |
Unicode Line Breaking Algorithm test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/LineBreakTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/LineBreakTest.txt |
[Tests15] |
Unicode Normalization Forms test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NormalizationTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NormalizationTest.txt |
[Tests29] |
Unicode Text Segmentation test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakTest.txt |
[UAX9] |
UAX #9: Unicode Bidirectional Algorithm
http://www.unicode.org/reports/tr9/ |
[UAX11] |
UAX #11: East Asian Width
http://www.unicode.org/reports/tr11/ |
[UAX14] |
UAX #14: Unicode Line Breaking Algorithm
http://www.unicode.org/reports/tr14/ |
[UAX15] |
UAX #15: Unicode Normalization Forms
http://www.unicode.org/reports/tr15/ |
[UAX24] |
UAX #24: Unicode Script Property
http://www.unicode.org/reports/tr24/ |
[UAX29] |
UAX #29: Unicode Text Segmentation
http://www.unicode.org/reports/tr29/ |
[UAX31] |
UAX #31: Unicode Identifier and Pattern Syntax
http://www.unicode.org/reports/tr31/ |
[UAX34] |
UAX #34: Unicode Named Character Sequences
http://www.unicode.org/reports/tr34/ |
[UAX38] |
UAX #38: Unicode Han Database (Unihan)
http://www.unicode.org/reports/tr38/ |
[UAX41] |
UAX #41: Common References for Unicode Standard Annexes
http://www.unicode.org/reports/tr41/ |
[UAX42] |
UAX #42:Unicode Character Database in XML
http://www.unicode.org/reports/tr42/ |
[UAX44] |
UAX #44:Unicode Character Database
http://www.unicode.org/reports/tr44/ |
[UAX45] |
UAX #45:U-Source Ideographs
http://www.unicode.org/reports/tr45/ |
[UCD] |
Unicode Character Database
http://www.unicode.org/ucd/
For detailed documentation about the Unicode Character Database, see Unicode Standard Annex #44: Unicode Character Database
http://www.unicode.org/reports/tr44/ |
[Unicode] |
The Unicode Standard For the latest version, see:
http://www.unicode.org/versions/latest/
For the 7.0.0 version, see:
http://www.unicode.org/versions/Unicode7.0.0/ |
[Unicode3.0] |
The Unicode Consortium. The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5). |
[Unicode3.1] |
The Unicode Consortium. The Unicode
Standard, Version 3.1.0, defined by: The Unicode Standard, Version
3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as
amended by the Unicode Standard Annex #27: Unicode 3.1
http://www.unicode.org/reports/tr27/ |
[Unicode3.2] |
The Unicode Consortium. The Unicode
Standard, Version 3.2.0, defined by: The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as amended by
the Unicode Standard Annex #27: Unicode 3.1 and the Unicode
Standard Annex #28: Unicode 3.2
http://www.unicode.org/reports/tr28/ |
[Unicode4.0] |
The Unicode Consortium.
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1). |
[Unicode4.0.1] |
The Unicode Consortium. The Unicode Standard, Version 4.0.1, defined by:
The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN
0-321-18578-1), as amended by
Unicode 4.0.1
http://www.unicode.org/versions/Unicode4.0.1/ |
[Unicode4.1] |
The Unicode Consortium. The Unicode Standard, Version 4.1.0, defined by:
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1), as amended by
Unicode 4.0.1 and by
Unicode 4.1.0
http://www.unicode.org/versions/Unicode4.1.0/ |
[Unicode5.0] |
The Unicode Consortium.
The Unicode Standard, Version
5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0). |
[Unicode5.1] |
The Unicode Consortium. The Unicode Standard, Version 5.1.0, defined by: The Unicode Standard, Version 5.0
(Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0), as amended by
Unicode 5.1.0 |
[Unicode5.2] |
The Unicode Consortium. The Unicode Standard, Version 5.2.0, defined
by: The Unicode Standard, Version 5.2 (Mountain View, CA: The Unicode Consortium, 2009. ISBN 978-1-936213-00-9) |
[Unicode6.0] |
The Unicode Consortium. The Unicode Standard, Version 6.0.0
(Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6)
http://www.unicode.org/versions/Unicode6.0.0/ |
[Unicode6.1] |
The Unicode Consortium. The Unicode Standard, Version 6.1.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-02-3)
http://www.unicode.org/versions/Unicode6.1.0/ |
[Unicode6.2] |
The Unicode Consortium. The Unicode Standard, Version 6.2.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-07-8)
http://www.unicode.org/versions/Unicode6.2.0/ |
[Unicode6.3] |
The Unicode Consortium. The Unicode Standard, Version 6.3.0
(Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5)
http://www.unicode.org/versions/Unicode6.3.0/ |
[Unicode7.0] |
The Unicode Consortium. The Unicode Standard, Version 7.0.0
(Mountain View, CA: The Unicode Consortium, 2014. ISBN 978-1-936213-09-2)
http://www.unicode.org/versions/Unicode7.0.0/ |
[UTC] |
Unicode Technical Committee
http://www.unicode.org/consortium/utc.html |
[UTN5] |
UTN #5: Canonical Equivalences in Applications
http://www.unicode.org/notes/tn5 |
[UTR17] |
UTR #17: Unicode Character Encoding Model
http://www.unicode.org/reports/tr17/ |
[UTR20] |
UTR # 20: Unicode in XML and other Markup Languages
http://www.unicode.org/reports/tr20/ |
[UTR23] |
UTR # 23: The Unicode Character Property Model
http://www.unicode.org/reports/tr23/ |
[UTR25] |
UTR # 25: Unicode Support for Mathematics
http://www.unicode.org/reports/tr25/ |
[UTR33] |
UTR # 33: Unicode Conformance Model
http://www.unicode.org/reports/tr33/ |
[UTR36] |
UTR #36: Unicode Security Considerations
http://www.unicode.org/reports/tr36/ |
[UTR50] |
UTR #50: Unicode Vertical Text Layout
http://www.unicode.org/reports/tr50/ |
[UTS6] |
UTS #6: A Standard Compression Scheme
for Unicode
http://www.unicode.org/reports/tr6/ |
[UTS10] |
UTS #10: Unicode Collation Algorithm
(UCA)
http://www.unicode.org/reports/tr10/ |
[UTS18] |
UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/ |
[UTS22] |
UTS #22: Unicode Character Mapping Markup Language
http://www.unicode.org/reports/tr22/ |
[UTS35] |
UTS #35: Unicode Locale Data Markup Language (LDML)
http://www.unicode.org/reports/tr35/ |
[UTS37] |
UTS #37: Unicode Ideographic Variation Database
http://www.unicode.org/reports/tr37/ |
[UTS39] |
UTS #39: Unicode Security Mechanisms
http://www.unicode.org/reports/tr39/ |
[UTS46] |
UTS #46: Unicode IDNA Compatibility Processing
http://www.unicode.org/reports/tr46/ |
[Versions] |
Versions of the Unicode Standard
http://www.unicode.org/versions/
For information on version numbering, and citing and referencing the Unicode Standard,
the Unicode Character Database, and Unicode Technical Reports. |