Errata Fixed in Unicode 4.0.0

This page contains the definitive listing of all errata recorded since the publication of The Unicode Standard, Version 3.2, and considered resolved by the release of Unicode Version 4.0.0. The errata are listed by date in the table below. For prior errata resolved in Unicode 3.2, see the errata listed in Unicode Standard Annex #28: Unicode 3.2. For errata still pending after the release of Unicode 4.0, see the list of current Updates and Errata.

Date  Summary 
2003-3-11 The glyphs for U+3D0A, U+48B4, and U+29FCE are incorrect. For all three characters, the glyph on the left below shows the character in the current charts; the glyph on the right is the corrected glyph that will be used in the future.
2003-3-03 The glyph for U+2A01 N-ARY CIRCLED PLUS OPERATOR is missing a vertical stroke.
2002-11-25 U+06DD ARABIC END OF AYAH has been changed from a combining character to a format character, and its representation in the code charts has been changed accordingly. The glyph on the left below shows the character in previous charts; the glyph on the right shows the character as it should appear (without a dotted circle, but with a dashed box).
2002-11-25 U+0A33 GURMUKHI LETTER LLA: The representative glyph on the left below shows the character in previous charts; the glyph on the right shows the character as it should appear (with the dot moved).
2002-11-25 U+0B66 ORIYA DIGIT ZERO: The representative glyph on the left below shows the character in previous charts; the glyph on the right shows the character reduced in size so that it is smaller than U+0B20 ORIYA LETTER TTHA.
2002-11-25 U+0ABD GUJARATI SIGN AVAGRAHA: The glyph has been corrected. The glyph on the left below shows the character in previous charts; the glyph on the right shows the character as it should appear (flipped vertically).
2002-11-25 U+1D176 MUSICAL SYMBOL END TIE: The glyph below shows the character as it should appear (with the correct name in the dashed box).

2002-11-25 U+1D785 MATHEMATICAL SANS-SERIF BOLD SMALL PHI and U+1D78D MATHEMATICAL SANS-SERIF BOLD SMALL PHI SYMBOL: The glyphs for these two characters were swapped. The left diagram below shows the previous association of glyphs to characters. The right diagram below shows the correct association of glyphs to characters.

2002-11-25 U+1D7BF MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL PHI and U+1D7C7 MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL PHI SYMBOL: The glyphs for these two characters were swapped. The left diagram below shows the previous association of glyphs to characters. The right diagram below shows the correct association of glyphs to characters.

2002-10-31

There are two errors in SpecialCasing.txt.

1. Missing semicolons on two lines. Correct as follows:

< 0130; 0069; 0130; 0130; tr # LATIN CAPITAL LETTER I WITH DOT ABOVE
< 0130; 0069; 0130; 0130; az # LATIN CAPITAL LETTER I WITH DOT ABOVE
---
> 0130; 0069; 0130; 0130; tr; # LATIN CAPITAL LETTER I WITH DOT ABOVE
> 0130; 0069; 0130; 0130; az; # LATIN CAPITAL LETTER I WITH DOT ABOVE

2. An incorrect context definition. Correct as follows:

< 0307; ; 0307; 0307; tr After_Soft_Dotted; # COMBINING DOT ABOVE
< 0307; ; 0307; 0307; az After_Soft_Dotted; # COMBINING DOT ABOVE
---
> 0307; ; 0307; 0307; tr After_I; # COMBINING DOT ABOVE
> 0307; ; 0307; 0307; az After_I; # COMBINING DOT ABOVE

where the context After_I is defined as:

The last preceding base character was an uppercase I, and there is no intervening character of combining class 230 (ABOVE).
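As an illustration of the two corrections, the following Python sketch parses SpecialCasing.txt entries and tests the After_I context as worded above. The function names parse_special_casing_line and after_i are hypothetical. Two things are assumptions made for this sketch and are not stated in the erratum: that a parser requires every field, including the condition list, to end in a semicolon, and that a "base character" can be approximated as any character with combining class 0.

```python
import unicodedata


def parse_special_casing_line(line):
    """Parse one SpecialCasing.txt entry; return None for blank/comment lines.

    Assumption: the parser is strict and requires every field, including
    the optional condition list, to be terminated by ';'.
    """
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    fields = [field.strip() for field in data.split(";")]
    if fields[-1] != "":
        raise ValueError("last field is not terminated by ';'")
    fields = fields[:-1]
    if len(fields) not in (4, 5):
        raise ValueError("expected 4 mappings plus an optional condition list")

    def to_text(hex_codes):
        # An empty mapping (as in the lowercase field for 0307) yields "".
        return "".join(chr(int(code, 16)) for code in hex_codes.split())

    return {
        "code": to_text(fields[0]),
        "lower": to_text(fields[1]),
        "title": to_text(fields[2]),
        "upper": to_text(fields[3]),
        "conditions": fields[4].split() if len(fields) == 5 else [],
    }


def after_i(text, index):
    """Return True if text[index] is in the After_I context.

    Assumption: a base character is any character of combining class 0.
    """
    for ch in reversed(text[:index]):
        combining_class = unicodedata.combining(ch)
        if combining_class == 0:
            return ch == "I"  # the last preceding base character
        if combining_class == 230:
            return False  # an intervening Above mark blocks the context
    return False


entry = parse_special_casing_line(
    "0307; ; 0307; 0307; tr After_I; # COMBINING DOT ABOVE"
)
print(entry["conditions"])
print(after_i("I\u0307", 1))        # dot above directly after uppercase I
print(after_i("I\u0301\u0307", 2))  # an acute (class 230) intervenes
```

Under the strict-terminator assumption, passing one of the uncorrected 0130 lines (with "tr #" rather than "tr; #") to parse_special_casing_line raises ValueError, which is the kind of failure the first correction avoids.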

2002-10-30

The general category of the character U+00AD SOFT HYPHEN should be Cf (Other, Format) instead of Pd (Punctuation, Dash). The general category of the characters U+058A ARMENIAN HYPHEN and U+1806 MONGOLIAN TODO SOFT HYPHEN remains Pd (Punctuation, Dash).
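The categories named in this erratum can be checked against a character database. The sketch below uses Python's standard unicodedata module and assumes the interpreter bundles data from Unicode 4.0 or later, so that the correction is already included; the helper name general_categories is hypothetical.

```python
import unicodedata


def general_categories(code_points):
    """Map each code point to its General_Category abbreviation."""
    return {cp: unicodedata.category(chr(cp)) for cp in code_points}


# The three characters discussed in the erratum.
categories = general_categories([0x00AD, 0x058A, 0x1806])
for cp, category in categories.items():
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}: {category}")
```

With such data, U+00AD reports Cf, while U+058A and U+1806 report Pd.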

2002-5-10 The present version (tracking number 12) of UAX #14: Line Breaking Properties does not correctly handle Jamo characters. The correct behavior is documented in http://www.unicode.org/reports/tr28/, Table 3-5, "Hangul Syllable No-Break Rules."
2002-4-29 In UAX #28: Unicode 3.2, the figure illustrating canonical and compatibility decomposables in Article III, Section 2.2, Unicode Design Principles, has incorrect code points. In both places where each appears, the code point should be 03D3 rather than 038E, and 03A5 rather than 0385.
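The decomposition behavior of the four code points named in this erratum can be inspected directly. The following Python sketch does not reproduce the figure itself; it only prints the decomposition mapping and the canonical (NFD) and compatibility (NFKD) decompositions that the standard unicodedata module reports for each code point. The helper name describe is hypothetical.

```python
import unicodedata


def describe(code_point):
    """Return the decomposition mapping and NFD/NFKD forms as hex strings."""
    ch = chr(code_point)

    def hex_list(text):
        return [f"{ord(c):04X}" for c in text]

    return {
        "mapping": unicodedata.decomposition(ch),
        "NFD": hex_list(unicodedata.normalize("NFD", ch)),
        "NFKD": hex_list(unicodedata.normalize("NFKD", ch)),
    }


for cp in (0x03D3, 0x03A5, 0x038E, 0x0385):
    print(f"U+{cp:04X}", describe(cp))
```

In this data, 03D3 decomposes canonically to 03D2 0301 and under compatibility decomposition to 03A5 0301, whereas 038E decomposes canonically to 03A5 0301.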