Unicode Utilities: Character Property Index


Category | Datatype | Source | Property | Values
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y)
 |  |  | Bidi_Mirrored | No (N), Yes (Y)
 | Enumerated |  | Bidi_Class | Show Values
 |  |  | Bidi_Paired_Bracket_Type | Close (c), None (n), Open (o)
 | String |  | Bidi_Mirroring_Glyph | Show Values
 |  |  | Bidi_Paired_Bracket | Show Values
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y)
 |  |  | Cased | No (N), Yes (Y)
 |  |  | Changes_When_Casefolded | No (N), Yes (Y)
 |  |  | Changes_When_Casemapped | No (N), Yes (Y)
 |  |  | Changes_When_Lowercased | No (N), Yes (Y)
 |  |  | Changes_When_Titlecased | No (N), Yes (Y)
 |  |  | Changes_When_Uppercased | No (N), Yes (Y)
 |  |  | Lowercase | No (N), Yes (Y)
 |  |  | Soft_Dotted | No (N), Yes (Y)
 |  |  | Uppercase | No (N), Yes (Y)
 | String |  | Case_Folding | Show Values
 |  |  | Lowercase_Mapping | Show Values
 |  |  | Simple_Case_Folding | Show Values
 |  |  | Simple_Lowercase_Mapping | Show Values
 |  |  | Simple_Titlecase_Mapping | Show Values
 |  |  | Simple_Uppercase_Mapping | Show Values
 |  |  | Titlecase_Mapping | Show Values
 |  |  | Uppercase_Mapping | Show Values
 |  | Unicode | toCasefold | Show Values
 |  |  | toLowercase | Show Values
 |  |  | toTitlecase | Show Values
 |  |  | toUppercase | Show Values
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y)
 |  |  | IDS_Trinary_Operator | No (N), Yes (Y)
 |  |  | Ideographic | No (N), Yes (Y)
 |  |  | Radical | No (N), Yes (Y)
 |  |  | Unified_Ideograph | No (N), Yes (Y)
 | Enumerated | X-Demo | HanType | Han, Hans, Hant, na
 | String | UCD | CJK_Radical | Show Values
 |  |  | Equivalent_Unified_Ideograph | Show Values
 |  |  | kSimplifiedVariant | Show Values
 |  |  | kTraditionalVariant | Show Values
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y)
 |  | UTS | Basic_Emoji | No (N), Yes (Y)
 |  |  | Emoji | No (N), Yes (Y)
 |  |  | Emoji_Component | No (N), Yes (Y)
 |  |  | Emoji_Modifier | No (N), Yes (Y)
 |  |  | Emoji_Modifier_Base | No (N), Yes (Y)
 |  |  | Emoji_Presentation | No (N), Yes (Y)
 |  |  | RGI_Emoji | No, Yes
 |  |  | RGI_Emoji_Flag_Sequence | No (N), Yes (Y)
 |  |  | RGI_Emoji_Keycap_Sequence | No (N), Yes (Y)
 |  |  | RGI_Emoji_Modifier_Sequence | No (N), Yes (Y)
 |  |  | RGI_Emoji_Tag_Sequence | No (N), Yes (Y)
 |  |  | RGI_Emoji_Zwj_Sequence | No (N), Yes (Y)
 | Enumerated | UCD | Regional_Indicator | No (N), Yes (Y)
General | Binary | UCD | Alphabetic | No (N), Yes (Y)
 |  |  | Default_Ignorable_Code_Point | No (N), Yes (Y)
 |  |  | Deprecated | No (N), Yes (Y)
 |  |  | Logical_Order_Exception | No (N), Yes (Y)
 |  |  | Noncharacter_Code_Point | No (N), Yes (Y)
 |  |  | Variation_Selector | No (N), Yes (Y)
 |  |  | White_Space | No (N), Yes (Y)
 | Catalog |  | Age | Show Values
 |  |  | Block | Show Values
 |  |  | Script | Show Values
 | Enumerated |  | General_Category | Show Values
 |  |  | Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V)
 |  |  | Name_Alias | Show Values
 |  |  | Named_Sequences | Show Values
 |  |  | Named_Sequences_Prov | 
 | String |  | Nameslistsubhead | A. Man and his occupations, A. The human body and clothing, Aa. Unclassified, Abbreviation mark, Abbreviations involving liter symbols, Abkhaz letters, Accent marks, Accessibility symbols, Accidental, Accidentals for quarter tones, Accommodation symbol, Activities, Addition for German typography, Addition for Kashmiri, Addition for Sanskrit, Addition for Torwali, Addition for UPA, Addition to group V, Additional archaic letters for Bactrian, Additional components, Additional diacritical mark for symbols, Additional historic chillu letters, Additional historic vowel, Additional letter for Ossetian, Additional letters for Gaulish, Additional letters for Mingrelian and Svan, Additional letters for Ossetian and Abkhaz, Additional marks for UPA, Additional medieval letters, Additional nasal vowels, Additional vowel and vowel sign, Additional vowels for Sanskrit, Additions based on 1989 IPA, Additions for Africanist linguistics, Additions for Assamese, Additions for Bosnian orthographies, Additions for Burushaski, Additions for early Persian, Additions for early Persian and Azerbaijani, Additions for Greek, Additions for IPA, Additions for Khowar, Additions for Lithuanian dialectology, Additions for Livonian, Additions for Nivkh, Additions for Pegon orthographies, Additions for Quranic orthographies, Additions for Romanian, Additions for Sencoten, Additions for Shan Pali, Additions for Shona, Additions for Sinology, Additions for Slovenian, Additions for the Uralic Phonetic Alphabet, Additions for Uyghur, Afona or Ypostaseis (Mutes or Hypostases), African letters for clicks, Africanist tone letters, Agogika (Conduits), Agricultural symbols, Aiton symbols and letters, Aleut letters, Alloioseis (Differentiators), Alternate consonant forms for Chinese, Alternate digits, Alternate forms, Alternate forms of punctuation, Alternate letters, Alternate visarga signs, Alternate vowel signs, Ampersand and ligature et ornaments, Analytics, Ancient Gaulish and Celtic epigraphic letters, Ancient Greek acrophonic numerals, Ancient Greek instrumental notation, Ancient Greek papyrological numbers, Ancient Greek symbols, Ancient Greek textual symbols, Ancient Greek vocalic notation, Ancient Near-Eastern linguistic symbol, Ancient Roman epigraphic letters, Angles, Animal faces, Animal symbols, Animals and nature, Annotation marks, Anusvaras, APL, Apparatus, Arabic letter for Balti, Arabic letter for Berber, Arabic letter for Punjabi, Arabic letters for African languages, Arabic letters for Arwi, Arabic letters for Bravanese, Arabic letters for European and Central Asian languages, Arabic letters for Hausa, Wolof and other African orthographies, Arabic letters for Hindko, Arabic letters for Warsh orthography, Arabic pedagogical symbols, Arabic-Indic digits, Arc consonants, Archaic baseline tone marks, Archaic letter, Archaic letters for Ewe, Archaic letters for Volapük, Archaic lowercase syllable, Archaic punctuation, Archaic Roman numerals, Archaic uppercase syllable, Ardhavisarga, Argies (Retards), ARIB compatibility ideographs, Armenian ligatures, Arrow tails, Arrowheads, Arrows, Arrows combined with operators, Arrows combined with relations, Arrows for legacy computing, Arrows with bent tips, Arrows with equilateral triangle arrowheads, Arrows with modifications, Arrows with triangle arrowheads, Articulation, ASCII digits, ASCII math operator, ASCII mathematical operators, ASCII punctuation, ASCII punctuation and symbols, Astrological aspects, Astrological digits, Astrological signs, Astrological symbols for Eris and Sedna, Astrological symbols for Pluto, Astronomical symbol, Attached affixes, Audio and video symbols, Augmentation dot, Auspicious signs, Azerbaijani letters, B. Animals, B. Woman and her occupations, Back of card, Ballot symbols, Bamboo suit tiles, Bars, Based on GB 2312, Based on ISO 8859-6, Based on ISO 8859-8, Baseline tone marks, Basic consonants, Basic glyphs for Arabic language contextual forms, Basic letters, Basic nasal vowels, Basic Russian alphabet, Basic syllables, Basic vowels, Beams and slurs, Beverage and food symbols, Beverage symbols, Biblical editorial symbol, Black arrow, Black circles, Black curved arrows, Black diamonds, Black lozenges, Black squares, Blank and lost signs, Block elements, Block mosaic terminal graphic characters, Body parts, Body shapes, Bohairic Coptic letters, Bold digits, Bold Fraktur symbols, Bold Greek symbols, Bold italic Greek symbols, Bold italic symbols, Bold script symbols, Bold symbols, Bowtie symbols, Bracket pieces, Brackets, Brackets with ticks, Braille patterns, Breathing mark for the Samaveda, Bubble symbols, Building and map symbols, Buryat letter, C. Anthropomorphic deities, C. Plants, C0 controls, C1 controls, Candrabindu, Candrabindu and candra ornaments, Cantillation marks, Cantillation marks (svara) for the Samaveda, Capital letters, Capital letters (Khutsuri), Capital letters (Mtavruli), Cat faces, Ceilings and floors, Celebration symbols, Centred geometric shapes, Cereals and plants, Character cell arcs, Character cell diagonals, Character suit tiles, Characters found through Phase A, Characters found through Phase B, Characters found through Phase C, Characters found through Phase D, Characters found through Phase E, Characters found through Phase F, Chemistry symbol, Chess equihoppers, Chess equihoppers rotated 90 degrees, Chess symbols, Chess symbols rotated 45 degrees, Chess symbols rotated 90 degrees, Chess symbols rotated 135 degrees, Chess symbols rotated 180 degrees (turned), Chess symbols rotated 225 degrees, Chess symbols rotated 270 degrees, Chess symbols rotated 315 degrees, Chillu letters, Chinantec tone marks, Chinook punctuation, Chukchi letters, Chuvash letters, Circle, Circle suit tiles, Circle symbols, Circle vowels, Circled arrows, Circled Hangul letters, Circled Hangul syllable, Circled Hangul syllables, Circled ideographs, Circled ideographs from ARIB STD B24, Circled italic Latin letters, Circled Katakana, Circled Korean words, Circled Latin letters, Circled Latin letters or letter sequences, Circled numbers, Circled numbers on black squares from ARIB STD B24, Circled sans-serif digits, Circles, CJK angle brackets, CJK brackets, CJK corner brackets, CJK punctuation, CJK radicals supplement (too many values to show)
 |  | UCD | Name | Show Values
 |  |  | Script_Extensions | Show Values
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y)
 |  |  | ID_Start | No (N), Yes (Y)
 |  |  | Pattern_Syntax | No (N), Yes (Y)
 |  |  | Pattern_White_Space | No (N), Yes (Y)
 |  |  | XID_Continue | No (N), Yes (Y)
 |  |  | XID_Start | No (N), Yes (Y)
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8)
 |  |  | Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v)
 |  |  | idna2003 | deviation, disallowed, ignored, mapped, valid
 |  |  | idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED
 |  |  | idna2008c | deviation, disallowed, ignored, mapped, valid
 |  |  | uts46 | deviation, disallowed, ignored, mapped, valid
 | String |  | Idn_Mapping | Show Values
 |  |  | toIdna2003 | Show Values
 |  |  | toUts46n | Show Values
 |  |  | toUts46t | Show Values
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y)
 |  |  | Diacritic | No (N), Yes (Y)
 |  |  | Extender | No (N), Yes (Y)
 |  |  | Grapheme_Base | No (N), Yes (Y)
 |  |  | Grapheme_Extend | No (N), Yes (Y)
 |  |  | Grapheme_Link | No (N), Yes (Y)
 |  |  | Hyphen | No (N), Yes (Y)
 |  |  | Math | No (N), Yes (Y)
 |  |  | Quotation_Mark | No (N), Yes (Y)
 |  |  | STerm | No (N), Yes (Y)
 |  |  | Terminal_Punctuation | No (N), Yes (Y)
 | Enumerated |  | Indic_Positional_Category | Show Values
 |  |  | Indic_Syllabic_Category | Show Values
 | Miscellaneous |  | ISO_Comment | Show Values
 |  |  | Unicode_1_Name | Show Values
Normalization | Binary | ICU | isNFM | No, Yes
 |  | UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y)
 |  |  | Full_Composition_Exclusion | No (N), Yes (Y)
 |  | Unicode | isNFC | No, Yes
 |  |  | isNFD | No, Yes
 |  |  | isNFKC | No, Yes
 |  |  | isNFKD | No, Yes
 | Enumerated | UCD | Canonical_Combining_Class | Show Values
 |  |  | Decomposition_Type | Show Values
 |  |  | NFC_Quick_Check | Maybe (M), No (N), Yes (Y)
 |  |  | NFD_Quick_Check | No (N), Yes (Y)
 |  |  | NFKC_Quick_Check | Maybe (M), No (N), Yes (Y)
 |  |  | NFKD_Quick_Check | No (N), Yes (Y)
 | String | ICU | toNFM | Show Values
 |  | UCD | NFKC_Casefold | Show Values
 |  | Unicode | toNFC | Show Values
 |  |  | toNFD | Show Values
 |  |  | toNFKC | Show Values
 |  |  | toNFKD | Show Values
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y)
 |  |  | Hex_Digit | No (N), Yes (Y)
 | Enumerated |  | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu)
 |  |  | kAccountingNumeric | Show Values
 |  |  | kOtherNumeric | Show Values
 |  |  | kPrimaryNumeric | Show Values
 | Numeric |  | Numeric_Value | Show Values
Regex | Binary | UTS | ANY | No, Yes
 |  |  | ASCII | No, Yes
 |  |  | bmp | No, Yes
Security | Enumerated | UTS | Confusable_MA | Show Values
 |  |  | Identifier_Status | Allowed (a), Restricted (r)
 |  |  | Identifier_Type | Show Values
Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y)
 | Enumerated |  | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W)
 |  |  | Grapheme_Cluster_Break | Show Values
 |  |  | Joining_Group | Show Values
 |  |  | Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T)
 |  |  | Line_Break | Show Values
 |  |  | Prepended_Concatenation_Mark | No (N), Yes (Y)
 |  |  | Sentence_Break | Show Values
 |  |  | Standardized_Variant | Show Values
 |  |  | Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U)
 |  |  | Word_Break | Show Values
UCA | Binary | UTS | uca | Show Values
 |  |  | uca2 | Show Values
 |  |  | uca2.5 | Show Values
 |  |  | uca3 | Show Values
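Z-Other | Other | Other | Composition_Exclusion | Other
 |  |  | Confusable_ML | Other
 |  |  | Confusable_SA | Other
 |  |  | Confusable_SL | Other
 |  |  | Decomposition_Mapping | Other
 |  |  | Do_Not_Emit_Preferred | Other
 |  |  | Do_Not_Emit_Type | Other
 |  |  | Emoji_DCM | Other
 |  |  | Emoji_KDDI | Other
 |  |  | Emoji_SB | Other
 |  |  | exemplar | Other
 |  |  | exemplar_aux | Other
 |  |  | exemplar_punct | Other
 |  |  | Expands_On_NFC | Other
 |  |  | Expands_On_NFD | Other
 |  |  | Expands_On_NFKC | Other
 |  |  | Expands_On_NFKD | Other
 |  |  | FC_NFKC_Closure | Other
 |  |  | ID_Compat_Math_Continue | Other
 |  |  | ID_Compat_Math_Start | Other
 |  |  | IDS_Unary_Operator | Other
 |  |  | Indic_Conjunct_Break | Other
 |  |  | Jamo_Short_Name | Other
 |  |  | kAlternateTotalStrokes | Other
 |  |  | kBigFive | Other
 |  |  | kCangjie | Other
 |  |  | kCantonese | Other
 |  |  | kCCCII | Other
 |  |  | kCheungBauer | Other
 |  |  | kCheungBauerIndex | Other
 |  |  | kCihaiT | Other
 |  |  | kCNS1986 | Other
 |  |  | kCNS1992 | Other
 |  |  | kCompatibilityVariant | Other
 |  |  | kCowles | Other
 |  |  | kDaeJaweon | Other
 |  |  | kDefinition | Other
 |  |  | kEACC | Other
 |  |  | kEH_Cat | Other
 |  |  | kEH_Core | Other
 |  |  | kEH_Desc | Other
 |  |  | kEH_Func | Other
 |  |  | kEH_FVal | Other
 |  |  | kEH_HG | Other
 |  |  | kEH_IFAO | Other
 |  |  | kEH_JSesh | Other
 |  |  | kEH_NoMirror | Other
 |  |  | kEH_NoRotate | Other
 |  |  | kEH_UniK | Other
 |  |  | kFanqie | Other
 |  |  | kFenn | Other
 |  |  | kFennIndex | Other
 |  |  | kFourCornerCode | Other
 |  |  | kFrequency | Other
 |  |  | kGB0 | Other
 |  |  | kGB1 | Other
 |  |  | kGB3 | Other
 |  |  | kGB5 | Other
 |  |  | kGB7 | Other
 |  |  | kGB8 | Other
 |  |  | kGradeLevel | Other
 |  |  | kGSR | Other
 |  |  | kHangul | Other
 |  |  | kHanYu | Other
 |  |  | kHanyuPinlu | Other
 |  |  | kHanyuPinyin | Other
 |  |  | kHDZRadBreak | Other
 |  |  | kHKGlyph | Other
 |  |  | kHKSCS | Other
 |  |  | kIBMJapan | Other
 |  |  | kIICore | Other
 |  |  | kIRG_GSource | Other
 |  |  | kIRG_HSource | Other
 |  |  | kIRG_JSource | Other
 |  |  | kIRG_KPSource | Other
 |  |  | kIRG_KSource | Other
 |  |  | kIRG_MSource | Other
 |  |  | kIRG_SSource | Other
 |  |  | kIRG_TSource | Other
 |  |  | kIRG_UKSource | Other
 |  |  | kIRG_USource | Other
 |  |  | kIRG_VSource | Other
 |  |  | kIRGDaeJaweon | Other
 |  |  | kIRGDaiKanwaZiten | Other
 |  |  | kIRGHanyuDaZidian | Other
 |  |  | kIRGKangXi | Other
 |  |  | kJa | Other
 |  |  | kJapanese | Other
 |  |  | kJapaneseKun | Other
 |  |  | kJapaneseOn | Other
 |  |  | kJinmeiyoKanji | Other
 |  |  | kJis0 | Other
 |  |  | kJIS0213 | Other
 |  |  | kJis1 | Other
 |  |  | kJoyoKanji | Other
 |  |  | kKangXi | Other
 |  |  | kKarlgren | Other
 |  |  | kKorean | Other
 |  |  | kKoreanEducationHanja | Other
 |  |  | kKoreanName | Other
 |  |  | kKPS0 | Other
 |  |  | kKPS1 | Other
 |  |  | kKSC0 | Other
 |  |  | kKSC1 | Other
 |  |  | kLau | Other
 |  |  | kMainlandTelegraph | Other
 |  |  | kMandarin | Other
 |  |  | kMatthews | Other
 |  |  | kMeyerWempe | Other
 |  |  | kMojiJoho | Other
 |  |  | kMorohashi | Other
 |  |  | kNelson | Other
 |  |  | kPhonetic | Other
 |  |  | kPseudoGB1 | Other
 |  |  | kReading | Other
 |  |  | kRSAdobe_Japan1_6 | Other
 |  |  | kRSJapanese | Other
 |  |  | kRSKangXi | Other
 |  |  | kRSKanWa | Other
 |  |  | kRSKorean | Other
 |  |  | kRSTUnicode | Other
 |  |  | kRSUnicode | Other
 |  |  | kSBGY | Other
 |  |  | kSemanticVariant | Other
 |  |  | kSMSZD2003Index | Other
 |  |  | kSMSZD2003Readings | Other
 |  |  | kSpecializedSemanticVariant | Other
 |  |  | kSpoofingVariant | Other
 |  |  | kSrc_NushuDuben | Other
 |  |  | kStrange | Other
 |  |  | kTaiwanTelegraph | Other
 |  |  | kTang | Other
 |  |  | kTGH | Other
 |  |  | kTGHZ2013 | Other
 |  |  | kTGT_MergedSrc | Other
 |  |  | kTotalStrokes | Other
 |  |  | kUnihanCore2020 | Other
 |  |  | kVietnamese | Other
 |  |  | kVietnameseNumeric | Other
 |  |  | kXerox | Other
 |  |  | kXHC1983 | Other
 |  |  | kZhuangNumeric | Other
 |  |  | kZVariant | Other
 |  |  | Modifier_Combining_Mark | Other
 |  |  | NFKC_Simple_Casefold | Other
 |  |  | Other_Alphabetic | Other
 |  |  | Other_Default_Ignorable_Code_Point | Other
 |  |  | Other_Grapheme_Extend | Other
 |  |  | Other_ID_Continue | Other
 |  |  | Other_ID_Start | Other
 |  |  | Other_Joining_Type | Other
 |  |  | Other_Lowercase | Other
 |  |  | Other_Math | Other
 |  |  | Other_Uppercase | Other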
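
As an illustration of the Normalization rows in the table, the sketch below approximates isNFC, isNFD, toNFC, toNFD, and Canonical_Combining_Class with Python's standard `unicodedata` module. This is not how the Unicode Utilities compute these values; the helper name `normalization_report` is hypothetical, and the results reflect whichever Unicode version the Python interpreter ships with, which may differ from the version reported at the bottom of this page.

```python
import unicodedata


def normalization_report(text: str) -> dict:
    """Collect a few normalization-related property values for a string."""
    return {
        # Binary properties: is the string already in the given form?
        "isNFC": unicodedata.is_normalized("NFC", text),
        "isNFD": unicodedata.is_normalized("NFD", text),
        # String properties: the string converted to the given form.
        "toNFC": unicodedata.normalize("NFC", text),
        "toNFD": unicodedata.normalize("NFD", text),
        # Enumerated property: one numeric combining class per code point.
        "Canonical_Combining_Class": [unicodedata.combining(c) for c in text],
    }


# "e" followed by U+0301 COMBINING ACUTE ACCENT (a decomposed e-acute).
decomposed = "e\u0301"
report = normalization_report(decomposed)
for prop, value in report.items():
    print(f"{prop}: {value!r}")
```

`unicodedata.is_normalized` requires Python 3.8 or later. The NFKC and NFKD forms can be queried the same way by passing `"NFKC"` or `"NFKD"` as the form name.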

Key

The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.

The Datatypes are from UCD Table 5. Property Type Key.
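
To make the datatypes concrete, here is a minimal sketch that reads one or two properties of each kind (Binary, Enumerated, Numeric, String) using Python's standard `unicodedata` module. It covers only the small subset of UCD properties that module exposes, the helper name `sample_properties` is hypothetical, and the data comes from the Unicode version bundled with the interpreter rather than from this site.

```python
import unicodedata


def sample_properties(ch: str) -> dict:
    """Return a few UCD property values for one character, keyed by property name."""
    return {
        # Binary: unicodedata.mirrored returns 1 (Yes) or 0 (No).
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        # Enumerated: values come back as short aliases such as "Lu" or "ON".
        "General_Category": unicodedata.category(ch),
        "Bidi_Class": unicodedata.bidirectional(ch),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        # Numeric: None when the character has no Numeric_Value.
        "Numeric_Value": unicodedata.numeric(ch, None),
        # String: the character name, or "" when it has none.
        "Name": unicodedata.name(ch, ""),
    }


for ch in ("(", "\u00bd", "A"):
    print(f"U+{ord(ch):04X}", sample_properties(ch))
```

Properties outside this subset (for example the emoji, IDNA, or Unihan `k*` properties listed in the table) are not available through `unicodedata` and would need the UCD data files or a library such as ICU.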

The Sources are:


Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Suggested fonts that you can add for coverage include: Noto Fonts site; Unicode Fonts for Ancient Scripts; and Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0