Unicode Utilities: Confusables

Unmarked properties are from Unicode V15.1.0; the beta properties are from Unicode V16.0.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input With this demo, you can supply an Input string and see the combinations that are confusable with it, using data collected by the Unicode consortium. You can also try different restrictions, using characters valid in different approaches to international domain names. For more info, see Data below.
  

Confusable Characters

ȡ                                                                            
0221
LATIN SMALL LETTER D WITH CURL
o ο σ о օ ס ه ٥ ھ ہ ە ۵ 𐐬 𐓪 𑣈 𑣗 𝐨 𝑜 𝒐 𝓸 𝔬 𝕠 𝖔 𝗈 𝗼 𝘰 𝙤 𝚘 𝛐 𝛔 𝜊 𝜎 𝝄 𝝈 𝝾 𝞂 𝞸 𝞼 𞸤 𞹤 𞺄
006F03BF03C3043E058505E10647066506BE06C106D506F509660A660AE60BE60C020C660C820CE60D020D200D660D820E500ED0101D104010FF1D0F1D1121342C9FAB3D1042C104EA118C8118D71D4281D45C1D4901D4F81D52C1D5601D5941D5C81D5FC1D6301D6641D6981D6D01D6D41D70A1D70E1D7441D7481D77E1D7821D7B81D7BC1EE241EE641EE84FBA6FBA7FBA8FBA9FBAAFBABFBACFBADFEE9FEEAFEEBFEECFF4F
LATIN SMALL LETTER OGREEK SMALL LETTER OMICRONGREEK SMALL LETTER SIGMACYRILLIC SMALL LETTER OARMENIAN SMALL LETTER OHHEBREW LETTER SAMEKHARABIC LETTER HEHARABIC-INDIC DIGIT FIVEARABIC LETTER HEH DOACHASHMEEARABIC LETTER HEH GOALARABIC LETTER AEEXTENDED ARABIC-INDIC DIGIT FIVEDEVANAGARI DIGIT ZEROGURMUKHI DIGIT ZEROGUJARATI DIGIT ZEROTAMIL DIGIT ZEROTELUGU SIGN ANUSVARATELUGU DIGIT ZEROKANNADA SIGN ANUSVARAKANNADA DIGIT ZEROMALAYALAM SIGN ANUSVARAMALAYALAM LETTER TTHAMALAYALAM DIGIT ZEROSINHALA SIGN ANUSVARAYATHAI DIGIT ZEROLAO DIGIT ZEROMYANMAR LETTER WAMYANMAR DIGIT ZEROGEORGIAN LETTER LABIAL SIGNLATIN LETTER SMALL CAPITAL OLATIN SMALL LETTER SIDEWAYS OSCRIPT SMALL OCOPTIC SMALL LETTER OLATIN SMALL LETTER BLACKLETTER ODESERET SMALL LETTER LONG OOSAGE SMALL LETTER OWARANG CITI SMALL LETTER EWARANG CITI SMALL LETTER BUMATHEMATICAL BOLD SMALL OMATHEMATICAL ITALIC SMALL OMATHEMATICAL BOLD ITALIC SMALL OMATHEMATICAL BOLD SCRIPT SMALL OMATHEMATICAL FRAKTUR SMALL OMATHEMATICAL DOUBLE-STRUCK SMALL OMATHEMATICAL BOLD FRAKTUR SMALL OMATHEMATICAL SANS-SERIF SMALL OMATHEMATICAL SANS-SERIF BOLD SMALL OMATHEMATICAL SANS-SERIF ITALIC SMALL OMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMATHEMATICAL MONOSPACE SMALL OMATHEMATICAL BOLD SMALL OMICRONMATHEMATICAL BOLD SMALL SIGMAMATHEMATICAL ITALIC SMALL OMICRONMATHEMATICAL ITALIC SMALL SIGMAMATHEMATICAL BOLD ITALIC SMALL OMICRONMATHEMATICAL BOLD ITALIC SMALL SIGMAMATHEMATICAL SANS-SERIF BOLD SMALL OMICRONMATHEMATICAL SANS-SERIF BOLD SMALL SIGMAMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMICRONMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL SIGMAARABIC MATHEMATICAL INITIAL HEHARABIC MATHEMATICAL STRETCHED HEHARABIC MATHEMATICAL LOOPED HEHARABIC LETTER HEH GOAL ISOLATED FORMARABIC LETTER HEH GOAL FINAL FORMARABIC LETTER HEH GOAL INITIAL FORMARABIC LETTER HEH GOAL MEDIAL FORMARABIC LETTER HEH DOACHASHMEE ISOLATED FORMARABIC LETTER HEH DOACHASHMEE FINAL FORMARABIC LETTER HEH DOACHASHMEE INITIAL FORMARABIC LETTER HEH DOACHASHMEE MEDIAL FORMARABIC LETTER HEH ISOLATED FORMARABIC LETTER HEH FINAL FORMARABIC LETTER HEH INITIAL FORMARABIC LETTER HEH MEDIAL FORMFULLWIDTH LATIN SMALL LETTER O
g ƍ ɡ ց 𝐠 𝑔 𝒈 𝓰 𝔤 𝕘 𝖌 𝗀 𝗴 𝘨 𝙜 𝚐                                                          
0067018D026105811D83210A1D4201D4541D4881D4F01D5241D5581D58C1D5C01D5F41D6281D65C1D690FF47
LATIN SMALL LETTER GLATIN SMALL LETTER TURNED DELTALATIN SMALL LETTER SCRIPT GARMENIAN SMALL LETTER COLATIN SMALL LETTER G WITH PALATAL HOOKSCRIPT SMALL GMATHEMATICAL BOLD SMALL GMATHEMATICAL ITALIC SMALL GMATHEMATICAL BOLD ITALIC SMALL GMATHEMATICAL BOLD SCRIPT SMALL GMATHEMATICAL FRAKTUR SMALL GMATHEMATICAL DOUBLE-STRUCK SMALL GMATHEMATICAL BOLD FRAKTUR SMALL GMATHEMATICAL SANS-SERIF SMALL GMATHEMATICAL SANS-SERIF BOLD SMALL GMATHEMATICAL SANS-SERIF ITALIC SMALL GMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL GMATHEMATICAL MONOSPACE SMALL GFULLWIDTH LATIN SMALL LETTER G
. ٠ ۰ ܁ ܂ 𐩐 𝅭                                                                   
002E066006F0070107022024A4F8A60E10A501D16D
FULL STOPARABIC-INDIC DIGIT ZEROEXTENDED ARABIC-INDIC DIGIT ZEROSYRIAC SUPRALINEAR FULL STOPSYRIAC SUBLINEAR FULL STOPONE DOT LEADERLISU LETTER TONE MYA TIVAI FULL STOPKHAROSHTHI PUNCTUATION DOTMUSICAL SYMBOL COMBINING AUGMENTATION DOT
d ԁ 𝐝 𝑑 𝒅 𝒹 𝓭 𝔡 𝕕 𝖉 𝖽 𝗱 𝘥 𝙙 𝚍                                                         
0064050113E7146F2146217EA4D21D41D1D4511D4851D4B91D4ED1D5211D5551D5891D5BD1D5F11D6251D6591D68D
LATIN SMALL LETTER DCYRILLIC SMALL LETTER KOMI DECHEROKEE LETTER TSUCANADIAN SYLLABICS KODOUBLE-STRUCK ITALIC SMALL DSMALL ROMAN NUMERAL FIVE HUNDREDLISU LETTER PHAMATHEMATICAL BOLD SMALL DMATHEMATICAL ITALIC SMALL DMATHEMATICAL BOLD ITALIC SMALL DMATHEMATICAL SCRIPT SMALL DMATHEMATICAL BOLD SCRIPT SMALL DMATHEMATICAL FRAKTUR SMALL DMATHEMATICAL DOUBLE-STRUCK SMALL DMATHEMATICAL BOLD FRAKTUR SMALL DMATHEMATICAL SANS-SERIF SMALL DMATHEMATICAL SANS-SERIF BOLD SMALL DMATHEMATICAL SANS-SERIF ITALIC SMALL DMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL DMATHEMATICAL MONOSPACE SMALL D
e е ҽ 𝐞 𝑒 𝒆 𝓮 𝔢 𝕖 𝖊 𝖾 𝗲 𝘦 𝙚 𝚎                                                         
0065043504BD212E212F2147AB321D41E1D4521D4861D4EE1D5221D5561D58A1D5BE1D5F21D6261D65A1D68EFF45
LATIN SMALL LETTER ECYRILLIC SMALL LETTER IECYRILLIC SMALL LETTER ABKHASIAN CHEESTIMATED SYMBOLSCRIPT SMALL EDOUBLE-STRUCK ITALIC SMALL ELATIN SMALL LETTER BLACKLETTER EMATHEMATICAL BOLD SMALL EMATHEMATICAL ITALIC SMALL EMATHEMATICAL BOLD ITALIC SMALL EMATHEMATICAL BOLD SCRIPT SMALL EMATHEMATICAL FRAKTUR SMALL EMATHEMATICAL DOUBLE-STRUCK SMALL EMATHEMATICAL BOLD FRAKTUR SMALL EMATHEMATICAL SANS-SERIF SMALL EMATHEMATICAL SANS-SERIF BOLD SMALL EMATHEMATICAL SANS-SERIF ITALIC SMALL EMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL EMATHEMATICAL MONOSPACE SMALL EFULLWIDTH LATIN SMALL LETTER E

Total raw values: 5,776,000

Too many raw items to process.


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions are purely on a character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.



Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0; Unicodeβ version: 16.0.0;