Unicode Utilities: UnicodeSet

Unmarked properties are from Unicode V15.1.0; the beta properties are from Unicode V16.0.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input
              

128 Code Points


[\u0000-\u0008\u000E-\u001F\u007F \u0009-\u000D \u0020 _ \- , ; \: ! ? . ' " ( ) \[ \] \{ \} @ * / \\ \& # % ` \^ + <-> | ~ \$ 0-9 aA bB cC dD eE fF gG hH iI jJ kK lL mM nN oO pP qQ rR sS tT uU vV wW xX yY zZ]


Basic LatinC0 controls
items: 32

  U+0000NUL; NULL
  U+0001SOH; START OF HEADING
  U+0002START OF TEXT; STX
  U+0003END OF TEXT; ETX
  U+0004END OF TRANSMISSION; EOT
  U+0005ENQ; ENQUIRY
  U+0006ACK; ACKNOWLEDGE
  U+0007ALERT; BEL
  U+0008BACKSPACE; BS
   U+0009CHARACTER TABULATION; HORIZONTAL TABULATION; HT; TAB
   U+000AEND OF LINE; EOL; LF; LINE FEED; NEW LINE; NL
   U+000BLINE TABULATION; VERTICAL TABULATION; VT
   U+000CFF; FORM FEED
   U+000DCARRIAGE RETURN; CR
  U+000ELOCKING-SHIFT ONE; SHIFT OUT; SO
  U+000FLOCKING-SHIFT ZERO; SHIFT IN; SI
  U+0010DATA LINK ESCAPE; DLE
  U+0011DC1; DEVICE CONTROL ONE
  U+0012DC2; DEVICE CONTROL TWO
  U+0013DC3; DEVICE CONTROL THREE
  U+0014DC4; DEVICE CONTROL FOUR
  U+0015NAK; NEGATIVE ACKNOWLEDGE
  U+0016SYN; SYNCHRONOUS IDLE
  U+0017END OF TRANSMISSION BLOCK; ETB
  U+0018CAN; CANCEL
  U+0019EM; END OF MEDIUM; EOM
  U+001ASUB; SUBSTITUTE
  U+001BESC; ESCAPE
  U+001CFILE SEPARATOR; FS; INFORMATION SEPARATOR FOUR
  U+001DGROUP SEPARATOR; GS; INFORMATION SEPARATOR THREE
  U+001EINFORMATION SEPARATOR TWO; RECORD SEPARATOR; RS
  U+001FINFORMATION SEPARATOR ONE; UNIT SEPARATOR; US

Basic LatinASCII punctuation and symbols
items: 21

   U+0020SPACE
 ! U+0021EXCLAMATION MARK
 " U+0022QUOTATION MARK
 # U+0023NUMBER SIGN
 $ U+0024DOLLAR SIGN
 % U+0025PERCENT SIGN
 & U+0026AMPERSAND
 ' U+0027APOSTROPHE
 ( U+0028LEFT PARENTHESIS
 ) U+0029RIGHT PARENTHESIS
 * U+002AASTERISK
 [ U+005BLEFT SQUARE BRACKET
 \ U+005CREVERSE SOLIDUS
 ] U+005DRIGHT SQUARE BRACKET
 ^ U+005ECIRCUMFLEX ACCENT
 _ U+005FLOW LINE
 ` U+0060GRAVE ACCENT
 { U+007BLEFT CURLY BRACKET
 | U+007CVERTICAL LINE
 } U+007DRIGHT CURLY BRACKET
 ~ U+007ETILDE

Basic LatinASCII math operator
items: 1

 + U+002BPLUS SIGN

Basic LatinASCII punctuation
items: 8

 , U+002CCOMMA
 - U+002DHYPHEN-MINUS
 . U+002EFULL STOP
 / U+002FSOLIDUS
 : U+003ACOLON
 ; U+003BSEMICOLON
 ? U+003FQUESTION MARK
 @ U+0040COMMERCIAL AT

Basic LatinASCII digits
items: 10

 0 U+0030DIGIT ZERO
 1 U+0031DIGIT ONE
 2 U+0032DIGIT TWO
 3 U+0033DIGIT THREE
 4 U+0034DIGIT FOUR
 5 U+0035DIGIT FIVE
 6 U+0036DIGIT SIX
 7 U+0037DIGIT SEVEN
 8 U+0038DIGIT EIGHT
 9 U+0039DIGIT NINE

Basic LatinASCII mathematical operators
items: 3

 < U+003CLESS-THAN SIGN
 = U+003DEQUALS SIGN
 > U+003EGREATER-THAN SIGN

Basic LatinUppercase Latin alphabet
items: 26

 A U+0041LATIN CAPITAL LETTER A
 B U+0042LATIN CAPITAL LETTER B
 C U+0043LATIN CAPITAL LETTER C
 D U+0044LATIN CAPITAL LETTER D
 E U+0045LATIN CAPITAL LETTER E
 F U+0046LATIN CAPITAL LETTER F
 G U+0047LATIN CAPITAL LETTER G
 H U+0048LATIN CAPITAL LETTER H
 I U+0049LATIN CAPITAL LETTER I
 J U+004ALATIN CAPITAL LETTER J
 K U+004BLATIN CAPITAL LETTER K
 L U+004CLATIN CAPITAL LETTER L
 M U+004DLATIN CAPITAL LETTER M
 N U+004ELATIN CAPITAL LETTER N
 O U+004FLATIN CAPITAL LETTER O
 P U+0050LATIN CAPITAL LETTER P
 Q U+0051LATIN CAPITAL LETTER Q
 R U+0052LATIN CAPITAL LETTER R
 S U+0053LATIN CAPITAL LETTER S
 T U+0054LATIN CAPITAL LETTER T
 U U+0055LATIN CAPITAL LETTER U
 V U+0056LATIN CAPITAL LETTER V
 W U+0057LATIN CAPITAL LETTER W
 X U+0058LATIN CAPITAL LETTER X
 Y U+0059LATIN CAPITAL LETTER Y
 Z U+005ALATIN CAPITAL LETTER Z

Basic LatinLowercase Latin alphabet
items: 26

 a U+0061LATIN SMALL LETTER A
 b U+0062LATIN SMALL LETTER B
 c U+0063LATIN SMALL LETTER C
 d U+0064LATIN SMALL LETTER D
 e U+0065LATIN SMALL LETTER E
 f U+0066LATIN SMALL LETTER F
 g U+0067LATIN SMALL LETTER G
 h U+0068LATIN SMALL LETTER H
 i U+0069LATIN SMALL LETTER I
 j U+006ALATIN SMALL LETTER J
 k U+006BLATIN SMALL LETTER K
 l U+006CLATIN SMALL LETTER L
 m U+006DLATIN SMALL LETTER M
 n U+006ELATIN SMALL LETTER N
 o U+006FLATIN SMALL LETTER O
 p U+0070LATIN SMALL LETTER P
 q U+0071LATIN SMALL LETTER Q
 r U+0072LATIN SMALL LETTER R
 s U+0073LATIN SMALL LETTER S
 t U+0074LATIN SMALL LETTER T
 u U+0075LATIN SMALL LETTER U
 v U+0076LATIN SMALL LETTER V
 w U+0077LATIN SMALL LETTER W
 x U+0078LATIN SMALL LETTER X
 y U+0079LATIN SMALL LETTER Y
 z U+007ALATIN SMALL LETTER Z

Basic LatinControl character
items: 1

  U+007FDEL; DELETE

Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0; Unicodeβ version: 16.0.0;