Unicode Character " " U+2001 Em Quad

Unicode Version 15.1

Summary

The Unicode character " " at code point U+2001 is Em Quad. It belongs to the General Punctuation block and the Common script, and its general category is Space Separator. The UTF-8 encoding of " " is 0xE2 0x80 0x81 and the UTF-16 encoding is 0x2001.

General Properties

Code Point U+2001
Version Added 1.1
Name Em Quad
Block General Punctuation
General Category Space Separator
Canonical Combining Class Not Reordered
Bidirectional Class White Space
Decomposition Type Canonical
Decomposition Mapping " " U+2003 Em Space
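
The general properties listed above can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not part of the original listing. The variable names are illustrative, and the results depend on the Unicode database bundled with the Python build in use, although these properties have been stable for U+2001 since it was added.

```python
import unicodedata

EM_QUAD = "\u2001"

# Look up the basic properties recorded in the Unicode Character Database.
name = unicodedata.name(EM_QUAD)               # character name
category = unicodedata.category(EM_QUAD)       # general category abbreviation
bidi = unicodedata.bidirectional(EM_QUAD)      # bidirectional class abbreviation
combining = unicodedata.combining(EM_QUAD)     # canonical combining class (0 = not reordered)
decomp = unicodedata.decomposition(EM_QUAD)    # decomposition mapping as hex code points

# The decomposition is canonical, so NFC and NFD both replace
# Em Quad with Em Space (U+2003).
nfc = unicodedata.normalize("NFC", EM_QUAD)
nfd = unicodedata.normalize("NFD", EM_QUAD)

print(name, category, bidi, combining, decomp)
print(f"NFC -> U+{ord(nfc):04X}, NFD -> U+{ord(nfd):04X}")
```

Because the mapping is a canonical singleton, any text passed through NFC or NFD normalization will no longer contain U+2001.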

Encodings

HTML Decimal Encoding &#8193;
HTML Hex Encoding &#x2001;
UTF-8 Encoding 0xE2 0x80 0x81
UTF-16 Encoding 0x2001
UTF-32 Encoding 0x00002001
C/C++/Java Escape \u2001
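
The encodings above can be reproduced in Python as a quick check. This is an illustrative sketch using only built-in string methods and the standard `html` module. Big-endian codecs are assumed so that no byte order mark is emitted.

```python
import html

EM_QUAD = "\u2001"

# Encode to bytes and show them as hex strings.
utf8_hex = EM_QUAD.encode("utf-8").hex()        # three bytes
utf16_hex = EM_QUAD.encode("utf-16-be").hex()   # one 16-bit code unit
utf32_hex = EM_QUAD.encode("utf-32-be").hex()   # one 32-bit code unit

# Build the numeric character references from the code point.
html_decimal = f"&#{ord(EM_QUAD)};"
html_hex = f"&#x{ord(EM_QUAD):X};"

# Decoding either reference yields the original character.
decoded = html.unescape(html_decimal)

print(utf8_hex, utf16_hex, utf32_hex)
print(html_decimal, html_hex)
```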

Unicode Properties

Full Composition Exclusion Yes
Numeric Type None
Numeric Value NaN
Line Break Break After
Changes When NFKC Casefolded Yes
NFKC Casefold "SP" U+0020 Space
NFKC Simple Casefold "SP" U+0020 Space
Script Common
Script Extensions Common
Indic Syllabic Category Other
White Space Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break WSegSpace
Sentence Break Sp
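
The White Space and NFKC Casefold entries above have practical consequences for text processing. The sketch below shows them in Python. Note that applying NFKC normalization followed by `str.casefold()` is only an approximation of the NFKC_Casefold property, used here on the assumption that it is sufficient for this character; the sample string is illustrative.

```python
import unicodedata

EM_QUAD = "\u2001"

# White Space = Yes: Python treats the character as whitespace.
is_space = EM_QUAD.isspace()

# Whitespace-aware methods therefore split and strip on it.
words = f"em{EM_QUAD}quad".split()
stripped = f"{EM_QUAD}text{EM_QUAD}".strip()

# Compatibility normalization folds Em Quad to an ordinary space (U+0020).
# NFKC followed by casefold() approximates NFKC_Casefold for this character.
folded = unicodedata.normalize("NFKC", EM_QUAD).casefold()

print(is_space, words, repr(stripped), f"U+{ord(folded):04X}")
```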