Unicode Character " " U+2001 Em Quad
Unicode Version 15.1
Summary
The unicode character " " at code point U+2001 is Em Quad. It is a character in the General Punctuation block and is part of the Common script. The character is a space separator. The UTF-8 encoding of " " is 0xE2 0x80 0x81 and the UTF-16 encoding is 0x2001.
General Properties
Code Point | U+2001 |
Version Added | 1.1 |
Name | Em Quad |
Block | General Punctuation |
General Category | Space Separator |
Canonical Combining Class | Not Reordered |
Bidirectional Class | White Space |
Decomposition Type | Canonical |
Decomposition Mapping | " " U+2003 Em Space |
Encodings
HTML Decimal Encoding |   |
HTML Hex Encoding |   |
UTF-8 Encoding | 0xE2 0x80 0x81 |
UTF-16 Encoding | 0x2001 |
UTF-32 Encoding | 0x00002001 |
C/C++/Java Escape | \u2001 |
Unicode Properties
Full Composition Exclusion | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Break After |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "SP" U+0020 Space |
NFKC Simple Casefold | "SP" U+0020 Space |
Script | Common |
Script Extensions | Common |
Indic Syllabic Category | Other |
White Space | Yes |
Vertical Orientation | Rotated |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | WSegSpace |
Sentence Break | Sp |