Unicode Character "—" U+2014 Em Dash
Unicode Version 15.1
—
Summary
The unicode character "—" at code point U+2014 is Em Dash. It is a character in the General Punctuation block and is part of the Common script. The character is a dash punctuation. The UTF-8 encoding of "—" is 0xE2 0x80 0x94 and the UTF-16 encoding is 0x2014.
General Properties
Code Point | U+2014 |
Version Added | 1.1 |
Name | Em Dash |
Block | General Punctuation |
General Category | Dash Punctuation |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Other Neutral |
Encodings
HTML Entity | — |
HTML Decimal Encoding | — |
HTML Hex Encoding | — |
UTF-8 Encoding | 0xE2 0x80 0x94 |
UTF-16 Encoding | 0x2014 |
UTF-32 Encoding | 0x00002014 |
C/C++/Java Escape | \u2014 |