Unicode Character "‑" U+2011 Non-Breaking Hyphen

Unicode Version 15.1

Summary

The Unicode character "‑" at code point U+2011 is the Non-Breaking Hyphen. It belongs to the General Punctuation block and the Common script, and its general category is Dash Punctuation. Its UTF-8 encoding is 0xE2 0x80 0x91 and its UTF-16 encoding is 0x2011.

General Properties

Code Point U+2011
Version Added 1.1
Name Non-Breaking Hyphen
Block General Punctuation
General Category Dash Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class Other Neutral
Decomposition Type Nobreak
Decomposition Mapping "‐" U+2010 Hyphen
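
The general properties listed above can be cross-checked with Python's standard unicodedata module. This is a minimal sketch, not an authoritative source: the module reflects whichever Unicode version ships with the interpreter (not necessarily 15.1), and the variable names are illustrative only.

```python
import unicodedata

ch = "\u2011"  # NON-BREAKING HYPHEN

name = unicodedata.name(ch)                    # character name
category = unicodedata.category(ch)            # "Pd" = Dash Punctuation
bidi_class = unicodedata.bidirectional(ch)     # "ON" = Other Neutral
combining_class = unicodedata.combining(ch)    # 0 = Not Reordered
decomposition = unicodedata.decomposition(ch)  # type tag, then mapped code point

print(name, category, bidi_class, combining_class, decomposition)
```

The decomposition string pairs the type tag (here the nobreak tag) with the target code point in hexadecimal, which corresponds to the Decomposition Type and Decomposition Mapping rows.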

Encodings

HTML Decimal Encoding &#8209;
HTML Hex Encoding &#x2011;
UTF-8 Encoding 0xE2 0x80 0x91
UTF-16 Encoding 0x2011
UTF-32 Encoding 0x00002011
C/C++/Java Escape \u2011
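
The encodings above can be reproduced with a short Python sketch. It assumes big-endian byte order for UTF-16 and UTF-32 so that the bytes match the values in the table without a byte order mark; the variable names are illustrative only.

```python
import html

ch = "\u2011"  # NON-BREAKING HYPHEN

utf8 = ch.encode("utf-8")       # three bytes: E2 80 91
utf16 = ch.encode("utf-16-be")  # one 16-bit code unit: 20 11
utf32 = ch.encode("utf-32-be")  # one 32-bit code unit: 00 00 20 11

# Numeric character references, built from the code point value.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

# Both references decode back to the same character.
round_trip_ok = html.unescape(html_decimal) == ch and html.unescape(html_hex) == ch

print(utf8.hex(" "), utf16.hex(" "), utf32.hex(" "), html_decimal, html_hex)
```

Because U+2011 lies in the Basic Multilingual Plane, UTF-16 needs only a single code unit and no surrogate pair.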

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Non-breaking (“Glue”)
Changes When NFKC Casefolded Yes
NFKC Casefold "‐" U+2010 Hyphen
NFKC Simple Casefold "‐" U+2010 Hyphen
Script Common
Script Extensions Common
Indic Syllabic Category Consonant Placeholder
Pattern Syntax Yes
Dash Yes
Hyphen Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other
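
The quick-check and casefold rows above imply that canonical normalization leaves U+2011 untouched, while compatibility normalization replaces it with U+2010 Hyphen. A minimal Python sketch of that behaviour follows; the sample string "well‑known" is a made-up example, and Python has no built-in NFKC_Casefold, so only the standard normalization forms are shown.

```python
import unicodedata

ch = "\u2011"  # NON-BREAKING HYPHEN

# Canonical forms keep the character as-is (NFC and NFD Quick Check are Yes).
nfc = unicodedata.normalize("NFC", ch)
nfd = unicodedata.normalize("NFD", ch)

# Compatibility forms apply the nobreak decomposition to U+2010 HYPHEN.
nfkc = unicodedata.normalize("NFKC", ch)
nfkd = unicodedata.normalize("NFKD", ch)

# Hypothetical sample text: the non-breaking property is lost after NFKC.
sample = "well\u2011known"
folded = unicodedata.normalize("NFKC", sample)

print([f"U+{ord(c):04X}" for c in (nfc, nfd, nfkc, nfkd)])
```

One practical consequence is that text compared or indexed after NFKC normalization treats the non-breaking hyphen and the ordinary hyphen U+2010 as the same character, although neither becomes the ASCII hyphen-minus U+002D.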