Unicode Character "‌" U+200C Zero Width Non-Joiner

Unicode Version 15.1

Summary

The Unicode character "‌" at code point U+200C is Zero Width Non-Joiner (ZWNJ). It belongs to the General Punctuation block and the Inherited script, and its general category is Format. The UTF-8 encoding of "‌" is 0xE2 0x80 0x8C and the UTF-16 encoding is 0x200C.

General Properties

Code Point U+200C
Version Added 1.1
Name Zero Width Non-Joiner
Block General Punctuation
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
Alias ZWNJ (abbreviation)
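
The general properties listed above can be looked up programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the variable names are illustrative, and the values returned depend on the Unicode version bundled with the interpreter, although these particular properties have been stable for this character.

```python
import unicodedata

ZWNJ = "\u200c"  # Zero Width Non-Joiner

name = unicodedata.name(ZWNJ)                  # formal character name
category = unicodedata.category(ZWNJ)          # "Cf" is the short code for Format
combining = unicodedata.combining(ZWNJ)        # 0 means Not Reordered
bidi_class = unicodedata.bidirectional(ZWNJ)   # "BN" is Boundary Neutral

print(name, category, combining, bidi_class)
```

The module reports short property codes (`Cf`, `BN`) rather than the long names used in the listing.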

Encodings

HTML Entity &zwnj;
HTML Decimal Encoding &#8204;
HTML Hex Encoding &#x200C;
UTF-8 Encoding 0xE2 0x80 0x8C
UTF-16 Encoding 0x200C
UTF-32 Encoding 0x0000200C
C/C++/Java Escape \u200c
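
As a quick check of the encodings above, the byte sequences can be reproduced with Python's built-in codecs. This is a sketch, not a reference implementation: the big-endian codec variants are chosen here only so that no byte order mark is emitted, and the three HTML forms tested (`&zwnj;`, `&#8204;`, `&#x200C;`) are the standard named, decimal, and hexadecimal references for this code point.

```python
import html

ZWNJ = "\u200c"  # Zero Width Non-Joiner

# Encode with explicit big-endian codecs so no BOM is prepended.
utf8 = ZWNJ.encode("utf-8")
utf16 = ZWNJ.encode("utf-16-be")
utf32 = ZWNJ.encode("utf-32-be")

print(utf8.hex(" "))  # e2 80 8c
print(utf16.hex())    # 200c
print(utf32.hex())    # 0000200c

# The named, decimal, and hexadecimal HTML references all decode to U+200C.
decoded = [html.unescape(s) for s in ("&zwnj;", "&#8204;", "&#x200C;")]
```

Because the character is invisible, printing it directly is rarely helpful; inspecting the encoded bytes or the code point is a more reliable way to confirm it is present in a string.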

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Join Control Yes
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Inherited
Script Extensions Inherited
Indic Syllabic Category Non Joiner
ID Continue Yes
Other ID Continue Yes
XID Continue Yes
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
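
The normalization and numeric entries in the list above can be spot-checked with Python's `unicodedata` module. This is a minimal sketch under the assumption of Python 3.8 or later (for `is_normalized`); note that `is_normalized` performs a full check on the one-character string rather than reading the quick-check property itself, so it only confirms that the results are consistent with the "Yes" values listed.

```python
import unicodedata

ZWNJ = "\u200c"  # Zero Width Non-Joiner

# A lone ZWNJ should already be in every normalization form.
forms = ("NFC", "NFD", "NFKC", "NFKD")
normalized = {form: unicodedata.is_normalized(form, ZWNJ) for form in forms}

# No numeric value is defined, so the supplied default is returned.
numeric_value = unicodedata.numeric(ZWNJ, None)

# An empty string means the character has no decomposition mapping.
decomposition = unicodedata.decomposition(ZWNJ)

print(normalized, numeric_value, repr(decomposition))
```

Properties such as Grapheme Cluster Break, Word Break, and Line Break are not exposed by `unicodedata`; checking those would require the Unicode Character Database files or a third-party library.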