Unicode Character "‫" U+202B Right-to-Left Embedding

Unicode Version 15.1

Summary

The unicode character "‫" at code point U+202B is Right-to-Left Embedding. It is a character in the General Punctuation block and is part of the Common script. The character is a format. The UTF-8 encoding of "‫" is 0xE2 0x80 0xAB and the UTF-16 encoding is 0x202B.

General Properties

Code Point U+202B
Version Added 1.1
Name Right-to-Left Embedding
Block General Punctuation
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Right To Left Embedding
Bidirectional Control Yes
Alias RLE (abbreviation)

Encodings

HTML Decimal Encoding ‫
HTML Hex Encoding ‫
UTF-8 Encoding 0xE2 0x80 0xAB
UTF-16 Encoding 0x202B
UTF-32 Encoding 0x0000202B
C/C++/Java Escape \u202b

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Cluster Break Control
Word Break Format
Sentence Break Format