Unicode Character "" U+E0001 Language Tag
Unicode Version 15.1
Summary
The unicode character "" at code point U+E0001 is Language Tag. It is a character in the Tags block and is part of the Common script. The character is a format. The UTF-8 encoding of "" is 0xF3 0xA0 0x80 0x81 and the UTF-16 encoding is 0xDB40 0xDC01.
General Properties
Code Point | U+E0001 |
Version Added | 3.1 |
Name | Language Tag |
Block | Tags |
General Category | Format |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Boundary Neutral |
Encodings
HTML Decimal Encoding | 󠀁 |
HTML Hex Encoding | 󠀁 |
UTF-8 Encoding | 0xF3 0xA0 0x80 0x81 |
UTF-16 Encoding | 0xDB40 0xDC01 |
UTF-32 Encoding | 0x000E0001 |
C/C++/Java Escape | \udb40\udc01 |