Unicode Character "𐄁" U+10101 Aegean Word Separator Dot

Unicode Version 15.1

𐄁

Summary

The Unicode character "𐄁" at code point U+10101 is Aegean Word Separator Dot. It belongs to the Aegean Numbers block and has the script property Common. Its general category is Other Punctuation (Po). The UTF-8 encoding of "𐄁" is 0xF0 0x90 0x84 0x81 and the UTF-16 encoding is 0xD800 0xDD01.

General Properties

Code Point	U+10101
Version Added	4.0
Name	Aegean Word Separator Dot
Block	Aegean Numbers
General Category	Other Punctuation
Canonical Combining Class	Not Reordered
Bidirectional Class	Other Neutral
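
The general properties above can be cross-checked against Python's standard unicodedata module. This is a minimal sketch; the dictionary name `properties` and its keys are illustrative, and the module returns short property aliases (Po, ON) and the numeric combining class (0) rather than the long names shown in the table.

```python
import unicodedata

ch = "\U00010101"  # AEGEAN WORD SEPARATOR DOT

# Collect the properties that unicodedata exposes for this character.
# The key names are illustrative and not part of any standard.
properties = {
    "code_point": f"U+{ord(ch):04X}",
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),    # "Po" = Other Punctuation
    "combining_class": unicodedata.combining(ch),    # 0 = Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),     # "ON" = Other Neutral
}

for key, value in properties.items():
    print(f"{key}: {value}")
```

Block, Script Extensions and Line Break are not available from unicodedata, so those rows cannot be checked this way.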

Encodings

HTML Decimal Encoding	&#65793;
HTML Hex Encoding	&#x10101;
UTF-8 Encoding	0xF0 0x90 0x84 0x81
UTF-16 Encoding	0xD800 0xDD01
UTF-32 Encoding	0x00010101
Java Escape	\ud800\udd01
C/C++ Escape	\U00010101
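
The encodings listed above can be reproduced with a few lines of Python. The sketch below uses only built-in string methods. The surrogate pair is also derived by hand to show where 0xD800 0xDD01 comes from: subtract 0x10000 from the code point, then split the 20-bit remainder into two 10-bit halves. Variable names are illustrative.

```python
ch = "\U00010101"
cp = ord(ch)  # 0x10101 == 65793

# Byte-level encodings; big-endian variants avoid a byte order mark.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Manual UTF-16 surrogate pair computation for a supplementary code point.
offset = cp - 0x10000             # 20-bit value, here 0x00101
high = 0xD800 + (offset >> 10)    # top 10 bits select the high surrogate
low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits select the low surrogate

# HTML numeric character references.
html_decimal = f"&#{cp};"
html_hex = f"&#x{cp:X};"

print(utf8.hex(" "), utf16.hex(" "), utf32.hex(" "))
print(f"0x{high:04X} 0x{low:04X}", html_decimal, html_hex)
```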

Unicode Properties

NFC Quick Check	Yes
NFD Quick Check	Yes
NFKC Quick Check	Yes
NFKD Quick Check	Yes
Numeric Type	None
Numeric Value	NaN
Line Break	Break After
Script	Common
Script Extensions	Cypro Minoan, Cypriot, Linear B
Indic Syllabic Category	Other
Vertical Orientation	Rotated
Grapheme Base	Yes
Grapheme Cluster Break	Other
Word Break	Other
Sentence Break	Other
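
The normalization and numeric rows above can be spot-checked with Python's unicodedata module. This is a rough sketch rather than a full Quick Check implementation: it only confirms that the single character is unchanged by each normalization form, which is what a Quick Check value of Yes implies for it, and that the character has no numeric value. The names `unchanged` and `numeric` are illustrative.

```python
import unicodedata

ch = "\U00010101"

# A character with Quick Check = Yes in a form is left as-is by that form.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = {form: unicodedata.normalize(form, ch) == ch for form in forms}

# Numeric Type is None, so numeric() falls back to the supplied default.
numeric = unicodedata.numeric(ch, None)

print(unchanged)
print(numeric)
```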