Unicode Character "、" U+3001 Ideographic Comma
Unicode Version 15.1
、
Summary
The Unicode character "、" at code point U+3001 is the Ideographic Comma. It belongs to the CJK Symbols and Punctuation block and to the Common script. Its general category is Other Punctuation (Po). Its UTF-8 encoding is 0xE3 0x80 0x81, and its UTF-16 encoding is 0x3001.
General Properties
Code Point | U+3001 |
Version Added | 1.1 |
Name | Ideographic Comma |
Block | CJK Symbols and Punctuation |
General Category | Other Punctuation |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Other Neutral |
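The general properties listed above can be looked up programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the variable names `ch` and `props` are illustrative only, and the values returned depend on the Unicode version bundled with the Python build in use.

```python
import unicodedata

ch = "\u3001"  # IDEOGRAPHIC COMMA

# Collect the properties that the standard library exposes directly.
props = {
    "code_point": f"U+{ord(ch):04X}",
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),   # short alias, e.g. "Po"
    "combining_class": unicodedata.combining(ch),   # 0 means Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),    # short alias, e.g. "ON"
}

for key, value in props.items():
    print(f"{key}: {value}")
```

Note that `unicodedata` reports the short property aliases (`Po`, `ON`) rather than the long names shown in the table.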
Encodings
HTML Decimal Encoding | &#12289; |
HTML Hex Encoding | &#x3001; |
UTF-8 Encoding | 0xE3 0x80 0x81 |
UTF-16 Encoding | 0x3001 |
UTF-32 Encoding | 0x00003001 |
C/C++/Java Escape | \u3001 |
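The byte sequences and character references in this table can be reproduced with a short Python sketch. It assumes big-endian byte order for UTF-16 and UTF-32 (the `-be` codecs) so that no byte order mark is emitted; the variable names are illustrative.

```python
import html

ch = "\u3001"  # IDEOGRAPHIC COMMA

# Encode to each Unicode encoding form (big-endian, no BOM).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Build the HTML numeric character references from the code point.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print("UTF-8: ", utf8.hex(" ").upper())
print("UTF-16:", utf16.hex(" ").upper())
print("UTF-32:", utf32.hex(" ").upper())
print("HTML:  ", html_decimal, html_hex)

# Both references decode back to the original character.
round_trip_ok = html.unescape(html_decimal) == ch == html.unescape(html_hex)
```

Because U+3001 lies in the Basic Multilingual Plane, UTF-16 needs a single 16-bit code unit and no surrogate pair.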
Unicode Properties
NFC Quick Check | Yes |
NFD Quick Check | Yes |
NFKC Quick Check | Yes |
NFKD Quick Check | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Close Punctuation |
East Asian Width | Wide |
Script | Common |
Script Extensions | Bopomofo Hangul Han Hiragana Katakana Yi |
Indic Syllabic Category | Other |
Pattern Syntax | Yes |
Terminal Punctuation | Yes |
Vertical Orientation | Transformed Upright |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Other |
Sentence Break | SContinue |
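Some of the properties above can be checked from Python's standard library. This is a hedged sketch: `unicodedata` covers normalization, East Asian Width, and numeric value, but it does not expose properties such as Line Break, Script Extensions, or the segmentation classes, which would require a third-party library or the Unicode Character Database files. The variable names are illustrative.

```python
import unicodedata

ch = "\u3001"  # IDEOGRAPHIC COMMA

# A quick-check value of Yes means the character is unchanged
# by normalization to that form.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = {form: unicodedata.normalize(form, ch) == ch for form in forms}

# East Asian Width uses the short alias: "W" means Wide.
width = unicodedata.east_asian_width(ch)

# Characters with Numeric Type None have no numeric value;
# the second argument is returned instead of raising ValueError.
numeric = unicodedata.numeric(ch, None)

print(unchanged)
print("East Asian Width:", width)
print("Numeric value:", numeric)
```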