Unicode Character "、" U+3001 Ideographic Comma

Unicode Version 15.1

Summary

The unicode character "、" at code point U+3001 is Ideographic Comma. It is a character in the CJK Symbols and Punctuation block and is part of the Common script. The character is an other punctuation. The UTF-8 encoding of "、" is 0xE3 0x80 0x81 and the UTF-16 encoding is 0x3001.

General Properties

Code Point U+3001
Version Added 1.1
Name Ideographic Comma
Block CJK Symbols and Punctuation
General Category Other Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class Other Neutral

Encodings

HTML Decimal Encoding 、
HTML Hex Encoding 、
UTF-8 Encoding 0xE3 0x80 0x81
UTF-16 Encoding 0x3001
UTF-32 Encoding 0x00003001
C/C++/Java Escape \u3001

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Close Punctuation
East Asian Width Wide
Script Common
Script Extensions Bopomofo Hangul Han Hiragana Katakana Yi
Indic Syllabic Category Other
Pattern Syntax Yes
Terminal Punctuation Yes
Vertical Orientation Transformed Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break SContinue