Unicode Character "蘁" U+8601 CJK Unified Ideograph-8601

Unicode Version 15.1

蘁

Summary

The Unicode character "蘁" at code point U+8601 is named CJK Unified Ideograph-8601. It belongs to the CJK Unified Ideographs block and to the Han script, and its general category is Other Letter (Lo). The UTF-8 encoding of "蘁" is 0xE8 0x98 0x81 and the UTF-16 encoding is 0x8601.

General Properties

Code Point	U+8601
Version Added	1.1
Name	CJK Unified Ideograph-8601
Block	CJK Unified Ideographs
General Category	Other Letter
Canonical Combining Class	Not Reordered
Bidirectional Class	Left To Right

Encodings

HTML Decimal Encoding	&#34305;
HTML Hex Encoding	&#x8601;
UTF-8 Encoding	0xE8 0x98 0x81
UTF-16 Encoding	0x8601
UTF-32 Encoding	0x00008601
C/C++/Java Escape	\u8601
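
The encodings in this table can all be derived from the code point. The following is a minimal Python sketch, using only built-in string methods, that reproduces them. The helper names `hex_bytes` and `utf8_three_byte` are illustrative and not part of any library. The manual function only covers the three-byte UTF-8 range (U+0800 to U+FFFF) and does not reject surrogates.

```python
# Derive the encodings of U+8601 from the code point itself.
ch = "\u8601"  # the character 蘁
cp = ord(ch)   # integer code point, 0x8601


def hex_bytes(data):
    """Format a bytes object as space-separated 0xNN values."""
    return " ".join(f"0x{b:02X}" for b in data)


def utf8_three_byte(code_point):
    """Hand-rolled UTF-8 for U+0800..U+FFFF: 1110xxxx 10xxxxxx 10xxxxxx."""
    if not 0x0800 <= code_point <= 0xFFFF:
        raise ValueError("this sketch only handles the three-byte range")
    return bytes([
        0xE0 | (code_point >> 12),          # top 4 bits
        0x80 | ((code_point >> 6) & 0x3F),  # middle 6 bits
        0x80 | (code_point & 0x3F),         # low 6 bits
    ])


utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")  # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")

html_decimal = f"&#{cp};"    # decimal numeric character reference
html_hex = f"&#x{cp:X};"     # hexadecimal numeric character reference

print(hex_bytes(utf8))                  # 0xE8 0x98 0x81
print(hex_bytes(utf8_three_byte(cp)))   # 0xE8 0x98 0x81
print(hex_bytes(utf16))                 # 0x86 0x01
print(hex_bytes(utf32))                 # 0x00 0x00 0x86 0x01
print(html_decimal)                     # &#34305;
print(html_hex)                         # &#x8601;
```

Because U+8601 lies in the Basic Multilingual Plane, its UTF-16 form is a single 16-bit code unit equal to the code point, so no surrogate pair is needed.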

Unicode Properties

NFC Quick Check	Yes
NFD Quick Check	Yes
NFKC Quick Check	Yes
NFKD Quick Check	Yes
Numeric Type	None
Numeric Value	NaN
Line Break	Ideographic
East Asian Width	Wide
Script	Han
Script Extensions	Han
Indic Syllabic Category	Other
ID Start	Yes
XID Start	Yes
ID Continue	Yes
XID Continue	Yes
Alphabetic	Yes
Vertical Orientation	Upright
Grapheme Base	Yes
Grapheme Cluster Break	Other
Word Break	Other
Sentence Break	OLetter
Ideographic	Yes
Unified Ideograph	Yes
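
Several of these properties can be checked with Python's standard `unicodedata` module. This is only a sketch: the module exposes a subset of the properties listed here (for example, it has no accessors for Line Break or Script), and the values it reports depend on the Unicode version bundled with the interpreter. `unicodedata.is_normalized` requires Python 3.8 or later.

```python
import unicodedata

ch = "\u8601"  # the character 蘁

name = unicodedata.name(ch)                # algorithmically derived name
category = unicodedata.category(ch)        # General Category
combining = unicodedata.combining(ch)      # Canonical Combining Class
bidi = unicodedata.bidirectional(ch)       # Bidirectional Class
width = unicodedata.east_asian_width(ch)   # East Asian Width
numeric = unicodedata.numeric(ch, None)    # None when there is no numeric value

# The quick-check values of "Yes" imply the character is already
# normalized under every normalization form.
normalized = {
    form: unicodedata.is_normalized(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

print(name)        # CJK UNIFIED IDEOGRAPH-8601
print(category)    # Lo
print(combining)   # 0
print(bidi)        # L
print(width)       # W
print(numeric)     # None
print(normalized)  # {'NFC': True, 'NFD': True, 'NFKC': True, 'NFKD': True}
```

Python applies XID Start and XID Continue when validating identifiers, so `ch.isidentifier()` also returns `True` for this character.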

Unihan Properties

kBigFive	F4BA
kCCCII	23275A
kCNS1986	2-6966
kCNS1992	2-6966
kCangjie	TMGR
kCantonese	ng6
kCihaiT	1179.305
kDaeJaweon	1533.160
kFourCornerCode	4410.6
kGB5	7462
kGSR	0768d
kHanYu	53324.060
kHanyuPinyin	53324.060:wù,è
kIRGDaeJaweon	1533.160
kIRGHanyuDaZidian	53324.060
kIRGKangXi	1067.190
kIRG_GSource	G5-6A5E
kIRG_HSource	HB2-F4BA
kIRG_KPSource	KP1-7079
kIRG_KSource	K2-5A5D
kIRG_TSource	T2-6966
kJapanese	ガクゴ
kKangXi	1067.190
kMandarin	wù
kMojiJoho	MJ023139
kMorohashi	32409
kPhonetic	971
kRSUnicode	140.16
kTotalStrokes	19
kUnihanCore2020	HMT
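
Some Unihan fields pack more than one value into a single string. The sketch below splits two of them, assuming the field syntax described in the Unihan documentation: kRSUnicode as `radical.residual-strokes`, and kHanyuPinyin as space-separated entries of `locations:readings`, each side comma-separated. The function names are hypothetical, and the parsers ignore edge cases such as multiple kRSUnicode values.

```python
def parse_rs_unicode(value):
    """Split a kRSUnicode value into (radical number, residual strokes)."""
    radical, residual = value.split(".")
    # An apostrophe after the radical marks a simplified radical form.
    return int(radical.rstrip("'")), int(residual)


def parse_hanyu_pinyin(value):
    """Split a kHanyuPinyin value into (locations, readings) pairs."""
    entries = []
    for entry in value.split():
        locations, readings = entry.split(":")
        entries.append((locations.split(","), readings.split(",")))
    return entries


radical, residual = parse_rs_unicode("140.16")
total_strokes = 19  # kTotalStrokes from the table above

print(radical, residual)         # 140 16
print(total_strokes - residual)  # 3, the strokes left for the radical component
print(parse_hanyu_pinyin("53324.060:wù,è"))
# [(['53324.060'], ['wù', 'è'])]
```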