Unicode Character "䰄" U+4C04 CJK Unified Ideograph-4C04

Unicode Version 15.1

Summary

The Unicode character "䰄" at code point U+4C04 is a CJK (Chinese, Japanese, Korean) ideograph meaning "short hair, bearded; with lots of beard, whiskers". It belongs to the CJK Unified Ideographs Extension A block and to the Han script. Its general category is Other Letter (Lo). In UTF-8 it is encoded as the three bytes 0xE4 0xB0 0x84, and in UTF-16 as the single code unit 0x4C04.

General Properties

Code Point U+4C04
Version Added 3.0
Name CJK Unified Ideograph-4C04
Block CJK Unified Ideographs Extension A
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding &#19460;
HTML Hex Encoding &#x4C04;
UTF-8 Encoding 0xE4 0xB0 0x84
UTF-16 Encoding 0x4C04
UTF-32 Encoding 0x00004C04
C/C++/Java Escape \u4c04
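
The encodings listed above can all be derived from the code point. The following is a minimal Python sketch of that derivation; the helper name `hex_bytes` is illustrative and is not part of any library.

```python
# Derive the encodings of U+4C04 from its code point.
ch = chr(0x4C04)

# Big-endian variants are used so that no byte order mark is emitted.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values (illustrative helper)."""
    return " ".join(f"0x{b:02X}" for b in data)


# HTML numeric character references and the \u escape form.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"
escape = f"\\u{ord(ch):04x}"

print(hex_bytes(utf8))    # 0xE4 0xB0 0x84
print(hex_bytes(utf16))   # 0x4C 0x04
print(hex_bytes(utf32))   # 0x00 0x00 0x4C 0x04
print(html_decimal)       # &#19460;
print(html_hex)           # &#x4C04;
print(escape)             # \u4c04
```

Because U+4C04 lies in the Basic Multilingual Plane, it needs no surrogate pair in UTF-16 and takes exactly three bytes in UTF-8.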

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes
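
Several of the properties above can be checked with Python's standard `unicodedata` module. This is only a sketch: the module exposes a subset of the Unicode Character Database, so properties such as Line Break or Vertical Orientation are not covered, and the results depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

ch = "\u4c04"

name = unicodedata.name(ch)             # 'CJK UNIFIED IDEOGRAPH-4C04'
category = unicodedata.category(ch)     # 'Lo' (Other Letter)
bidi = unicodedata.bidirectional(ch)    # 'L'  (Left To Right)
ccc = unicodedata.combining(ch)         # 0    (Not Reordered)
width = unicodedata.east_asian_width(ch)  # 'W' (Wide)

# The character has no numeric value, so the supplied default is returned.
numeric = unicodedata.numeric(ch, None)

# All four Quick Check values are Yes, so every normalization form
# leaves the character unchanged.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = all(unicodedata.normalize(form, ch) == ch for form in forms)

# XID Start is Yes, so the character alone is a valid Python identifier.
is_identifier = ch.isidentifier()

print(name, category, bidi, ccc, width, numeric, unchanged, is_identifier)
```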

Unihan Properties

kCangjie SHWP
kCantonese soi1
kCihaiT 1519.210
kDefinition short hair, bearded; with lots of beard, whiskers
kHanYu 74530.120
kHanyuPinyin 74530.120:sāi,shì
kIRGHanyuDaZidian 74530.120
kIRGKangXi 1455.200
kIRG_GSource G3-7B55
kIRG_HSource H-A06F
kIRG_KPSource KP1-8B98
kIRG_TSource T4-624E
kJapanese サイ シ
kKangXi 1455.200
kMandarin sāi
kMojiJoho MJ005882
kMorohashi 45501
kPhonetic 1174*
kRSUnicode 190.9
kTotalStrokes 19
kUnihanCore2020 H
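
Two of the Unihan fields above are structured values. kRSUnicode gives a radical number and a residual stroke count separated by a period, and kHanyuPinyin gives one or more dictionary locations, a colon, then comma-separated readings. The sketch below splits both; the function names `parse_rs_unicode` and `parse_hanyu_pinyin` are hypothetical, and the code assumes a single value per field, as is the case for this character.

```python
def parse_rs_unicode(value):
    """Split a single kRSUnicode value into (radical, residual strokes).

    Hypothetical helper. An apostrophe after the radical number marks a
    variant radical form and is dropped here.
    """
    radical, residual = value.split(".")
    return int(radical.rstrip("'")), int(residual)


def parse_hanyu_pinyin(value):
    """Split a single kHanyuPinyin value into (locations, readings).

    Hypothetical helper. Locations and readings are comma-separated
    lists joined by a colon.
    """
    locations, readings = value.split(":")
    return locations.split(","), readings.split(",")


print(parse_rs_unicode("190.9"))
# (190, 9)
print(parse_hanyu_pinyin("74530.120:sāi,shì"))
# (['74530.120'], ['sāi', 'shì'])
```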