Unicode Character "䄇" U+4107 CJK Unified Ideograph-#

Unicode Version 15.1

Summary

The unicode character "䄇" at code point U+4107 is a CJK (Chinese Japanese Korean) ideogram meaning "(corrupted form) a family name". It is a character in the CJK Unified Ideographs Extension A block and is part of the Han script. The character is an other letter. The UTF-8 encoding of "䄇" is 0xE4 0x84 0x87 and the UTF-16 encoding is 0x4107.

General Properties

Code Point U+4107
Version Added 3.0
Name CJK Unified Ideograph-#
Block CJK Unified Ideographs Extension A
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 䄇
HTML Hex Encoding 䄇
UTF-8 Encoding 0xE4 0x84 0x87
UTF-16 Encoding 0x4107
UTF-32 Encoding 0x00004107
C/C++/Java Escape \u4107

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes

Unihan Properties

kCangjie IFRHG
kCantonese cing4
kDefinition (corrupted form) a family name
kHanYu 42399.040
kHanyuPinyin 42399.040:chéng
kIRGHanyuDaZidian 42399.040
kIRGKangXi 0843.100
kIRG_GSource GKX-0843.10
kIRG_JSource JA-2538
kIRG_KSource K3-2D3D
kIRG_TSource T3-3D61
kIRG_VSource V2-8E3F
kJapanese テイ ジョウ
kKangXi 0843.100
kMandarin chéng
kMojiJoho MJ003049 MJ003049:E0100 MJ003050:E0101 MJ003048:E0102
kMorohashi 24707:E0102
kPhonetic 1348*
kRSUnicode 113.7
kTotalStrokes 12