Unicode Character "硿" U+787F CJK Unified Ideograph-#

Unicode Version 15.1

Summary

The unicode character "硿" at code point U+787F is CJK Unified Ideograph-#. It is a character in the CJK Unified Ideographs block and is part of the Han script. The character is an other letter. The UTF-8 encoding of "硿" is 0xE7 0xA1 0xBF and the UTF-16 encoding is 0x787F.

General Properties

Code Point U+787F
Version Added 1.1
Name CJK Unified Ideograph-#
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 硿
HTML Hex Encoding 硿
UTF-8 Encoding 0xE7 0xA1 0xBF
UTF-16 Encoding 0x787F
UTF-32 Encoding 0x0000787F
C/C++/Java Escape \u787f

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes

Unihan Properties

kBigFive B851
kCCCII 22646C
kCNS1986 1-6558
kCNS1992 1-6558
kCangjie MRJCM
kCantonese hung1
kCihaiT 969.206
kDaeJaweon 1248.030
kEACC 22646C
kFourCornerCode 1361.1
kGB3 5627
kHanYu 42440.080
kHanyuPinyin 42440.080:kōng
kIICore AT
kIRGDaeJaweon 1248.030
kIRGHanyuDaZidian 42440.080
kIRGKangXi 0831.280
kIRG_GSource G3-583B
kIRG_HSource HB1-B851
kIRG_KPSource KP1-6020
kIRG_KSource K2-4C71
kIRG_TSource T1-6558
kJapanese コウ ク
kTGH 2013:7509
kKangXi 0831.280
kMandarin kōng
kMojiJoho MJ018541
kMorohashi 24259
kRSUnicode 112.8
kSBGY 027.29
kTGHZ2013 198.080:kòng
kTotalStrokes 13
kUnihanCore2020 GHMT