Unicode Character "砜" U+781C CJK Unified Ideograph-781C

Unicode Version 15.1

Summary

The Unicode character "砜" at code point U+781C is a CJK (Chinese, Japanese, Korean) ideograph meaning "an organic compound". It belongs to the CJK Unified Ideographs block and to the Han script, and its general category is Other Letter (Lo). The UTF-8 encoding of "砜" is 0xE7 0xA0 0x9C and the UTF-16 encoding is 0x781C.

General Properties

Code Point U+781C
Version Added 1.1
Name CJK Unified Ideograph-781C
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
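
The general properties listed above can be cross-checked with Python's standard unicodedata module. This is a minimal sketch, not how this page was generated: the variable names are illustrative, and the values returned depend on the Unicode version bundled with the Python build in use.

```python
import unicodedata

ch = "\u781c"  # the character described on this page

code_point = f"U+{ord(ch):04X}"          # code point in U+XXXX notation
name = unicodedata.name(ch)              # formal character name
category = unicodedata.category(ch)      # "Lo" is the abbreviation for Other Letter
combining = unicodedata.combining(ch)    # 0 corresponds to Not Reordered
bidi = unicodedata.bidirectional(ch)     # "L" corresponds to Left To Right

print(code_point, name, category, combining, bidi)
```

The module reports abbreviated property values (Lo, 0, L) rather than the long names used in the list above.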

Encodings

HTML Decimal Encoding &#30748;
HTML Hex Encoding &#x781C;
UTF-8 Encoding 0xE7 0xA0 0x9C
UTF-16 Encoding 0x781C
UTF-32 Encoding 0x0000781C
C/C++/Java Escape \u781c
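
The encodings above can be reproduced with a few lines of Python. This is an illustrative sketch that assumes big-endian byte order for UTF-16 and UTF-32, which matches the code unit values shown in the list; the variable names are not part of any specification.

```python
import html

ch = "\u781c"  # same escape form as the C/C++/Java escape above
cp = ord(ch)   # integer code point, 0x781C

utf8 = ch.encode("utf-8")        # three bytes for a BMP ideograph
utf16 = ch.encode("utf-16-be")   # one 16-bit code unit, no surrogates needed
utf32 = ch.encode("utf-32-be")   # one 32-bit code unit

html_decimal = f"&#{cp};"        # numeric character reference, decimal
html_hex = f"&#x{cp:X};"         # numeric character reference, hexadecimal

print(utf8.hex(" "), utf16.hex(), utf32.hex(), html_decimal, html_hex)

# Both references decode back to the original character.
decoded = (html.unescape(html_decimal), html.unescape(html_hex))
```

Because U+781C lies in the Basic Multilingual Plane, the UTF-16 form is a single code unit equal to the code point itself.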

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes
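
Several of the properties above can be spot-checked from Python's standard library. The sketch below is only an approximation of the listed properties: str.isidentifier and str.isalpha are Python-level checks that stand in for XID Start and Alphabetic here, unicodedata.is_normalized requires Python 3.8 or later, and the results depend on the Unicode version shipped with the interpreter.

```python
import unicodedata

ch = "\u781c"

width = unicodedata.east_asian_width(ch)   # "W" corresponds to Wide

# A character that passes all four quick checks is unchanged by every form.
normalized = {
    form: unicodedata.is_normalized(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

numeric = unicodedata.numeric(ch, None)    # None when no numeric value is defined
is_identifier = ch.isidentifier()          # usable as a Python identifier on its own
is_alpha = ch.isalpha()                    # Python's letter test

print(width, normalized, numeric, is_identifier, is_alpha)
```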

Unihan Properties

kCangjie MRHNK
kCantonese fung1
kDefinition an organic compound
kEACC 706D3F
kFourCornerCode 1761
kGB0 7731
kHanYu 42421.050
kIRGHanyuDaZidian 42421.050
kIRGKangXi 0828.241
kIRG_GSource G0-6D3F
kIRG_HSource H-98F3
kTGH 2013:4170
kKangXi 0828.241
kMainlandTelegraph 4277
kMandarin fēng
kRSUnicode 112.4
kSMSZD2003Index 468.14
kTGHZ2013 098.010:fēng
kTotalStrokes 9
kTraditionalVariant "碸" U+78B8 CJK Unified Ideograph-78B8
kUnihanCore2020 GH
kXHC1983 0331.040:fēng