Unicode Character "萯" U+842F CJK Unified Ideograph-842F

Unicode Version 15.1

Summary

The Unicode character "萯" at code point U+842F is CJK Unified Ideograph-842F. It belongs to the CJK Unified Ideographs block and is part of the Han script. Its general category is Other Letter (Lo). The UTF-8 encoding of "萯" is 0xE8 0x90 0xAF and the UTF-16 encoding is 0x842F.

General Properties

Code Point U+842F
Version Added 1.1
Name CJK Unified Ideograph-842F
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
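
The general properties listed above can be cross-checked with Python's standard `unicodedata` module. This is a minimal sketch, not an authoritative source: the values come from the Unicode database bundled with the interpreter, and the variable names are illustrative only.

```python
import unicodedata

ch = "\u842f"  # the character described on this page

name = unicodedata.name(ch)           # formal character name
category = unicodedata.category(ch)   # "Lo" is the short alias for Other Letter
ccc = unicodedata.combining(ch)       # 0 corresponds to Not Reordered
bidi = unicodedata.bidirectional(ch)  # "L" corresponds to Left To Right

print(f"U+{ord(ch):04X}", name, category, ccc, bidi)
```

For CJK unified ideographs the formal name is derived from the code point, which is why it ends in the hexadecimal value 842F.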

Encodings

HTML Decimal Encoding &#33839;
HTML Hex Encoding &#x842F;
UTF-8 Encoding 0xE8 0x90 0xAF
UTF-16 Encoding 0x842F
UTF-32 Encoding 0x0000842F
C/C++/Java Escape \u842f
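
The encodings in this table can be reproduced from the code point alone. The following sketch assumes big-endian byte order for UTF-16 and UTF-32 so that the bytes match the values shown above; `hex_bytes` is a helper introduced here for display only.

```python
ch = "\u842f"
code_point = ord(ch)  # 0x842F, which is 33839 in decimal

# Encode without a byte order mark; "-be" selects big-endian byte order.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# HTML numeric character references in decimal and hexadecimal form.
html_decimal = f"&#{code_point};"
html_hex = f"&#x{code_point:X};"


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values (illustrative helper)."""
    return " ".join(f"0x{b:02X}" for b in data)


print("UTF-8 ", hex_bytes(utf8))
print("UTF-16", hex_bytes(utf16))
print("UTF-32", hex_bytes(utf32))
print("HTML  ", html_decimal, html_hex)
```

Because U+842F lies in the Basic Multilingual Plane, UTF-16 needs a single 16-bit code unit and no surrogate pair, and UTF-8 needs three bytes.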

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes
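
Several of the properties above have observable consequences in Python's standard library. The sketch below is a rough check under two assumptions: that `str.isidentifier` follows the XID Start and XID Continue properties, and that `str.isalpha` tests the letter general categories rather than the Alphabetic property itself.

```python
import unicodedata

ch = "\u842f"

# A Quick Check value of Yes means normalization leaves the character unchanged.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = {form: unicodedata.normalize(form, ch) == ch for form in forms}

# "W" is the short alias for East Asian Width: Wide.
width = unicodedata.east_asian_width(ch)

# Numeric Type None: no numeric value exists, so the supplied default is returned.
numeric = unicodedata.numeric(ch, None)

# Identifier checks: alone (start position) and after an ASCII letter (continue position).
can_start_identifier = ch.isidentifier()
can_continue_identifier = ("a" + ch).isidentifier()

# Letter check based on general category (Lo counts as a letter).
is_letter = ch.isalpha()

print(unchanged, width, numeric, can_start_identifier, can_continue_identifier, is_letter)
```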

Unihan Properties

kBigFive DFCA
kCCCII 23232B
kCNS1986 2-466F
kCNS1992 2-466F
kCangjie TNBC
kCantonese fu6
kCihaiT 1149.306
kFourCornerCode 4480.6
kGB3 7281
kGSR 1000c
kHanYu 53253.070
kHanyuPinyin 53253.070:fù,bèi
kIRGHanyuDaZidian 53253.070
kIRGKangXi 1042.390
kIRG_GSource G3-6871
kIRG_HSource HB2-DFCA
kIRG_JSource J1-584D
kIRG_KPSource KP1-6E8B
kIRG_KSource K2-5843
kIRG_TSource T2-466F
kJapanese フウ フ ハイ ベ ブ バイ
kJapaneseKun KARASUURI
kJapaneseOn FUU FU HAI BAI
kJis1 5645
kKangXi 1042.390
kMandarin
kMojiJoho MJ022260 MJ022260:E0101 MJ022261:E0102
kMorohashi 31343
kRSAdobe_Japan1_6 C+22336+140.3.9
kRSUnicode 140.9
kSBGY 322.38
kSimplifiedVariant "𰰷" U+30C37 CJK Unified Ideograph-30C37
kTotalStrokes 12
kUnihanCore2020 HMT
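
The radical and stroke fields above are mutually consistent, which the following sketch illustrates. It assumes the field layouts described in the Unihan documentation: kRSUnicode as radical.residual-strokes, and kRSAdobe_Japan1_6 as C or V, then the CID, then radical.radical-strokes.residual-strokes. The values are copied from the listing above, the variable names are hypothetical, and the sketch ignores the apostrophe that kRSUnicode may carry after the radical number for other characters.

```python
import re

# Values copied from the Unihan listing for U+842F.
k_rs_unicode = "140.9"
k_rs_adobe_japan1_6 = "C+22336+140.3.9"
k_total_strokes = 12

# kRSUnicode: radical number, then the stroke count excluding the radical.
radical, residual = (int(part) for part in k_rs_unicode.split("."))

# kRSAdobe_Japan1_6: type, CID, radical, strokes in the radical, residual strokes.
match = re.fullmatch(r"([CV])\+(\d+)\+(\d+)\.(\d+)\.(\d+)", k_rs_adobe_japan1_6)
if match is None:
    raise ValueError("unexpected kRSAdobe_Japan1_6 layout")

cid = int(match.group(2))
adobe_radical = int(match.group(3))
radical_strokes = int(match.group(4))
adobe_residual = int(match.group(5))

# Radical strokes plus residual strokes should equal the total stroke count.
computed_total = radical_strokes + adobe_residual

print(radical, residual, cid, radical_strokes, computed_total == k_total_strokes)
```

Here radical 140 contributes 3 strokes and the remainder contributes 9, which agrees with the kTotalStrokes value of 12.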