Unicode Character "倎" U+500E CJK Unified Ideograph-500E

Unicode Version 15.1

Summary

The Unicode character "倎" at code point U+500E is named CJK Unified Ideograph-500E. It belongs to the CJK Unified Ideographs block and to the Han script. Its general category is Other Letter (Lo). The UTF-8 encoding of "倎" is 0xE5 0x80 0x8E and the UTF-16 encoding is 0x500E.

General Properties

Code Point U+500E
Version Added 1.1
Name CJK Unified Ideograph-500E
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
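
The general properties above can be cross-checked with Python's standard unicodedata module. This is a minimal sketch, not part of the original listing; the values it returns depend on the Unicode version bundled with the interpreter, although these particular properties have been stable for this code point. The variable names are illustrative only.

```python
import unicodedata

ch = "\u500e"  # 倎

# Names of CJK unified ideographs are derived from the code point,
# so the module builds the name algorithmically.
name = unicodedata.name(ch)

category = unicodedata.category(ch)    # two-letter General Category code
combining = unicodedata.combining(ch)  # 0 corresponds to "Not Reordered"
bidi = unicodedata.bidirectional(ch)   # "L" corresponds to "Left To Right"

print(f"U+{ord(ch):04X}", name, category, combining, bidi)
```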

Encodings

HTML Decimal Encoding &#20494;
HTML Hex Encoding &#x500E;
UTF-8 Encoding 0xE5 0x80 0x8E
UTF-16 Encoding 0x500E
UTF-32 Encoding 0x0000500E
C/C++/Java Escape \u500e
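
The encodings listed above can be reproduced in a few lines of Python. This sketch uses the big-endian codecs so that no byte order mark is emitted; the variable names are illustrative, and the HTML references are built by hand from the code point rather than taken from any library.

```python
import html

ch = "\u500e"  # 倎

# Byte sequences for each encoding form (big-endian, no BOM).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Numeric character references as they would be written in HTML source.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8.hex(" "), utf16.hex(" "), utf32.hex(" "))
print(html_decimal, html_hex)

# Both references decode back to the original character.
round_trip_ok = html.unescape(html_decimal) == ch == html.unescape(html_hex)
```

U+500E lies in the Basic Multilingual Plane, so UTF-16 needs a single 16-bit code unit and no surrogate pair.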

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes
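
Several of the properties above can be observed from Python's standard library. The sketch below is an illustration under some assumptions: it performs a full normalization and compares the result, which is stronger than the quick-check property itself, and it uses str.isidentifier() as a practical stand-in for the XID Start property, since Python's identifier rules are based on it.

```python
import unicodedata

ch = "\u500e"  # 倎

# A character whose quick check is "Yes" for a form is left unchanged
# when normalized to that form.
forms = ("NFC", "NFD", "NFKC", "NFKD")
stable = {form: unicodedata.normalize(form, ch) == ch for form in forms}

width = unicodedata.east_asian_width(ch)  # "W" corresponds to "Wide"
numeric = unicodedata.numeric(ch, None)   # None when there is no numeric value
is_identifier = ch.isidentifier()         # usable as a one-character identifier

print(stable, width, numeric, is_identifier)
```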

Unihan Properties

kBigFive D0DD
kCCCII 216663
kCNS1986 2-2D7E
kCNS1992 2-2D7E
kCangjie OTBC
kCantonese tin2
kDaeJaweon 0229.010
kFourCornerCode 2528.1
kGB5 1818
kHanYu 10178.020
kHanyuPinyin 10178.020:tiǎn
kIRGDaeJaweon 0229.010
kIRGHanyuDaZidian 10178.020
kIRGKangXi 0107.210
kIRG_GSource G5-3232
kIRG_HSource HB2-D0DD
kIRG_JSource J13-2E3D
kIRG_KPSource KP1-35CC
kIRG_KSource K2-225C
kIRG_TSource T2-2D7E
kJapanese テン
kJIS0213 1,14,29
kJapaneseKun ATSUI
kJapaneseOn TEN
kJis1 1747
kKangXi 0107.210
kKorean CEN
kMandarin tiǎn
kMojiJoho MJ006828
kMorohashi 00761
kRSAdobe_Japan1_6 C+16783+9.2.8
kRSUnicode 9.8
kTotalStrokes 10
kUnihanCore2020 HMT
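
Some Unihan fields are structured strings rather than plain values. The sketch below shows one way to read two of them, using the values listed above. The helper names are hypothetical, the parsing covers only the simple shapes seen here, and the stroke count of 2 for radical 9 (人) is supplied by hand as an assumption rather than read from the data.

```python
# Assumption: Kangxi radical 9 (人) is counted with 2 strokes.
RADICAL_9_STROKES = 2


def parse_rs_unicode(value):
    """Split a kRSUnicode value such as '9.8' into (radical, residual strokes)."""
    radical, _, residual = value.partition(".")
    # An apostrophe after the radical number marks a simplified radical form.
    return int(radical.rstrip("'")), int(residual)


def parse_hanyu_pinyin(value):
    """Split a kHanyuPinyin value into (dictionary locations, readings)."""
    locations, _, readings = value.partition(":")
    return locations.split(","), readings.split(",")


radical, residual = parse_rs_unicode("9.8")
total_strokes = RADICAL_9_STROKES + residual  # compare with kTotalStrokes

locations, readings = parse_hanyu_pinyin("10178.020:tiǎn")

print(radical, residual, total_strokes, locations, readings)
```

With these inputs the radical strokes plus the residual strokes give 10, which agrees with the kTotalStrokes entry.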