Unicode Character "夆" U+2F85C CJK Compatibility Ideograph-2F85C
Unicode Version 15.1
夆
Summary
The Unicode character "夆" at code point U+2F85C is CJK Compatibility Ideograph-2F85C. It belongs to the CJK Compatibility Ideographs Supplement block and the Han script, and its general category is Other Letter. The UTF-8 encoding of "夆" is 0xF0 0xAF 0xA1 0x9C and the UTF-16 encoding is 0xD87E 0xDC5C.
General Properties
Code Point | U+2F85C |
Version Added | 3.1 |
Name | CJK Compatibility Ideograph-2F85C |
Block | CJK Compatibility Ideographs Supplement |
General Category | Other Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Left To Right |
Decomposition Type | Canonical |
Decomposition Mapping | "夆" U+5906 CJK Unified Ideograph-5906 |
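
The canonical decomposition listed above can be checked with Python's standard `unicodedata` module. This is a minimal sketch; the variable names are illustrative, and the results depend on the Unicode data bundled with the interpreter, although this mapping has been stable since the character was added.

```python
import unicodedata

compat = "\U0002F85C"   # CJK Compatibility Ideograph-2F85C
unified = "\u5906"      # CJK Unified Ideograph-5906

# A canonical mapping is reported without a <tag> prefix such as <compat>.
mapping = unicodedata.decomposition(compat)
print(mapping)

# The mapping is a singleton and the character is excluded from composition,
# so all four normalization forms replace it with the unified ideograph.
normalized = {
    form: unicodedata.normalize(form, compat)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}
for form, text in normalized.items():
    print(form, f"U+{ord(text):04X}")
```

Because even NFC rewrites the character, text that must preserve U+2F85C exactly should not be normalized.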
Encodings
HTML Decimal Encoding | &#194652; |
HTML Hex Encoding | &#x2F85C; |
UTF-8 Encoding | 0xF0 0xAF 0xA1 0x9C |
UTF-16 Encoding | 0xD87E 0xDC5C |
UTF-32 Encoding | 0x0002F85C |
Java Escape | \ud87e\udc5c |
C/C++ Escape | \U0002F85C |
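
The encoded forms in this table follow directly from the code point. The sketch below reproduces them in Python and computes the UTF-16 surrogate pair by hand; the variable names are illustrative only.

```python
ch = "\U0002F85C"
cp = ord(ch)

# Byte sequences produced by the standard codecs (big-endian, no BOM).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Manual surrogate pair: subtract 0x10000, then split the remaining
# 20 bits into a high and a low group of 10 bits each.
offset = cp - 0x10000
high = 0xD800 + (offset >> 10)
low = 0xDC00 + (offset & 0x3FF)

# HTML numeric character references.
html_decimal = f"&#{cp};"
html_hex = f"&#x{cp:X};"

print(" ".join(f"0x{b:02X}" for b in utf8))
print(f"0x{high:04X} 0x{low:04X}")
print(f"0x{cp:08X}")
print(html_decimal, html_hex)
```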
Unicode Properties
Full Composition Exclusion | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Ideographic |
East Asian Width | Wide |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "夆" U+5906 CJK Unified Ideograph-5906 |
NFKC Simple Casefold | "夆" U+5906 CJK Unified Ideograph-5906 |
Script | Han |
Script Extensions | Han |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Upright |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Other |
Sentence Break | OLetter |
Ideographic | Yes |
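
Several of the properties above are exposed by Python's `unicodedata` module and can be inspected as follows. This is a sketch rather than a complete property dump: the standard library does not cover properties such as Line Break or Word Break, and the `nfkc_casefold` value here is only an approximation obtained by case folding and then applying NFKC, which matches the formal NFKC_Casefold mapping for this character.

```python
import unicodedata

ch = "\U0002F85C"

props = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # General Category
    "combining": unicodedata.combining(ch),          # Canonical Combining Class
    "bidirectional": unicodedata.bidirectional(ch),  # Bidirectional Class
    "east_asian_width": unicodedata.east_asian_width(ch),
    "numeric": unicodedata.numeric(ch, None),        # None when there is no numeric value
    "is_identifier": ch.isidentifier(),              # relies on XID Start / XID Continue
}

# Approximation of NFKC Casefold: case fold first, then normalize to NFKC.
nfkc_casefold = unicodedata.normalize("NFKC", ch.casefold())

for key, value in props.items():
    print(f"{key}: {value}")
print(f"nfkc_casefold: U+{ord(nfkc_casefold):04X}")
```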
Unihan Properties
kCompatibilityVariant | U+5906 |
kIRG_TSource | T5-2362 |
kRSUnicode | 34.4 |
kTotalStrokes | 7 |