Unicode Character "墬" U+2F858 CJK Compatibility Ideograph-#
Unicode Version 15.1
墬
Summary
The unicode character "墬" at code point U+2F858 is CJK Compatibility Ideograph-#. It is a character in the CJK Compatibility Ideographs Supplement block and is part of the Han script. The character is an other letter. The UTF-8 encoding of "墬" is 0xF0 0xAF 0xA1 0x98 and the UTF-16 encoding is 0xD87E 0xDC58.
General Properties
Code Point | U+2F858 |
Version Added | 3.1 |
Name | CJK Compatibility Ideograph-# |
Block | CJK Compatibility Ideographs Supplement |
General Category | Other Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Left To Right |
Decomposition Type | Canonical |
Decomposition Mapping | "墬" U+58AC CJK Unified Ideograph-# |
Encodings
HTML Decimal Encoding | 墬 |
HTML Hex Encoding | 墬 |
UTF-8 Encoding | 0xF0 0xAF 0xA1 0x98 |
UTF-16 Encoding | 0xD87E 0xDC58 |
UTF-32 Encoding | 0x0002F858 |
C/C++/Java Escape | \ud87e\udc58 |
Unicode Properties
Full Composition Exclusion | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Ideographic |
East Asian Width | Wide |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "墬" U+58AC CJK Unified Ideograph-# |
NFKC Simple Casefold | "墬" U+58AC CJK Unified Ideograph-# |
Script | Han |
Script Extensions | Han |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Upright |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Other |
Sentence Break | OLetter |
Ideographic | Yes |
Unihan Properties
kCompatibilityVariant | U+58AC |
kIRG_TSource | T7-2176 |
kRSUnicode | 32.12 |
kTotalStrokes | 15 |