Unicode Character "蠁" U+2F9C1 CJK Compatibility Ideograph-#
Unicode Version 15.1
蠁
Summary
The unicode character "蠁" at code point U+2F9C1 is CJK Compatibility Ideograph-#. It is a character in the CJK Compatibility Ideographs Supplement block and is part of the Han script. The character is an other letter. The UTF-8 encoding of "蠁" is 0xF0 0xAF 0xA7 0x81 and the UTF-16 encoding is 0xD87E 0xDDC1.
General Properties
Code Point | U+2F9C1 |
Version Added | 3.1 |
Name | CJK Compatibility Ideograph-# |
Block | CJK Compatibility Ideographs Supplement |
General Category | Other Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Left To Right |
Decomposition Type | Canonical |
Decomposition Mapping | "蠁" U+8801 CJK Unified Ideograph-# |
Encodings
HTML Decimal Encoding | 蠁 |
HTML Hex Encoding | 蠁 |
UTF-8 Encoding | 0xF0 0xAF 0xA7 0x81 |
UTF-16 Encoding | 0xD87E 0xDDC1 |
UTF-32 Encoding | 0x0002F9C1 |
C/C++/Java Escape | \ud87e\uddc1 |
Unicode Properties
Full Composition Exclusion | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Ideographic |
East Asian Width | Wide |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "蠁" U+8801 CJK Unified Ideograph-# |
NFKC Simple Casefold | "蠁" U+8801 CJK Unified Ideograph-# |
Script | Han |
Script Extensions | Han |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Upright |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Other |
Sentence Break | OLetter |
Ideographic | Yes |
Unihan Properties
kCompatibilityVariant | U+8801 |
kIRG_TSource | T3-5B2D |
kRSUnicode | 142.13 |
kTotalStrokes | 19 |