Unicode Character "ｿ" U+FF7F Halfwidth Katakana Letter So
Unicode Version 15.1
ｿ
Summary
The Unicode character "ｿ" at code point U+FF7F is Halfwidth Katakana Letter So. It belongs to the Halfwidth and Fullwidth Forms block and to the Katakana script, and its general category is Other Letter (Lo). Its UTF-8 encoding is 0xEF 0xBD 0xBF, and its UTF-16 encoding is the single code unit 0xFF7F.
General Properties
Code Point | U+FF7F |
Version Added | 1.1 |
Name | Halfwidth Katakana Letter So |
Block | Halfwidth and Fullwidth Forms |
General Category | Other Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Left To Right |
Decomposition Type | Narrow |
Decomposition Mapping | "ソ" U+30BD Katakana Letter So |
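The general properties above can be checked against a local Unicode database. The following is a minimal sketch using Python's standard `unicodedata` module; the variable name `ch` is only illustrative, and the results depend on the Unicode version bundled with the interpreter, although these particular properties have been stable since the character was added.

```python
import unicodedata

# U+FF7F written as an escape so the source file stays ASCII-only.
ch = "\uff7f"

name = unicodedata.name(ch)                    # formal character name
category = unicodedata.category(ch)            # two-letter general category
combining = unicodedata.combining(ch)          # canonical combining class (0 = Not Reordered)
bidi = unicodedata.bidirectional(ch)           # bidirectional class
decomposition = unicodedata.decomposition(ch)  # decomposition type and mapping

print(name)           # HALFWIDTH KATAKANA LETTER SO
print(category)       # Lo
print(combining)      # 0
print(bidi)           # L
print(decomposition)  # <narrow> 30BD
```

The `<narrow>` tag marks a compatibility decomposition, which is why only the NFKC and NFKD normalization forms replace the character with U+30BD.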
Encodings
HTML Decimal Encoding | &#65407; |
HTML Hex Encoding | &#xFF7F; |
UTF-8 Encoding | 0xEF 0xBD 0xBF |
UTF-16 Encoding | 0xFF7F |
UTF-32 Encoding | 0x0000FF7F |
C/C++/Java Escape | \uff7f |
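The byte sequences in the table can be reproduced with Python's built-in codecs. This is a small sketch, not a reference implementation; it assumes Python 3.8 or later for the separator argument of `bytes.hex`, and it uses the big-endian codecs so that no byte order mark is prepended.

```python
ch = "\uff7f"  # HALFWIDTH KATAKANA LETTER SO

utf8 = ch.encode("utf-8")       # three bytes for code points U+0800..U+FFFF
utf16 = ch.encode("utf-16-be")  # one 16-bit code unit (BMP character)
utf32 = ch.encode("utf-32-be")  # one 32-bit code unit

# Numeric character references are built from the code point itself.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8.hex(" ").upper())  # EF BD BF
print(utf16.hex().upper())    # FF7F
print(utf32.hex().upper())    # 0000FF7F
print(html_decimal)           # &#65407;
print(html_hex)               # &#xFF7F;
```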
Unicode Properties
NFC Quick Check | Yes |
NFD Quick Check | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Ideographic |
East Asian Width | Halfwidth |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "ソ" U+30BD Katakana Letter So |
NFKC Simple Casefold | "ソ" U+30BD Katakana Letter So |
Script | Katakana |
Script Extensions | Katakana |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Rotated |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Katakana |
Sentence Break | OLetter |
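Several of the properties above have practical consequences for text processing: the character is unchanged by NFC and NFD, is folded to the fullwidth U+30BD by NFKC, and occupies a single display column in East Asian contexts. The sketch below illustrates this with Python's `unicodedata` module; the names `half` and `full` are illustrative only.

```python
import unicodedata

half = "\uff7f"  # HALFWIDTH KATAKANA LETTER SO
full = "\u30bd"  # KATAKANA LETTER SO

# NFC and NFD quick check "Yes": canonical normalization leaves the character as-is.
nfc = unicodedata.normalize("NFC", half)
nfd = unicodedata.normalize("NFD", half)

# Compatibility normalization applies the <narrow> mapping.
nfkc = unicodedata.normalize("NFKC", half)

# East Asian Width: "H" (Halfwidth) for U+FF7F, "W" (Wide) for U+30BD.
width_half = unicodedata.east_asian_width(half)
width_full = unicodedata.east_asian_width(full)

# Numeric Type is None, so a caller-supplied default is returned.
numeric = unicodedata.numeric(half, None)

# XID Start is Yes, so the character can begin an identifier.
is_identifier = half.isidentifier()

print(nfc == half, nfd == half)  # True True
print(nfkc == full)              # True
print(width_half, width_full)    # H W
print(numeric)                   # None
print(is_identifier)             # True
```

Search and comparison code that should treat halfwidth and fullwidth katakana as equal would typically normalize both sides to NFKC first; code that must preserve the original width should stay with NFC.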