Unicode Character "ﺜ" U+FE9C Arabic Letter Theh Medial Form
Unicode Version 15.1
ﺜ
Summary
The Unicode character "ﺜ" (code point U+FE9C) is Arabic Letter Theh Medial Form. It belongs to the Arabic Presentation Forms-B block and is part of the Arabic script. Its general category is Other Letter (Lo). The UTF-8 encoding of "ﺜ" is 0xEF 0xBA 0x9C and the UTF-16 encoding is 0xFE9C.
General Properties
Code Point | U+FE9C |
Version Added | 1.1 |
Name | Arabic Letter Theh Medial Form |
Unicode 1.0 Name | Glyph for Medial Arabic Thaa |
Block | Arabic Presentation Forms-B |
General Category | Other Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Arabic Letter |
Decomposition Type | Medial |
Decomposition Mapping | "ث" U+062B Arabic Letter Theh |
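The general properties above can be read back from a Unicode database at runtime. The following is a minimal sketch using Python's standard `unicodedata` module. The variable names `ch` and `properties` are illustrative only, and the Unicode version bundled with a given Python release may differ from 15.1, although these properties have not changed since the character was added.

```python
import unicodedata

ch = "\ufe9c"  # ARABIC LETTER THEH MEDIAL FORM

# Collect the properties that unicodedata exposes directly.
# 'properties' is an illustrative name, not part of any API.
properties = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # two-letter General Category code
    "combining": unicodedata.combining(ch),          # Canonical Combining Class as an int
    "bidirectional": unicodedata.bidirectional(ch),  # Bidirectional Class abbreviation
    "decomposition": unicodedata.decomposition(ch),  # "<type> mapping" as a string
}

for key, value in properties.items():
    print(f"{key}: {value}")
```

The module reports abbreviated values: "Lo" corresponds to Other Letter, "AL" to Arabic Letter, a combining class of 0 to Not Reordered, and "<medial> 062B" to the Medial decomposition onto U+062B.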
Encodings
HTML Decimal Encoding | &#65180; |
HTML Hex Encoding | &#xFE9C; |
UTF-8 Encoding | 0xEF 0xBA 0x9C |
UTF-16 Encoding | 0xFE9C |
UTF-32 Encoding | 0x0000FE9C |
C/C++/Java Escape | \ufe9c |
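The encodings in this table can be reproduced with Python's built-in codecs. This is a small sketch rather than a reference implementation. The helper `hex_bytes` is a hypothetical name introduced here for formatting, and the big-endian codecs are chosen (an assumption) so that no byte order mark is emitted.

```python
import html

ch = "\ufe9c"  # same escape form as the C/C++/Java row


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values (illustrative helper)."""
    return " ".join(f"0x{b:02X}" for b in data)


utf8 = hex_bytes(ch.encode("utf-8"))
# UTF-16 and UTF-32 are shown byte by byte here; the table lists the
# same values as a single 16-bit or 32-bit code unit.
utf16 = hex_bytes(ch.encode("utf-16-be"))
utf32 = hex_bytes(ch.encode("utf-32-be"))

# Numeric character references built from the code point.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8, utf16, utf32, html_decimal, html_hex, sep="\n")

# Both references decode back to the original character.
assert html.unescape(html_decimal) == ch
assert html.unescape(html_hex) == ch
```

Because U+FE9C lies in the Basic Multilingual Plane, UTF-16 needs a single code unit and no surrogate pair, and UTF-8 needs three bytes.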
Unicode Properties
NFC Quick Check | Yes |
NFD Quick Check | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Alphabetic |
Changes When NFKC Casefolded | Yes |
NFKC Casefold | "ث" U+062B Arabic Letter Theh |
NFKC Simple Casefold | "ث" U+062B Arabic Letter Theh |
Script | Arabic |
Script Extensions | Arabic |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Rotated |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Alphabetic letter |
Sentence Break | OLetter |
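Several of the properties above describe how the character behaves under normalization and in identifiers. The sketch below checks a few of them with Python's standard library. It rests on two assumptions: that `str.casefold()` followed by NFKC is an adequate stand-in for the NFKC Casefold mapping of this one character (it is not the full algorithm), and that `str.isidentifier()` reflects the XID Start and XID Continue properties. All variable names are illustrative.

```python
import unicodedata

ch = "\ufe9c"    # ARABIC LETTER THEH MEDIAL FORM
base = "\u062b"  # ARABIC LETTER THEH, the decomposition target

# NFC and NFD leave the character alone (Quick Check: Yes), because its
# decomposition is a compatibility mapping, not a canonical one.
nfc_unchanged = unicodedata.normalize("NFC", ch) == ch
nfd_unchanged = unicodedata.normalize("NFD", ch) == ch

# Compatibility normalization replaces the presentation form.
nfkc = unicodedata.normalize("NFKC", ch)

# Approximation of NFKC Casefold for this character only.
nfkc_casefold = unicodedata.normalize("NFKC", ch.casefold())

# The second argument is the default returned when no numeric value exists.
numeric_value = unicodedata.numeric(ch, None)

is_alpha = ch.isalpha()            # Alphabetic
is_identifier = ch.isidentifier()  # usable as a one-character identifier

print(nfc_unchanged, nfd_unchanged, nfkc == base, nfkc_casefold == base)
print(numeric_value, is_alpha, is_identifier)
```

In practice this means that text containing U+FE9C compares equal to text containing U+062B only after NFKC or NFKC Casefold has been applied to both sides.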