Unicode Character "ü" U+00FC Latin Small Letter U with Diaeresis
Unicode Version 15.1
ü
Summary
The Unicode character "ü" at code point U+00FC is Latin Small Letter U with Diaeresis. It belongs to the Latin-1 Supplement block and the Latin script, and its general category is Lowercase Letter. The UTF-8 encoding of "ü" is the two bytes 0xC3 0xBC, and the UTF-16 encoding is the single code unit 0x00FC.
General Properties
Code Point | U+00FC |
Version Added | 1.1 |
Name | Latin Small Letter U with Diaeresis |
Unicode 1.0 Name | Latin Small Letter U Diaeresis |
Block | Latin-1 Supplement |
General Category | Lowercase Letter |
Canonical Combining Class | Not Reordered |
Bidirectional Class | Left To Right |
Decomposition Type | Canonical |
Decomposition Mapping | "u" U+0075 Latin Small Letter U "̈" U+0308 Combining Diaeresis |
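The canonical decomposition listed above can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not part of the original reference; the variable names are illustrative, and the results depend on the Unicode data shipped with the Python build in use.

```python
import unicodedata

u_umlaut = "\u00fc"  # precomposed ü (U+00FC)

# NFD applies the canonical decomposition: U+0075 followed by U+0308.
decomposed = unicodedata.normalize("NFD", u_umlaut)
code_points = [f"U+{ord(ch):04X}" for ch in decomposed]

# NFC recomposes the pair into the single code point U+00FC.
recomposed = unicodedata.normalize("NFC", decomposed)

# Raw decomposition mapping and combining class from the character database.
mapping = unicodedata.decomposition(u_umlaut)
combining_class = unicodedata.combining(u_umlaut)

print(code_points)             # ['U+0075', 'U+0308']
print(recomposed == u_umlaut)  # True
print(mapping)                 # 0075 0308
print(combining_class)         # 0
```

The precomposed and decomposed forms render identically but compare unequal as strings, so normalizing both sides to the same form before comparison is usually the safer choice.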
Encodings
HTML Entity | &uuml; |
HTML Decimal Encoding | &#252; |
HTML Hex Encoding | &#xFC; |
UTF-8 Encoding | 0xC3 0xBC |
UTF-16 Encoding | 0x00FC |
UTF-32 Encoding | 0x000000FC |
C/C++/Java Escape | \u00fc |
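The encodings in this table can be reproduced with a short Python sketch. It assumes big-endian byte order for UTF-16 and UTF-32, because the table lists code unit values rather than a byte order; the variable names are illustrative only.

```python
import html

ch = "\u00fc"  # the same escape form works in Python string literals

# Encode to bytes; the "-be" codecs emit big-endian bytes without a BOM.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

print(utf8.hex(" "))  # c3 bc
print(utf16.hex())    # 00fc
print(utf32.hex())    # 000000fc

# The named, decimal, and hexadecimal HTML references all decode to ü.
references = ["&uuml;", "&#252;", "&#xFC;"]
decoded = [html.unescape(ref) for ref in references]
print(all(d == ch for d in decoded))  # True
```

With the plain `utf-16` or `utf-32` codecs Python prepends a byte order mark, so the output would be longer than the values shown in the table.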
Unicode Properties
NFC Quick Check | Yes |
NFKC Quick Check | Yes |
Expands On NFD | Yes |
Expands On NFKD | Yes |
Numeric Type | None |
Numeric Value | NaN |
Line Break | Alphabetic |
East Asian Width | Ambiguous |
Lowercase | Yes |
Simple Uppercase Code Point | "Ü" U+00DC Latin Capital Letter U with Diaeresis |
Simple Titlecase Code Point | "Ü" U+00DC Latin Capital Letter U with Diaeresis |
Uppercase Code Point | "Ü" U+00DC Latin Capital Letter U with Diaeresis |
Titlecase Code Point | "Ü" U+00DC Latin Capital Letter U with Diaeresis |
Cased | Yes |
Changes When Casemapped | Yes |
Changes When Titlecased | Yes |
Changes When Uppercased | Yes |
Script | Latin |
Script Extensions | Latin |
Indic Syllabic Category | Other |
ID Start | Yes |
XID Start | Yes |
ID Continue | Yes |
XID Continue | Yes |
Alphabetic | Yes |
Vertical Orientation | Rotated |
Grapheme Base | Yes |
Grapheme Cluster Break | Other |
Word Break | Alphabetic letter |
Sentence Break | Lower |
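Several of the properties above can be spot-checked from Python's standard library. This is a rough sketch that covers only what `str` methods and `unicodedata` expose; properties such as Line Break, Word Break, or Vertical Orientation are not available there and would need a third-party library.

```python
import unicodedata

ch = "\u00fc"

# Case mappings: both uppercase and titlecase map to U+00DC (Ü).
upper = ch.upper()
title = ch.title()

# General category, bidirectional class, and East Asian width.
category = unicodedata.category(ch)       # "Ll" = Lowercase Letter
bidi = unicodedata.bidirectional(ch)      # "L"  = Left To Right
width = unicodedata.east_asian_width(ch)  # "A"  = Ambiguous

# No numeric value: the supplied default is returned instead of raising.
numeric = unicodedata.numeric(ch, None)

print(f"U+{ord(upper):04X}", f"U+{ord(title):04X}")  # U+00DC U+00DC
print(category, bidi, width)                         # Ll L A
print(ch.islower(), ch.isalpha(), ch.isidentifier()) # True True True
print(numeric)                                       # None
```

`str.isidentifier()` returning True for a single character reflects its XID Start status, since Python identifiers are defined in terms of XID_Start and XID_Continue.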