Unicode Character "ẞ" U+1E9E Latin Capital Letter Sharp S

Unicode Version 15.1

Summary

The unicode character "ẞ" at code point U+1E9E is Latin Capital Letter Sharp S. It is a character in the Latin Extended Additional block and is part of the Latin script. The character is an uppercase letter. The UTF-8 encoding of "ẞ" is 0xE1 0xBA 0x9E and the UTF-16 encoding is 0x1E9E.

General Properties

Code Point U+1E9E
Version Added 5.1
Name Latin Capital Letter Sharp S
Block Latin Extended Additional
General Category Uppercase Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding ẞ
HTML Hex Encoding ẞ
UTF-8 Encoding 0xE1 0xBA 0x9E
UTF-16 Encoding 0x1E9E
UTF-32 Encoding 0x00001E9E
C/C++/Java Escape \u1e9e

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Uppercase Yes
Simple Lowercase Code Point "ß" U+00DF Latin Small Letter Sharp S
Lowercase Code Point "ß" U+00DF Latin Small Letter Sharp S
Simple Case Folding "ß" U+00DF Latin Small Letter Sharp S
Case Folding "s" U+0073 Latin Small Letter S
"s" U+0073 Latin Small Letter S
Cased Yes
Changes When Casefolded Yes
Changes When Casemapped Yes
Changes When Lowercased Yes
Changes When NFKC Casefolded Yes
NFKC Casefold "s" U+0073 Latin Small Letter S
"s" U+0073 Latin Small Letter S
NFKC Simple Casefold "ß" U+00DF Latin Small Letter Sharp S
Script Latin
Script Extensions Latin
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break Upper