Unicode Character "¨" U+00A8 Diaeresis

Unicode Version 15.1

¨

Summary

The unicode character "¨" at code point U+00A8 is Diaeresis. It is a character in the Latin-1 Supplement block and is part of the Common script. The character is a modifier symbol. The UTF-8 encoding of "¨" is 0xC2 0xA8 and the UTF-16 encoding is 0x00A8.

General Properties

Code Point U+00A8
Version Added 1.1
Name Diaeresis
Unicode 1.0 Name Spacing Diaeresis
Block Latin-1 Supplement
General Category Modifier Symbol
Canonical Combining Class Not Reordered
Bidirectional Class Other Neutral
Decomposition Type Compat
Decomposition Mapping "SP" U+0020 Space
"̈" U+0308 Combining Diaeresis

Encodings

HTML Entity ¨
¨
¨
¨
HTML Decimal Encoding ¨
HTML Hex Encoding ¨
UTF-8 Encoding 0xC2 0xA8
UTF-16 Encoding 0x00A8
UTF-32 Encoding 0x000000A8
C/C++/Java Escape \u00a8

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
Expands On NFKC Yes
Expands On NKFD Yes
Numeric Type None
Numeric Value NaN
Line Break Ambiguous (Alphabetic or Ideographic)
East Asian Width Ambiguous
Case Ignorable Yes
Changes When NFKC Casefolded Yes
NFKC Casefold "SP" U+0020 Space
"̈" U+0308 Combining Diaeresis
NFKC Simple Casefold "SP" U+0020 Space
"̈" U+0308 Combining Diaeresis
Script Common
Script Extensions Common
Indic Syllabic Category Other
Diacritic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other