Unicode Character "ౕ" U+0C55 Telugu Length Mark

Unicode Version 15.1

Summary

The unicode character "ౕ" at code point U+0C55 is Telugu Length Mark. It is a character in the Telugu block and is part of the Telugu script. The character is a nonspacing mark. The UTF-8 encoding of "ౕ" is 0xE0 0xB1 0x95 and the UTF-16 encoding is 0x0C55.

General Properties

Code Point U+0C55
Version Added 1.1
Name Telugu Length Mark
Block Telugu
General Category Nonspacing Mark
Canonical Combining Class CCC84
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding ౕ
HTML Hex Encoding ౕ
UTF-8 Encoding 0xE0 0xB1 0x95
UTF-16 Encoding 0x0C55
UTF-32 Encoding 0x00000C55
C/C++/Java Escape \u0c55

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Script Telugu
Script Extensions Telugu
Indic Syllabic Category Vowel Dependent
Indic Positional Category Top
Indic Conjunct Break Extend
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Other Alphabetic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend