Unicode Character "ఀ" U+0C00 Telugu Sign Combining Candrabindu Above

Unicode Version 15.1

Summary

The unicode character "ఀ" at code point U+0C00 is Telugu Sign Combining Candrabindu Above. It is a character in the Telugu block and is part of the Telugu script. The character is a nonspacing mark. The UTF-8 encoding of "ఀ" is 0xE0 0xB0 0x80 and the UTF-16 encoding is 0x0C00.

General Properties

Code Point U+0C00
Version Added 7.0
Name Telugu Sign Combining Candrabindu Above
Block Telugu
General Category Nonspacing Mark
Canonical Combining Class Not Reordered
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding ఀ
HTML Hex Encoding ఀ
UTF-8 Encoding 0xE0 0xB0 0x80
UTF-16 Encoding 0x0C00
UTF-32 Encoding 0x00000C00
C/C++/Java Escape \u0c00

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Script Telugu
Script Extensions Telugu
Indic Syllabic Category Bindu
Indic Positional Category Top
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Other Alphabetic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend