Unicode Character "఼" U+0C3C Telugu Sign Nukta

Unicode Version 15.1

Summary

The unicode character "఼" at code point U+0C3C is Telugu Sign Nukta. It is a character in the Telugu block and is part of the Telugu script. The character is a nonspacing mark. The UTF-8 encoding of "఼" is 0xE0 0xB0 0xBC and the UTF-16 encoding is 0x0C3C.

General Properties

Code Point U+0C3C
Version Added 14.0
Name Telugu Sign Nukta
Block Telugu
General Category Nonspacing Mark
Canonical Combining Class Nukta
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding ఼
HTML Hex Encoding ఼
UTF-8 Encoding 0xE0 0xB0 0xBC
UTF-16 Encoding 0x0C3C
UTF-32 Encoding 0x00000C3C
C/C++/Java Escape \u0c3c

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Script Telugu
Script Extensions Telugu
Indic Syllabic Category Nukta
Indic Positional Category Bottom
Indic Conjunct Break Extend
ID Continue Yes
XID Continue Yes
Diacritic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend