Unicode Character "Ҵ" U+04B4 Cyrillic Capital Ligature Te Tse

Unicode Version 15.1

Summary

The unicode character "Ҵ" at code point U+04B4 is Cyrillic Capital Ligature Te Tse. It is a character in the Cyrillic block and is part of the Cyrillic script. The character is an uppercase letter. The UTF-8 encoding of "Ҵ" is 0xD2 0xB4 and the UTF-16 encoding is 0x04B4.

General Properties

Code Point	U+04B4
Version Added	1.1
Name	Cyrillic Capital Ligature Te Tse
Unicode 1.0 Name	Cyrillic Capital Letter Te Tse
Block	Cyrillic
General Category	Uppercase Letter
Canonical Combining Class	Not Reordered
Bidirectional Class	Left To Right

Encodings

HTML Decimal Encoding	Ҵ
HTML Hex Encoding	Ҵ
UTF-8 Encoding	0xD2 0xB4
UTF-16 Encoding	0x04B4
UTF-32 Encoding	0x000004B4
C/C++/Java Escape	\u04b4

Unicode Properties

NFC Quick Check	Yes
NFD Quick Check	Yes
NFKC Quick Check	Yes
NFKD Quick Check	Yes
Numeric Type	None
Numeric Value	NaN
Line Break	Alphabetic
Uppercase	Yes
Simple Lowercase Code Point	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
Lowercase Code Point	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
Simple Case Folding	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
Case Folding	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
Cased	Yes
Changes When Casefolded	Yes
Changes When Casemapped	Yes
Changes When Lowercased	Yes
Changes When NFKC Casefolded	Yes
NFKC Casefold	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
NFKC Simple Casefold	"ҵ" U+04B5 Cyrillic Small Ligature Te Tse
Script	Cyrillic
Script Extensions	Cyrillic
Indic Syllabic Category	Other
ID Start	Yes
XID Start	Yes
ID Continue	Yes
XID Continue	Yes
Alphabetic	Yes
Vertical Orientation	Rotated
Grapheme Base	Yes
Grapheme Cluster Break	Other
Word Break	Alphabetic letter
Sentence Break	Upper