Unicode Character "‱" U+2031 Per Ten Thousand Sign

Unicode Version 15.1

Summary

The Unicode character "‱" at code point U+2031 is the Per Ten Thousand Sign. It belongs to the General Punctuation block and to the Common script, and its general category is Other Punctuation (Po). The UTF-8 encoding of "‱" is 0xE2 0x80 0xB1 and the UTF-16 encoding is 0x2031.

General Properties

Code Point U+2031
Version Added 1.1
Name Per Ten Thousand Sign
Block General Punctuation
General Category Other Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class European Terminator

Encodings

HTML Entity &pertenk;
HTML Decimal Encoding &#8241;
HTML Hex Encoding &#x2031;
UTF-8 Encoding 0xE2 0x80 0xB1
UTF-16 Encoding 0x2031
UTF-32 Encoding 0x00002031
C/C++/Java Escape \u2031
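
The encodings listed above can be reproduced with a short Python sketch. It assumes Python 3 and uses only the standard library; the variable names are illustrative and not part of any Unicode specification. Big-endian codecs are chosen so that no byte order mark is prepended.

```python
import html

ch = "\u2031"  # PER TEN THOUSAND SIGN

# Byte-level encodings (big-endian variants, so no BOM is added).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Numeric character references are derived from the code point itself.
decimal_ref = f"&#{ord(ch)};"
hex_ref = f"&#x{ord(ch):X};"

# The named reference is resolved through Python's HTML5 entity table.
named_ok = html.unescape("&pertenk;") == ch

print(utf8.hex(" "), utf16.hex(" "), utf32.hex(" "))
print(decimal_ref, hex_ref, named_ok)
```

Because U+2031 lies in the Basic Multilingual Plane, UTF-16 needs a single 16-bit code unit and no surrogate pair.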

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Postfix Numeric
Script Common
Script Extensions Common
Indic Syllabic Category Other
Pattern Syntax Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other
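
Several of the properties above can be checked against the Unicode database bundled with Python. This is a minimal sketch, assuming Python 3 and its standard `unicodedata` module, which exposes only a subset of the properties listed here (it does not report Line Break, Script, or the segmentation properties). The dictionary names are illustrative.

```python
import unicodedata

ch = "\u2031"  # PER TEN THOUSAND SIGN

props = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # short alias of General Category
    "bidirectional": unicodedata.bidirectional(ch),  # short alias of Bidirectional Class
    "combining": unicodedata.combining(ch),          # 0 means Not Reordered
    "numeric": unicodedata.numeric(ch, None),        # None when there is no numeric value
}

# A character whose quick check is Yes in every form is left unchanged by normalization.
unchanged = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

print(props)
print(unchanged)
```

The database version used depends on the Python release and is available as `unicodedata.unidata_version`, so it may differ from the version named at the top of this page.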