perluniprops

Perl 5 version 14.0 documentation

NAME

perluniprops - Index of Unicode Version 6.0.0 properties in Perl

DESCRIPTION

There are many properties in Unicode, and Perl provides access to almost all of them, as well as some additional extensions and short-cut synonyms.

Almost all of the few properties that aren't accessible through the Perl core are accessible through the modules Unicode::Normalize and Unicode::UCD, and, for Unihan properties, through the CPAN module Unicode::Unihan.
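As a minimal sketch of the module route, the following uses charinfo() from Unicode::UCD and NFD() from Unicode::Normalize. The choice of U+00E9 and the variable names are illustrative only.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);
use Unicode::Normalize qw(NFD);

# charinfo() returns a hash reference describing a single code point.
my $info = charinfo(0x00E9);    # U+00E9 LATIN SMALL LETTER E WITH ACUTE
printf "name=%s category=%s\n", $info->{name}, $info->{category};

# NFD() returns the canonical decomposition of a string.
my $decomposed = NFD("\x{E9}");
printf "NFD gives %d code points: %s\n",
    length($decomposed),
    join ' ', map { sprintf 'U+%04X', ord } split //, $decomposed;
```

Neither module changes what \p{} can match; they expose character data through ordinary function calls.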

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/. More information on the Perl extensions is in Other Properties in perlunicode.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.
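A user-defined property might look like the sketch below, which follows the convention described in perlunicode: a subroutine whose name begins with "In" or "Is" and which returns lines of tab-separated hexadecimal ranges. The name InKana and the ranges chosen are illustrative.

```perl
use strict;
use warnings;

# Illustrative user-defined property. Each line of the returned string is
# one range: start and end code points in hex, separated by a tab.
sub InKana {
    return join '',
        "3040\t309F\n",    # Hiragana block
        "30A0\t30FF\n";    # Katakana block
}

my $hiragana_a = "\x{3042}";    # HIRAGANA LETTER A
my $latin_a    = "a";

my $kana_matches  = $hiragana_a =~ /\p{InKana}/ ? 1 : 0;
my $latin_matches = $latin_a    =~ /\p{InKana}/ ? 1 : 0;
print "hiragana: $kana_matches, latin: $latin_matches\n";
```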

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against, for example, '\p{Script: Greek}' and '\p{Script=Greek}' both mean to match characters whose Script property is Greek.

Single forms, like '\p{Greek}', are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, '\p{Greek}' is just a shortcut for '\p{Script=Greek}'.) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word}. These are also listed in the table.
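The forms described above can be compared directly. This sketch tests one Greek letter against both compound spellings, the single form, and \p{Word}; the variable names are illustrative. Later Perl releases may map the single form \p{Greek} to a slightly different property, so treat the equivalence as applying to the version documented here.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Compound forms: property name, then '=' or ':', then the value.
my $equals_form = $alpha =~ /\p{Script=Greek}/  ? 1 : 0;
my $colon_form  = $alpha =~ /\p{Script: Greek}/ ? 1 : 0;

# Single form: a Perl-defined shortcut.
my $single_form = $alpha =~ /\p{Greek}/ ? 1 : 0;

# \p{Word} is a single form that has no compound equivalent.
my $word_form = $alpha =~ /\p{Word}/ ? 1 : 0;

print "equals=$equals_form colon=$colon_form ",
      "single=$single_form word=$word_form\n";
```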

In parsing these constructs, Perl always ignores upper/lower case differences everywhere within the {braces}. Thus '\p{Greek}' means the same thing as '\p{greek}'. But note that changing the case of the 'p' or 'P' before the left brace completely changes the meaning of the construct, from "match" (for '\p{}') to "doesn't match" (for '\P{}'). Casing in this document is for improved legibility.
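A short sketch of the case rules: the spelling inside the braces does not matter, while the case of the letter before the brace inverts the test. Variable names are illustrative.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored.
my $mixed_case = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $lower_case = $alpha =~ /\p{greek}/ ? 1 : 0;
my $upper_case = $alpha =~ /\p{GREEK}/ ? 1 : 0;

# Case of the 'p' is significant: \P{} matches what \p{} does not.
my $negated = $alpha =~ /\P{Greek}/ ? 1 : 0;

print "mixed=$mixed_case lower=$lower_case ",
      "upper=$upper_case negated=$negated\n";
```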

White space, hyphens, and underscores are also normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But a 'T' at the beginning of an entry in the table below means that tighter (stricter) rules are used for that entry:

  • Single form (\p{name}) tighter rules:

    White space, hyphens, and underscores ARE significant except for:

    • white space adjacent to a non-word character
    • underscores separating digits in numbers

    That means, for example, that you can freely add or remove white space adjacent to (but within) the braces without affecting the meaning.

  • Compound form (\p{name=value} or \p{name:value}) tighter rules:

    The tighter rules given above for the single form apply to everything to the right of the colon or equals; the looser rules still apply to everything to the left.

    That means, for example, that you can freely add or remove white space adjacent to (but within) the braces and the colon or equal sign.
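The looser rules can be sketched as follows, using entries that carry no 'T' flag. Each spelling is expected to name the same property; the hash and its keys are illustrative, and the tighter rules are not exercised here.

```perl
use strict;
use warnings;

my $old_italic_a = "\x{10300}";    # OLD ITALIC LETTER A
my $alpha        = "\x{3B1}";      # GREEK SMALL LETTER ALPHA

# Under the looser rules, underscores, hyphens, and white space inside the
# braces are ignored, with or without the /x modifier.
my %loose = (
    underscore => ($old_italic_a =~ /\p{Old_Italic}/ ? 1 : 0),
    run_on     => ($old_italic_a =~ /\p{OldItalic}/  ? 1 : 0),
    hyphen     => ($old_italic_a =~ /\p{Old-Italic}/ ? 1 : 0),
    space      => ($old_italic_a =~ /\p{old italic}/ ? 1 : 0),
    padded     => ($alpha =~ /\p{ Script = Greek }/  ? 1 : 0),
);

print "$_: $loose{$_}\n" for sort keys %loose;
```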

Some properties are considered obsolete by Unicode, but are still available. There are several varieties of obsolescence:

  • Obsolete

    Properties marked with an 'O' in the table are considered obsolete.

  • Stabilized

    Obsolete properties may be stabilized. Such a determination does not indicate that the property should or should not be used; instead it is a declaration that the property will not be maintained nor extended for newly encoded characters. Such properties are marked with an 'S' in the table.

  • Deprecated

    An obsolete property may be deprecated, perhaps because its original intent has been replaced by another property, or because its specification was somehow defective. This means that its use is strongly discouraged, so much so that a warning will be issued if used, unless the regular expression is in the scope of a no warnings 'deprecated' statement. A 'D' flags each such entry in the table, and the entry there for the longest, most descriptive version of the property will give the reason it is deprecated, and perhaps advice. Perl may issue such a warning, even for properties that aren't officially deprecated by Unicode, when there used to be characters or code points that were matched by them, but no longer. This is to warn you that your program may not work like it did on earlier Unicode releases.

    A deprecated property may be made unavailable in a future Perl version, so it is best to move away from such properties.

Some Perl extensions are present for backwards compatibility and are discouraged from being used, but not obsolete. An 'X' flags each such entry in the table.

Matches in the Block property have shortcuts that begin with 'In_'. For example, \p{Block=Latin1} can be written as \p{In_Latin1}. For backward compatibility, if there is no conflict with another shortcut, these may also be written as \p{Latin1} or \p{Is_Latin1}. But, N.B., there are numerous such conflicting shortcuts. Use of these forms for Block is discouraged, and they are flagged as such, not only because of the potential confusion as to what is meant, but also because a later release of Unicode may preempt the shortcut, and your program would no longer be correct. Use the 'In_' form instead to avoid this, or even more clearly, use the compound form, e.g., \p{blk:latin1}. See Blocks in perlunicode for more information about this.
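The difference between a Block match and a Script match can be seen in a sketch like the one below. U+03E2 is chosen because it sits in the Greek and Coptic block but belongs to the Coptic script; the variable names are illustrative.

```perl
use strict;
use warnings;

my $e_acute = "\x{E9}";     # LATIN SMALL LETTER E WITH ACUTE
my $shei    = "\x{3E2}";    # COPTIC CAPITAL LETTER SHEI

# Three spellings of the same Block match.
my $block_compound = $e_acute =~ /\p{Block=Latin1}/ ? 1 : 0;
my $block_in_form  = $e_acute =~ /\p{In_Latin1}/    ? 1 : 0;
my $block_short    = $e_acute =~ /\p{blk:latin1}/   ? 1 : 0;

# \p{InGreek} tests the block; \p{Greek} tests the script.
my $in_greek_block  = $shei =~ /\p{InGreek}/ ? 1 : 0;
my $in_greek_script = $shei =~ /\p{Greek}/   ? 1 : 0;

print "block forms: $block_compound $block_in_form $block_short\n";
print "U+03E2 block=$in_greek_block script=$in_greek_script\n";
```

Spelling out the 'In_' prefix or the compound form keeps the intent unambiguous for characters like this one.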

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. It shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.
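The counts in parentheses can be reproduced by brute force. The sketch below walks every code point and counts matches; count_matching() is a hypothetical helper, and the two properties used are ones whose counts are not expected to change between Unicode releases. Counts for most other properties depend on the Unicode version your Perl was built with.

```perl
use strict;
use warnings;
no warnings 'utf8';    # the loop visits surrogates and noncharacters

# Hypothetical helper: count the code points that match a compiled pattern.
sub count_matching {
    my ($regex) = @_;
    my $count = 0;
    for my $code_point (0 .. 0x10FFFF) {
        $count++ if chr($code_point) =~ $regex;
    }
    return $count;
}

my $ahex_count  = count_matching(qr/\p{ASCII_Hex_Digit}/);
my $ascii_count = count_matching(qr/\p{ASCII}/);

print "ASCII_Hex_Digit: $ahex_count\n";    # the table lists (22)
print "ASCII: $ascii_count\n";             # the table lists (128)
```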

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation

  (/i= other_property)

in the second column. Under case-insensitive matching they match the same code points as the property "other_property".
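As a sketch of this effect, consider \p{Lu} (Uppercase_Letter). Its entry falls outside the part of the table shown here, so the behaviour below is stated as an assumption: under "/i" it is expected to match any cased letter.

```perl
use strict;
use warnings;

my $lower_a = "a";
my $digit   = "1";

# Without /i, a lowercase letter is not an Uppercase_Letter.
my $case_sensitive = $lower_a =~ /\p{Lu}/ ? 1 : 0;

# With /i, \p{Lu} is assumed to behave like General_Category=Cased_Letter.
my $case_insensitive = $lower_a =~ /\p{Lu}/i ? 1 : 0;

# Characters without case are unaffected by /i.
my $digit_insensitive = $digit =~ /\p{Lu}/i ? 1 : 0;

print "sensitive=$case_sensitive insensitive=$case_insensitive ",
      "digit=$digit_insensitive\n";
```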

There is no description given for most non-Perl defined properties (See http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  \p{Gc: *}                    \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  \p{Is_*}                     \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar}. '*' here is restricted to something not beginning with an underscore.
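A sketch of the two wildcard entries: 'Gc' stands in for 'General_Category', and the 'Is' prefix can be added to a single form. Only single forms with the prefix are exercised here; variable names are illustrative.

```perl
use strict;
use warnings;

my $capital_a = "A";
my $alpha     = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# 'Gc' is a synonym for 'General_Category'.
my $gc_short = $capital_a =~ /\p{Gc: Lu}/               ? 1 : 0;
my $gc_long  = $capital_a =~ /\p{General_Category: Lu}/ ? 1 : 0;

# \p{Foo}, \p{Is_Foo}, and \p{IsFoo} all mean the same thing.
my $plain     = $alpha =~ /\p{Greek}/    ? 1 : 0;
my $is_prefix = $alpha =~ /\p{Is_Greek}/ ? 1 : 0;
my $is_run_on = $alpha =~ /\p{IsGreek}/  ? 1 : 0;

print "gc: $gc_short $gc_long; greek: $plain $is_prefix $is_run_on\n";
```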

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary, and they have all their values spelled out without using this wild card, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.
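The value synonyms for a binary property can be sketched with ASCII_Hex_Digit. Every spelling in each list below is expected to behave identically; the list and variable names are illustrative.

```perl
use strict;
use warnings;

my $hex_digit = "f";
my $not_hex   = "g";

# 'Y', 'Yes', 'T', and 'True' are synonyms, as is the bare single form.
my @yes_forms = (
    ($hex_digit =~ /\p{AHex}/      ? 1 : 0),
    ($hex_digit =~ /\p{AHex=Y}/    ? 1 : 0),
    ($hex_digit =~ /\p{AHex=Yes}/  ? 1 : 0),
    ($hex_digit =~ /\p{AHex=T}/    ? 1 : 0),
    ($hex_digit =~ /\p{AHex=True}/ ? 1 : 0),
);

# 'N', 'No', 'F', and 'False' are synonyms, as is the negated single form.
my @no_forms = (
    ($not_hex =~ /\P{AHex}/                  ? 1 : 0),
    ($not_hex =~ /\p{AHex=N}/                ? 1 : 0),
    ($not_hex =~ /\p{AHex=No}/               ? 1 : 0),
    ($not_hex =~ /\p{AHex=F}/                ? 1 : 0),
    ($not_hex =~ /\p{ASCII_Hex_Digit: False}/ ? 1 : 0),
);

print "yes forms: @yes_forms\n";
print "no forms:  @no_forms\n";
```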

Note that all non-essential underscores are removed in the display of the short names below.

Summary legend:

  • * is a wild-card
  • (\d+) in the info column gives the number of code points matched by this property.
  • D means this is deprecated.
  • O means this is obsolete.
  • S means this is stabilized.
  • T means tighter (stricter) name matching applies.
  • X means use of this form is discouraged.

    NAME  INFO

  X \p{Aegean_Numbers}  \p{Block=Aegean_Numbers} (64)
  T \p{Age: 1.1}  Code point's usage introduced in version 1.1 (33_979)
  T \p{Age: 2.0}  Code point's usage was introduced in version 2.0; See also Property 'Present_In' (144_521)
  T \p{Age: 2.1}  Code point's usage was introduced in version 2.1; See also Property 'Present_In' (2)
  T \p{Age: 3.0}  Code point's usage was introduced in version 3.0; See also Property 'Present_In' (10_307)
  T \p{Age: 3.1}  Code point's usage was introduced in version 3.1; See also Property 'Present_In' (44_978)
  T \p{Age: 3.2}  Code point's usage was introduced in version 3.2; See also Property 'Present_In' (1016)
  T \p{Age: 4.0}  Code point's usage was introduced in version 4.0; See also Property 'Present_In' (1226)
  T \p{Age: 4.1}  Code point's usage was introduced in version 4.1; See also Property 'Present_In' (1273)
  T \p{Age: 5.0}  Code point's usage was introduced in version 5.0; See also Property 'Present_In' (1369)
  T \p{Age: 5.1}  Code point's usage was introduced in version 5.1; See also Property 'Present_In' (1624)
  T \p{Age: 5.2}  Code point's usage was introduced in version 5.2; See also Property 'Present_In' (6648)
  T \p{Age: 6.0}  Code point's usage was introduced in version 6.0; See also Property 'Present_In' (2088)
    \p{Age: Unassigned}  Code point's usage has not been assigned in any Unicode release thus far. (865_081)
    \p{AHex}  \p{ASCII_Hex_Digit} (= \p{ASCII_Hex_Digit=Y}) (22)
    \p{AHex: *}  \p{ASCII_Hex_Digit: *}
  X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (128)
    \p{All}  \p{Any} (1_114_112)
    \p{Alnum}  Alphabetic and (Decimal) Numeric (101_959)
    \p{Alpha}  \p{Alphabetic=Y} (101_539)
    \p{Alpha: *}  \p{Alphabetic: *}
    \p{Alphabetic}  \p{Alpha} (= \p{Alphabetic=Y}) (101_539)
    \p{Alphabetic: N*}  (Short: \p{Alpha=N}, \P{Alpha}) (1_012_573)
    \p{Alphabetic: Y*}  (Short: \p{Alpha=Y}, \p{Alpha}) (101_539)
  X \p{Alphabetic_Presentation_Forms}  \p{Block=Alphabetic_Presentation_Forms} (80)
  X \p{Ancient_Greek_Musical_Notation}  \p{Block=Ancient_Greek_Musical_Notation} (80)
  X \p{Ancient_Greek_Numbers}  \p{Block=Ancient_Greek_Numbers} (80)
  X \p{Ancient_Symbols}  \p{Block=Ancient_Symbols} (64)
    \p{Any}  [\x{0000}-\x{10FFFF}] (1_114_112)
    \p{Arab}  \p{Arabic} (= \p{Script=Arabic}) (NOT \p{Block=Arabic}) (1051)
    \p{Arabic}  \p{Script=Arabic} (Short: \p{Arab}; NOT \p{Block=Arabic}) (1051)
  X \p{Arabic_Presentation_Forms_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
  X \p{Arabic_Presentation_Forms_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
  X \p{Arabic_Supplement}  \p{Block=Arabic_Supplement} (48)
    \p{Armenian}  \p{Script=Armenian} (Short: \p{Armn}; NOT \p{Block=Armenian}) (90)
    \p{Armi}  \p{Imperial_Aramaic} (= \p{Script=Imperial_Aramaic}) (NOT \p{Block=Imperial_Aramaic}) (31)
    \p{Armn}  \p{Armenian} (= \p{Script=Armenian}) (NOT \p{Block=Armenian}) (90)
  X \p{Arrows}  \p{Block=Arrows} (112)
    \p{ASCII}  \p{Block=Basic_Latin} [[:ASCII:]] (128)
    \p{ASCII_Hex_Digit}  \p{ASCII_Hex_Digit=Y} (Short: \p{AHex}) (22)
    \p{ASCII_Hex_Digit: N*}  (Short: \p{AHex=N}, \P{AHex}) (1_114_090)
    \p{ASCII_Hex_Digit: Y*}  (Short: \p{AHex=Y}, \p{AHex}) (22)
    \p{Assigned}  All assigned code points (248_965)
    \p{Avestan}  \p{Script=Avestan} (Short: \p{Avst}; NOT \p{Block=Avestan}) (61)
    \p{Avst}  \p{Avestan} (= \p{Script=Avestan}) (NOT \p{Block=Avestan}) (61)
    \p{Bali}  \p{Balinese} (= \p{Script=Balinese}) (NOT \p{Block=Balinese}) (121)
    \p{Balinese}  \p{Script=Balinese} (Short: \p{Bali}; NOT \p{Block=Balinese}) (121)
    \p{Bamu}  \p{Bamum} (= \p{Script=Bamum}) (NOT \p{Block=Bamum}) (657)
    \p{Bamum}  \p{Script=Bamum} (Short: \p{Bamu}; NOT \p{Block=Bamum}) (657)
  X \p{Bamum_Supplement}  \p{Block=Bamum_Supplement} (576)
  X \p{Basic_Latin}  \p{ASCII} (= \p{Block=Basic_Latin}) (128)
    \p{Batak}  \p{Script=Batak} (Short: \p{Batk}; NOT \p{Block=Batak}) (56)
    \p{Batk}  \p{Batak} (= \p{Script=Batak}) (NOT \p{Block=Batak}) (56)
    \p{Bc: *}  \p{Bidi_Class: *}
    \p{Beng}  \p{Bengali} (= \p{Script=Bengali}) (NOT \p{Block=Bengali}) (92)
    \p{Bengali}  \p{Script=Bengali} (Short: \p{Beng}; NOT \p{Block=Bengali}) (92)
    \p{Bidi_C}  \p{Bidi_Control} (= \p{Bidi_Control=Y}) (7)
    \p{Bidi_C: *}  \p{Bidi_Control: *}
    \p{Bidi_Class: AL}  \p{Bidi_Class=Arabic_Letter} (1116)
    \p{Bidi_Class: AN}  \p{Bidi_Class=Arabic_Number} (48)
    \p{Bidi_Class: Arabic_Letter}  (Short: \p{Bc=AL}) (1116)
    \p{Bidi_Class: Arabic_Number}  (Short: \p{Bc=AN}) (48)
    \p{Bidi_Class: B}  \p{Bidi_Class=Paragraph_Separator} (7)
    \p{Bidi_Class: BN}  \p{Bidi_Class=Boundary_Neutral} (4015)
    \p{Bidi_Class: Boundary_Neutral}  (Short: \p{Bc=BN}) (4015)
    \p{Bidi_Class: Common_Separator}  (Short: \p{Bc=CS}) (15)
    \p{Bidi_Class: CS}  \p{Bidi_Class=Common_Separator} (15)
    \p{Bidi_Class: EN}  \p{Bidi_Class=European_Number} (131)
    \p{Bidi_Class: ES}  \p{Bidi_Class=European_Separator} (12)
    \p{Bidi_Class: ET}  \p{Bidi_Class=European_Terminator} (64)
    \p{Bidi_Class: European_Number}  (Short: \p{Bc=EN}) (131)
    \p{Bidi_Class: European_Separator}  (Short: \p{Bc=ES}) (12)
    \p{Bidi_Class: European_Terminator}  (Short: \p{Bc=ET}) (64)
    \p{Bidi_Class: L}  \p{Bidi_Class=Left_To_Right} (1_098_619)
    \p{Bidi_Class: Left_To_Right}  (Short: \p{Bc=L}) (1_098_619)
    \p{Bidi_Class: Left_To_Right_Embedding}  (Short: \p{Bc=LRE}) (1)
    \p{Bidi_Class: Left_To_Right_Override}  (Short: \p{Bc=LRO}) (1)
    \p{Bidi_Class: LRE}  \p{Bidi_Class=Left_To_Right_Embedding} (1)
    \p{Bidi_Class: LRO}  \p{Bidi_Class=Left_To_Right_Override} (1)
    \p{Bidi_Class: Nonspacing_Mark}  (Short: \p{Bc=NSM}) (1209)
    \p{Bidi_Class: NSM}  \p{Bidi_Class=Nonspacing_Mark} (1209)
    \p{Bidi_Class: ON}  \p{Bidi_Class=Other_Neutral} (4412)
    \p{Bidi_Class: Other_Neutral}  (Short: \p{Bc=ON}) (4412)
    \p{Bidi_Class: Paragraph_Separator}  (Short: \p{Bc=B}) (7)
    \p{Bidi_Class: PDF}  \p{Bidi_Class=Pop_Directional_Format} (1)
    \p{Bidi_Class: Pop_Directional_Format}  (Short: \p{Bc=PDF}) (1)
    \p{Bidi_Class: R}  \p{Bidi_Class=Right_To_Left} (4438)
    \p{Bidi_Class: Right_To_Left}  (Short: \p{Bc=R}) (4438)
    \p{Bidi_Class: Right_To_Left_Embedding}  (Short: \p{Bc=RLE}) (1)
    \p{Bidi_Class: Right_To_Left_Override}  (Short: \p{Bc=RLO}) (1)
    \p{Bidi_Class: RLE}  \p{Bidi_Class=Right_To_Left_Embedding} (1)
    \p{Bidi_Class: RLO}  \p{Bidi_Class=Right_To_Left_Override} (1)
    \p{Bidi_Class: S}  \p{Bidi_Class=Segment_Separator} (3)
    \p{Bidi_Class: Segment_Separator}  (Short: \p{Bc=S}) (3)
    \p{Bidi_Class: White_Space}  (Short: \p{Bc=WS}) (18)
    \p{Bidi_Class: WS}  \p{Bidi_Class=White_Space} (18)
    \p{Bidi_Control}  \p{Bidi_Control=Y} (Short: \p{BidiC}) (7)
    \p{Bidi_Control: N*}  (Short: \p{BidiC=N}, \P{BidiC}) (1_114_105)
    \p{Bidi_Control: Y*}  (Short: \p{BidiC=Y}, \p{BidiC}) (7)
    \p{Bidi_M}  \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y}) (543)
    \p{Bidi_M: *}  \p{Bidi_Mirrored: *}
    \p{Bidi_Mirrored}  \p{Bidi_Mirrored=Y} (Short: \p{BidiM}) (543)
    \p{Bidi_Mirrored: N*}  (Short: \p{BidiM=N}, \P{BidiM}) (1_113_569)
    \p{Bidi_Mirrored: Y*}  (Short: \p{BidiM=Y}, \p{BidiM}) (543)
    \p{Blank}  \h, Horizontal white space (19)
    \p{Blk: *}  \p{Block: *}
    \p{Block: Aegean_Numbers}  (Single: \p{InAegeanNumbers}) (64)
    \p{Block: Alchemical_Symbols}  (Single: \p{InAlchemicalSymbols}) (128)
    \p{Block: Alphabetic_Presentation_Forms}  (Single: \p{InAlphabeticPresentationForms}) (80)
    \p{Block: Ancient_Greek_Musical_Notation}  (Single: \p{InAncientGreekMusicalNotation}) (80)
    \p{Block: Ancient_Greek_Numbers}  (Single: \p{InAncientGreekNumbers}) (80)
    \p{Block: Ancient_Symbols}  (Single: \p{InAncientSymbols}) (64)
    \p{Block: Arabic}  (Single: \p{InArabic}; NOT \p{Arabic} NOR \p{Is_Arabic}) (256)
    \p{Block: Arabic_Presentation_Forms_A}  (Single: \p{InArabicPresentationFormsA}) (688)
    \p{Block: Arabic_Presentation_Forms_B}  (Single: \p{InArabicPresentationFormsB}) (144)
    \p{Block: Arabic_Supplement}  (Single: \p{InArabicSupplement}) (48)
    \p{Block: Armenian}  (Single: \p{InArmenian}; NOT \p{Armenian} NOR \p{Is_Armenian}) (96)
    \p{Block: Arrows}  (Single: \p{InArrows}) (112)
    \p{Block: ASCII}  \p{Block=Basic_Latin} (128)
    \p{Block: Avestan}  (Single: \p{InAvestan}; NOT \p{Avestan} NOR \p{Is_Avestan}) (64)
    \p{Block: Balinese}  (Single: \p{InBalinese}; NOT \p{Balinese} NOR \p{Is_Balinese}) (128)
    \p{Block: Bamum}  (Single: \p{InBamum}; NOT \p{Bamum} NOR \p{Is_Bamum}) (96)
    \p{Block: Bamum_Supplement}  (Single: \p{InBamumSupplement}) (576)
    \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
    \p{Block: Batak}  (Single: \p{InBatak}; NOT \p{Batak} NOR \p{Is_Batak}) (64)
    \p{Block: Bengali}  (Single: \p{InBengali}; NOT \p{Bengali} NOR \p{Is_Bengali}) (128)
    \p{Block: Block_Elements}  (Single: \p{InBlockElements}) (32)
    \p{Block: Bopomofo}  (Single: \p{InBopomofo}; NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) (48)
    \p{Block: Bopomofo_Extended}  (Single: \p{InBopomofoExtended}) (32)
    \p{Block: Box_Drawing}  (Single: \p{InBoxDrawing}) (128)
    \p{Block: Brahmi}  (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128)
    \p{Block: Braille_Patterns}  (Single: \p{InBraillePatterns}) (256)
    \p{Block: Buginese}  (Single: \p{InBuginese}; NOT \p{Buginese} NOR \p{Is_Buginese}) (32)
    \p{Block: Buhid}  (Single: \p{InBuhid}; NOT \p{Buhid} NOR \p{Is_Buhid}) (32)
    \p{Block: Byzantine_Musical_Symbols}  (Single: \p{InByzantineMusicalSymbols}) (256)
    \p{Block: Canadian_Syllabics}  \p{Block=Unified_Canadian_Aboriginal_Syllabics} (640)
    \p{Block: Carian}  (Single: \p{InCarian}; NOT \p{Carian} NOR \p{Is_Carian}) (64)
    \p{Block: Cham}  (Single: \p{InCham}; NOT \p{Cham} NOR \p{Is_Cham}) (96)
    \p{Block: Cherokee}  (Single: \p{InCherokee}; NOT \p{Cherokee} NOR \p{Is_Cherokee}) (96)
    \p{Block: CJK_Compatibility}  (Single: \p{InCJKCompatibility}) (256)
    \p{Block: CJK_Compatibility_Forms}  (Single: \p{InCJKCompatibilityForms}) (32)
    \p{Block: CJK_Compatibility_Ideographs}  (Single: \p{InCJKCompatibilityIdeographs}) (512)
    \p{Block: CJK_Compatibility_Ideographs_Supplement}  (Single: \p{InCJKCompatibilityIdeographsSupplement}) (544)
    \p{Block: CJK_Radicals_Supplement}  (Single: \p{InCJKRadicalsSupplement}) (128)
    \p{Block: CJK_Strokes}  (Single: \p{InCJKStrokes}) (48)
    \p{Block: CJK_Symbols_And_Punctuation}  (Single: \p{InCJKSymbolsAndPunctuation}) (64)
    \p{Block: CJK_Unified_Ideographs}  (Single: \p{InCJKUnifiedIdeographs}) (20_992)
    \p{Block: CJK_Unified_Ideographs_Extension_A}  (Single: \p{InCJKUnifiedIdeographsExtensionA}) (6592)
    \p{Block: CJK_Unified_Ideographs_Extension_B}  (Single: \p{InCJKUnifiedIdeographsExtensionB}) (42_720)
    \p{Block: CJK_Unified_Ideographs_Extension_C}  (Single: \p{InCJKUnifiedIdeographsExtensionC}) (4160)
    \p{Block: CJK_Unified_Ideographs_Extension_D}  (Single: \p{InCJKUnifiedIdeographsExtensionD}) (224)
    \p{Block: Combining_Diacritical_Marks}  (Single: \p{InCombiningDiacriticalMarks}) (112)
    \p{Block: Combining_Diacritical_Marks_For_Symbols}  (Short: \p{Blk=CombiningMarksForSymbols}, \p{InCombiningMarksForSymbols}) (48)
    \p{Block: Combining_Diacritical_Marks_Supplement}  (Single: \p{InCombiningDiacriticalMarksSupplement}) (64)
    \p{Block: Combining_Half_Marks}  (Single: \p{InCombiningHalfMarks}) (16)
    \p{Block: Combining_Marks_For_Symbols}  \p{Block=Combining_Diacritical_Marks_For_Symbols} (48)
    \p{Block: Common_Indic_Number_Forms}  (Single: \p{InCommonIndicNumberForms}) (16)
    \p{Block: Control_Pictures}  (Single: \p{InControlPictures}) (64)
    \p{Block: Coptic}  (Single: \p{InCoptic}; NOT \p{Coptic} NOR \p{Is_Coptic}) (128)
    \p{Block: Counting_Rod_Numerals}  (Single: \p{InCountingRodNumerals}) (32)
    \p{Block: Cuneiform}  (Single: \p{InCuneiform}; NOT \p{Cuneiform} NOR \p{Is_Cuneiform}) (1024)
    \p{Block: Cuneiform_Numbers_And_Punctuation}  (Single: \p{InCuneiformNumbersAndPunctuation}) (128)
    \p{Block: Currency_Symbols}  (Single: \p{InCurrencySymbols}) (48)
    \p{Block: Cypriot_Syllabary}  (Single: \p{InCypriotSyllabary}) (64)
    \p{Block: Cyrillic}  (Single: \p{InCyrillic}; NOT \p{Cyrillic} NOR \p{Is_Cyrillic}) (256)
    \p{Block: Cyrillic_Extended_A}  (Single: \p{InCyrillicExtendedA}) (32)
    \p{Block: Cyrillic_Extended_B}  (Single: \p{InCyrillicExtendedB}) (96)
    \p{Block: Cyrillic_Supplement}  (Single: \p{InCyrillicSupplement}) (48)
    \p{Block: Cyrillic_Supplementary}  \p{Block=Cyrillic_Supplement} (48)
    \p{Block: Deseret}  (Single: \p{InDeseret}) (80)
    \p{Block: Devanagari}  (Single: \p{InDevanagari}; NOT \p{Devanagari} NOR \p{Is_Devanagari}) (128)
    \p{Block: Devanagari_Extended}  (Single: \p{InDevanagariExtended}) (32)
    \p{Block: Dingbats}  (Single: \p{InDingbats}) (192)
    \p{Block: Domino_Tiles}  (Single: \p{InDominoTiles}) (112)
    \p{Block: Egyptian_Hieroglyphs}  (Single: \p{InEgyptianHieroglyphs}; NOT \p{Egyptian_Hieroglyphs} NOR \p{Is_Egyptian_Hieroglyphs}) (1072)
    \p{Block: Emoticons}  (Single: \p{InEmoticons}) (80)
    \p{Block: Enclosed_Alphanumeric_Supplement}  (Single: \p{InEnclosedAlphanumericSupplement}) (256)
    \p{Block: Enclosed_Alphanumerics}  (Single: \p{InEnclosedAlphanumerics}) (160)
    \p{Block: Enclosed_CJK_Letters_And_Months}  (Single: \p{InEnclosedCJKLettersAndMonths}) (256)
    \p{Block: Enclosed_Ideographic_Supplement}  (Single: \p{InEnclosedIdeographicSupplement}) (256)
    \p{Block: Ethiopic}  (Single: \p{InEthiopic}; NOT \p{Ethiopic} NOR \p{Is_Ethiopic}) (384)
    \p{Block: Ethiopic_Extended}  (Single: \p{InEthiopicExtended}) (96)
    \p{Block: Ethiopic_Extended_A}  (Single: \p{InEthiopicExtendedA}) (48)
    \p{Block: Ethiopic_Supplement}  (Single: \p{InEthiopicSupplement}) (32)
    \p{Block: General_Punctuation}  (Single: \p{InGeneralPunctuation}) (112)
    \p{Block: Geometric_Shapes}  (Single: \p{InGeometricShapes}) (96)
    \p{Block: Georgian}  (Single: \p{InGeorgian}; NOT \p{Georgian} NOR \p{Is_Georgian}) (96)
    \p{Block: Georgian_Supplement}  (Single: \p{InGeorgianSupplement}) (48)
    \p{Block: Glagolitic}  (Single: \p{InGlagolitic}; NOT \p{Glagolitic} NOR \p{Is_Glagolitic}) (96)
    \p{Block: Gothic}  (Single: \p{InGothic}; NOT \p{Gothic} NOR \p{Is_Gothic}) (32)
    \p{Block: Greek}  \p{Block=Greek_And_Coptic} (NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_And_Coptic}  (Short: \p{Blk=Greek}, \p{InGreek}; NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_Extended}  (Single: \p{InGreekExtended}) (256)
    \p{Block: Gujarati}  (Single: \p{InGujarati}; NOT \p{Gujarati} NOR \p{Is_Gujarati}) (128)
    \p{Block: Gurmukhi}  (Single: \p{InGurmukhi}; NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi}) (128)
    \p{Block: Halfwidth_And_Fullwidth_Forms}  (Single: \p{InHalfwidthAndFullwidthForms}) (240)
    \p{Block: Hangul_Compatibility_Jamo}  (Single: \p{InHangulCompatibilityJamo}) (96)
    \p{Block: Hangul_Jamo}  (Single: \p{InHangulJamo}) (256)
    \p{Block: Hangul_Jamo_Extended_A}  (Single: \p{InHangulJamoExtendedA}) (32)
    \p{Block: Hangul_Jamo_Extended_B}  (Single: \p{InHangulJamoExtendedB}) (80)
    \p{Block: Hangul_Syllables}  (Single: \p{InHangulSyllables}) (11_184)
    \p{Block: Hanunoo}  (Single: \p{InHanunoo}; NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32)
    \p{Block: Hebrew}  (Single: \p{InHebrew}; NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112)
    \p{Block: High_Private_Use_Surrogates}  (Single: \p{InHighPrivateUseSurrogates}) (128)
    \p{Block: High_Surrogates}  (Single: \p{InHighSurrogates}) (896)
    \p{Block: Hiragana}  (Single: \p{InHiragana}; NOT \p{Hiragana} NOR \p{Is_Hiragana}) (96)
    \p{Block: Ideographic_Description_Characters}  (Single: \p{InIdeographicDescriptionCharacters}) (16)
    \p{Block: Imperial_Aramaic}  (Single: \p{InImperialAramaic}; NOT \p{Imperial_Aramaic} NOR \p{Is_Imperial_Aramaic}) (32)
    \p{Block: Inscriptional_Pahlavi}  (Single: \p{InInscriptionalPahlavi}; NOT \p{Inscriptional_Pahlavi} NOR \p{Is_Inscriptional_Pahlavi}) (32)
    \p{Block: Inscriptional_Parthian}  (Single: \p{InInscriptionalParthian}; NOT \p{Inscriptional_Parthian} NOR \p{Is_Inscriptional_Parthian}) (32)
    \p{Block: IPA_Extensions}  (Single: \p{InIPAExtensions}) (96)
    \p{Block: Javanese}  (Single: \p{InJavanese}; NOT \p{Javanese} NOR \p{Is_Javanese}) (96)
    \p{Block: Kaithi}  (Single: \p{InKaithi}; NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80)
    \p{Block: Kana_Supplement}  (Single: \p{InKanaSupplement}) (256)
    \p{Block: Kanbun}  (Single: \p{InKanbun}) (16)
    \p{Block: Kangxi_Radicals}  (Single: \p{InKangxiRadicals}) (224)
    \p{Block: Kannada}  (Single: \p{InKannada}; NOT \p{Kannada} NOR \p{Is_Kannada}) (128)
    \p{Block: Katakana}  (Single: \p{InKatakana}; NOT \p{Katakana} NOR \p{Is_Katakana}) (96)
    \p{Block: Katakana_Phonetic_Extensions}  (Single: \p{InKatakanaPhoneticExtensions}) (16)
    \p{Block: Kayah_Li}  (Single: \p{InKayahLi}) (48)
    \p{Block: Kharoshthi}  (Single: \p{InKharoshthi}; NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi}) (96)
    \p{Block: Khmer}  (Single: \p{InKhmer}; NOT \p{Khmer} NOR \p{Is_Khmer}) (128)
    \p{Block: Khmer_Symbols}  (Single: \p{InKhmerSymbols}) (32)
    \p{Block: Lao}  (Single: \p{InLao}; NOT \p{Lao} NOR \p{Is_Lao}) (128)
    \p{Block: Latin_1}  \p{Block=Latin_1_Supplement} (128)
    \p{Block: Latin_1_Supplement}  (Short: \p{Blk=Latin1}, \p{InLatin1}) (128)
    \p{Block: Latin_Extended_A}  (Single: \p{InLatinExtendedA}) (128)
    \p{Block: Latin_Extended_Additional}  (Single: \p{InLatinExtendedAdditional}) (256)
    \p{Block: Latin_Extended_B}  (Single: \p{InLatinExtendedB}) (208)
    \p{Block: Latin_Extended_C}  (Single: \p{InLatinExtendedC}) (32)
    \p{Block: Latin_Extended_D}  (Single: \p{InLatinExtendedD}) (224)
    \p{Block: Lepcha}  (Single: \p{InLepcha}; NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80)
    \p{Block: Letterlike_Symbols}  (Single: \p{InLetterlikeSymbols}) (80)
    \p{Block: Limbu}  (Single: \p{InLimbu}; NOT \p{Limbu} NOR \p{Is_Limbu}) (80)
    \p{Block: Linear_B_Ideograms}  (Single: \p{InLinearBIdeograms}) (128)
    \p{Block: Linear_B_Syllabary}  (Single: \p{InLinearBSyllabary}) (128)
    \p{Block: Lisu}  (Single: \p{InLisu}) (48)
    \p{Block: Low_Surrogates}  (Single: \p{InLowSurrogates}) (1024)
    \p{Block: Lycian}  (Single: \p{InLycian}; NOT \p{Lycian} NOR \p{Is_Lycian}) (32)
    \p{Block: Lydian}  (Single: \p{InLydian}; NOT \p{Lydian} NOR \p{Is_Lydian}) (32)
    \p{Block: Mahjong_Tiles}  (Single: \p{InMahjongTiles}) (48)
    \p{Block: Malayalam}  (Single: \p{InMalayalam}; NOT \p{Malayalam} NOR \p{Is_Malayalam}) (128)
    \p{Block: Mandaic}  (Single: \p{InMandaic}; NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32)
    \p{Block: Mathematical_Alphanumeric_Symbols}  (Single: \p{InMathematicalAlphanumericSymbols}) (1024)
    \p{Block: Mathematical_Operators}  (Single: \p{InMathematicalOperators}) (256)
    \p{Block: Meetei_Mayek}  (Single: \p{InMeeteiMayek}; NOT \p{Meetei_Mayek} NOR \p{Is_Meetei_Mayek}) (64)
    \p{Block: Miscellaneous_Mathematical_Symbols_A}  (Single: \p{InMiscellaneousMathematicalSymbolsA}) (48)
    \p{Block: Miscellaneous_Mathematical_Symbols_B}  (Single: \p{InMiscellaneousMathematicalSymbolsB}) (128)
    \p{Block: Miscellaneous_Symbols}  (Single: \p{InMiscellaneousSymbols}) (256)
    \p{Block: Miscellaneous_Symbols_And_Arrows}  (Single: \p{InMiscellaneousSymbolsAndArrows}) (256)
    \p{Block: Miscellaneous_Symbols_And_Pictographs}  (Single: \p{InMiscellaneousSymbolsAndPictographs}) (768)
    \p{Block: Miscellaneous_Technical}  (Single: \p{InMiscellaneousTechnical}) (256)
    \p{Block: Modifier_Tone_Letters}  (Single: \p{InModifierToneLetters}) (32)
    \p{Block: Mongolian}  (Single: \p{InMongolian}; NOT \p{Mongolian} NOR \p{Is_Mongolian}) (176)
    \p{Block: Musical_Symbols}  (Single: \p{InMusicalSymbols}) (256)
    \p{Block: Myanmar}  (Single: \p{InMyanmar}; NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160)
    \p{Block: Myanmar_Extended_A}  (Single: \p{InMyanmarExtendedA}) (32)
    \p{Block: New_Tai_Lue}  (Single: \p{InNewTaiLue}; NOT \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue}) (96)
    \p{Block: NKo}  (Single: \p{InNKo}; NOT \p{Nko} NOR \p{Is_NKo}) (64)
    \p{Block: No_Block}  (Single: \p{InNoBlock}) (861_664)
    \p{Block: Number_Forms}  (Single: \p{InNumberForms}) (64)
    \p{Block: Ogham}  (Single: \p{InOgham}; NOT \p{Ogham} NOR \p{Is_Ogham}) (32)
    \p{Block: Ol_Chiki}  (Single: \p{InOlChiki}) (48)
    \p{Block: Old_Italic}  (Single: \p{InOldItalic}; NOT \p{Old_Italic} NOR \p{Is_Old_Italic}) (48)
    \p{Block: Old_Persian}  (Single: \p{InOldPersian}; NOT \p{Old_Persian} NOR \p{Is_Old_Persian}) (64)
    \p{Block: Old_South_Arabian}  (Single: \p{InOldSouthArabian}) (32)
    \p{Block: Old_Turkic}  (Single: \p{InOldTurkic}; NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic}) (80)
    \p{Block: Optical_Character_Recognition}  (Single: \p{InOpticalCharacterRecognition}) (32)
    \p{Block: Oriya}  (Single: \p{InOriya}; NOT \p{Oriya} NOR \p{Is_Oriya}) (128)
    \p{Block: Osmanya}  (Single: \p{InOsmanya}; NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48)
    \p{Block: Phags_Pa}  (Single: \p{InPhagsPa}; NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) (64)
    \p{Block: Phaistos_Disc}  (Single: \p{InPhaistosDisc}) (48)
    \p{Block: Phoenician}  (Single: \p{InPhoenician}; NOT \p{Phoenician} NOR \p{Is_Phoenician}) (32)
    \p{Block: Phonetic_Extensions}  (Single: \p{InPhoneticExtensions}) (128)
    \p{Block: Phonetic_Extensions_Supplement}  (Single: \p{InPhoneticExtensionsSupplement}) (64)
    \p{Block: Playing_Cards}  (Single: \p{InPlayingCards}) (96)
    \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT \p{Private_Use} NOR \p{Is_Private_Use}) (6400)
    \p{Block: Private_Use_Area}  (Short: \p{Blk=PrivateUse}, \p{InPrivateUse}; NOT \p{Private_Use} NOR \p{Is_Private_Use}) (6400)
    \p{Block: Rejang}  (Single: \p{InRejang}; NOT \p{Rejang} NOR \p{Is_Rejang}) (48)
    \p{Block: Rumi_Numeral_Symbols}  (Single: \p{InRumiNumeralSymbols}) (32)
    \p{Block: Runic}  (Single: \p{InRunic}; NOT \p{Runic} NOR \p{Is_Runic}) (96)
    \p{Block: Samaritan}  (Single: \p{InSamaritan}; NOT \p{Samaritan} NOR \p{Is_Samaritan}) (64)
    \p{Block: Saurashtra}  (Single: \p{InSaurashtra}; NOT \p{Saurashtra} NOR \p{Is_Saurashtra}) (96)
    \p{Block: Shavian}  (Single: \p{InShavian}) (48)
    \p{Block: Sinhala}  (Single: \p{InSinhala}; NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128)
    \p{Block: Small_Form_Variants}  (Single: \p{InSmallFormVariants}) (32)
    \p{Block: Spacing_Modifier_Letters}  (Single: \p{InSpacingModifierLetters}) (80)
    \p{Block: Specials}  (Single: \p{InSpecials}) (16)
    \p{Block: Sundanese}  (Single: \p{InSundanese}; NOT \p{Sundanese} NOR \p{Is_Sundanese}) (64)
    \p{Block: Superscripts_And_Subscripts}  (Single: \p{InSuperscriptsAndSubscripts}) (48)
    \p{Block: Supplemental_Arrows_A}  (Single: \p{InSupplementalArrowsA}) (16)
  516. \p{Block: Supplemental_Arrows_B} (Single:
  517. \p{InSupplementalArrowsB}) (128)
  518. \p{Block: Supplemental_Mathematical_Operators} (Single:
  519. \p{InSupplementalMathematicalOperators})
  520. (256)
  521. \p{Block: Supplemental_Punctuation} (Single:
  522. \p{InSupplementalPunctuation}) (128)
  523. \p{Block: Supplementary_Private_Use_Area_A} (Single:
  524. \p{InSupplementaryPrivateUseAreaA})
  525. (65_536)
  526. \p{Block: Supplementary_Private_Use_Area_B} (Single:
  527. \p{InSupplementaryPrivateUseAreaB})
  528. (65_536)
  529. \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
  530. \p{Syloti_Nagri} NOR
  531. \p{Is_Syloti_Nagri}) (48)
  532. \p{Block: Syriac} (Single: \p{InSyriac}; NOT \p{Syriac} NOR
  533. \p{Is_Syriac}) (80)
  534. \p{Block: Tagalog} (Single: \p{InTagalog}; NOT \p{Tagalog}
  535. NOR \p{Is_Tagalog}) (32)
  536. \p{Block: Tagbanwa} (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
  537. NOR \p{Is_Tagbanwa}) (32)
  538. \p{Block: Tags} (Single: \p{InTags}) (128)
  539. \p{Block: Tai_Le} (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
  540. \p{Is_Tai_Le}) (48)
  541. \p{Block: Tai_Tham} (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
  542. NOR \p{Is_Tai_Tham}) (144)
  543. \p{Block: Tai_Viet} (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
  544. NOR \p{Is_Tai_Viet}) (96)
  545. \p{Block: Tai_Xuan_Jing_Symbols} (Single:
  546. \p{InTaiXuanJingSymbols}) (96)
  547. \p{Block: Tamil} (Single: \p{InTamil}; NOT \p{Tamil} NOR
  548. \p{Is_Tamil}) (128)
  549. \p{Block: Telugu} (Single: \p{InTelugu}; NOT \p{Telugu} NOR
  550. \p{Is_Telugu}) (128)
  551. \p{Block: Thaana} (Single: \p{InThaana}; NOT \p{Thaana} NOR
  552. \p{Is_Thaana}) (64)
  553. \p{Block: Thai} (Single: \p{InThai}; NOT \p{Thai} NOR
  554. \p{Is_Thai}) (128)
  555. \p{Block: Tibetan} (Single: \p{InTibetan}; NOT \p{Tibetan}
  556. NOR \p{Is_Tibetan}) (256)
  557. \p{Block: Tifinagh} (Single: \p{InTifinagh}; NOT \p{Tifinagh}
  558. NOR \p{Is_Tifinagh}) (80)
  559. \p{Block: Transport_And_Map_Symbols} (Single:
  560. \p{InTransportAndMapSymbols}) (128)
  561. \p{Block: Ugaritic} (Single: \p{InUgaritic}; NOT \p{Ugaritic}
  562. NOR \p{Is_Ugaritic}) (32)
  563. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  564. CanadianSyllabics},
  565. \p{InCanadianSyllabics}) (640)
  566. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Single:
  567. \p{InUnifiedCanadianAboriginalSyllabicsExtended})
  568. (80)
  569. \p{Block: Vai} (Single: \p{InVai}; NOT \p{Vai} NOR
  570. \p{Is_Vai}) (320)
  571. \p{Block: Variation_Selectors} (Single: \p{InVariationSelectors})
  572. (16)
  573. \p{Block: Variation_Selectors_Supplement} (Single:
  574. \p{InVariationSelectorsSupplement}) (240)
  575. \p{Block: Vedic_Extensions} (Single: \p{InVedicExtensions}) (48)
  576. \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
  577. \p{Block: Yi_Radicals} (Single: \p{InYiRadicals}) (64)
  578. \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
  579. \p{Block: Yijing_Hexagram_Symbols} (Single:
  580. \p{InYijingHexagramSymbols}) (64)
  581. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  582. \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
  583. \p{Block=Bopomofo}) (70)
  584. \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
  585. \p{Block=Bopomofo}) (70)
  586. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (32)
  587. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  588. \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
  589. \p{Block=Brahmi}) (108)
  590. \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
  591. \p{Block=Brahmi}) (108)
  592. \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
  593. \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
  594. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (256)
  595. \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
  596. \p{Block=Buginese}) (30)
  597. \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
  598. \p{Block=Buginese}) (30)
  599. \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
  600. \p{Block=Buhid}) (20)
  601. \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
  602. \p{Block=Buhid}) (20)
  603. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  604. (256)
  605. \p{C} \p{Other} (= \p{General_Category=Other})
  606. (1_004_868)
  607. \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
  608. \p{Cans}) (710)
  609. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  610. (= \p{Block=
  611. Unified_Canadian_Aboriginal_Syllabics})
  612. (640)
  613. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  614. Not_Reordered} (1_113_506)
  615. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  616. Overlay} (26)
  617. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  618. Nukta} (12)
  619. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  620. Kana_Voicing} (2)
  621. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  622. Virama} (31)
  623. T \p{Canonical_Combining_Class: 10} (Short: \p{Ccc=10}) (1)
  624. T \p{Canonical_Combining_Class: 11} (Short: \p{Ccc=11}) (1)
  625. T \p{Canonical_Combining_Class: 12} (Short: \p{Ccc=12}) (1)
  626. T \p{Canonical_Combining_Class: 13} (Short: \p{Ccc=13}) (1)
  627. T \p{Canonical_Combining_Class: 14} (Short: \p{Ccc=14}) (1)
  628. T \p{Canonical_Combining_Class: 15} (Short: \p{Ccc=15}) (1)
  629. T \p{Canonical_Combining_Class: 16} (Short: \p{Ccc=16}) (1)
  630. T \p{Canonical_Combining_Class: 17} (Short: \p{Ccc=17}) (1)
  631. T \p{Canonical_Combining_Class: 18} (Short: \p{Ccc=18}) (2)
  632. T \p{Canonical_Combining_Class: 19} (Short: \p{Ccc=19}) (2)
  633. T \p{Canonical_Combining_Class: 20} (Short: \p{Ccc=20}) (1)
  634. T \p{Canonical_Combining_Class: 21} (Short: \p{Ccc=21}) (1)
  635. T \p{Canonical_Combining_Class: 22} (Short: \p{Ccc=22}) (1)
  636. T \p{Canonical_Combining_Class: 23} (Short: \p{Ccc=23}) (1)
  637. T \p{Canonical_Combining_Class: 24} (Short: \p{Ccc=24}) (1)
  638. T \p{Canonical_Combining_Class: 25} (Short: \p{Ccc=25}) (1)
  639. T \p{Canonical_Combining_Class: 26} (Short: \p{Ccc=26}) (1)
  640. T \p{Canonical_Combining_Class: 27} (Short: \p{Ccc=27}) (1)
  641. T \p{Canonical_Combining_Class: 28} (Short: \p{Ccc=28}) (1)
  642. T \p{Canonical_Combining_Class: 29} (Short: \p{Ccc=29}) (1)
  643. T \p{Canonical_Combining_Class: 30} (Short: \p{Ccc=30}) (2)
  644. T \p{Canonical_Combining_Class: 31} (Short: \p{Ccc=31}) (2)
  645. T \p{Canonical_Combining_Class: 32} (Short: \p{Ccc=32}) (2)
  646. T \p{Canonical_Combining_Class: 33} (Short: \p{Ccc=33}) (1)
  647. T \p{Canonical_Combining_Class: 34} (Short: \p{Ccc=34}) (1)
  648. T \p{Canonical_Combining_Class: 35} (Short: \p{Ccc=35}) (1)
  649. T \p{Canonical_Combining_Class: 36} (Short: \p{Ccc=36}) (1)
  650. T \p{Canonical_Combining_Class: 84} (Short: \p{Ccc=84}) (1)
  651. T \p{Canonical_Combining_Class: 91} (Short: \p{Ccc=91}) (1)
  652. T \p{Canonical_Combining_Class: 103} (Short: \p{Ccc=103}) (2)
  653. T \p{Canonical_Combining_Class: 107} (Short: \p{Ccc=107}) (4)
  654. T \p{Canonical_Combining_Class: 118} (Short: \p{Ccc=118}) (2)
  655. T \p{Canonical_Combining_Class: 122} (Short: \p{Ccc=122}) (4)
  656. T \p{Canonical_Combining_Class: 129} (Short: \p{Ccc=129}) (1)
  657. T \p{Canonical_Combining_Class: 130} (Short: \p{Ccc=130}) (6)
  658. T \p{Canonical_Combining_Class: 132} (Short: \p{Ccc=132}) (1)
  659. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  660. Attached_Below_Left} (0)
  661. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  662. Attached_Below} (5)
  663. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  664. Attached_Above} (1)
  665. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  666. Attached_Above_Right} (9)
  667. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  668. Below_Left} (1)
  669. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  670. Below} (121)
  671. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  672. Below_Right} (4)
  673. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  674. Left} (2)
  675. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  676. Right} (1)
  677. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  678. Above_Left} (3)
  679. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  680. Above} (320)
  681. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  682. Above_Right} (4)
  683. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  684. Double_Below} (4)
  685. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  686. Double_Above} (5)
  687. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  688. Iota_Subscript} (1)
  689. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  690. Above} (320)
  691. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (320)
  692. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  693. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  694. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  695. Above_Left} (3)
  696. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  697. Above_Right} (4)
  698. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  699. Attached_Above} (1)
  700. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  701. Attached_Above_Right} (9)
  702. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  703. Attached_Below} (5)
  704. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  705. Attached_Below_Left} (0)
  706. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  707. (1)
  708. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  709. \p{Ccc=ATAR}) (9)
  710. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  711. (5)
  712. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  713. ATBL}) (0)
  714. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  715. Below} (121)
  716. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (121)
  717. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  718. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  719. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  720. Below_Left} (1)
  721. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  722. Below_Right} (4)
  723. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  724. Double_Above} (5)
  725. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  726. Double_Below} (4)
  727. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  728. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  729. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  730. (1)
  731. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  732. Iota_Subscript} (1)
  733. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  734. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  735. Kana_Voicing} (2)
  736. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  737. Left} (2)
  738. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  739. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  740. Nukta} (12)
  741. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  742. (1_113_506)
  743. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  744. Not_Reordered} (1_113_506)
  745. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (12)
  746. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  747. Overlay} (26)
  748. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
  749. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  750. Right} (1)
  751. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  752. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (31)
  753. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  754. Virama} (31)
  755. \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
  756. Canadian_Aboriginal}) (710)
  757. \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
  758. \p{Block=Carian}) (49)
  759. \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
  760. \p{Block=Carian}) (49)
  761. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (1692)
  762. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_112_420)
  763. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (1692)
  764. \p{Cased} \p{Cased=Y} (3427)
  765. \p{Cased: N*} (Single: \P{Cased}) (1_110_685)
  766. \p{Cased: Y*} (Single: \p{Cased}) (3427)
  767. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  768. \p{LC}) (3226)
  769. \p{Category: *} \p{General_Category: *}
  770. \p{Cc} \p{Cntrl} (= \p{General_Category=Control})
  771. (65)
  772. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  773. \p{CE} \p{Composition_Exclusion} (=
  774. \p{Composition_Exclusion=Y}) (81)
  775. \p{CE: *} \p{Composition_Exclusion: *}
  776. \p{Cf} \p{Format} (= \p{General_Category=Format})
  777. (140)
  778. \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
  779. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  780. \p{CWCF}) (1102)
  781. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  782. (1_113_010)
  783. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  784. (1102)
  785. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  786. \p{CWCM}) (2128)
  787. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  788. (1_111_984)
  789. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  790. (2128)
  791. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  792. \p{CWL}) (1038)
  793. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  794. (1_113_074)
  795. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1038)
  796. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  797. Y} (Short: \p{CWKCF}) (9792)
  798. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  799. \P{CWKCF}) (1_104_320)
  800. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  801. \p{CWKCF}) (9792)
  802. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  803. \p{CWT}) (1094)
  804. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  805. (1_113_018)
  806. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1094)
  807. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  808. \p{CWU}) (1121)
  809. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  810. (1_112_991)
  811. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1121)
  812. \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
  813. \p{Block=Cherokee}) (85)
  814. \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
  815. \p{Block=Cherokee}) (85)
  816. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  817. Y}) (1692)
  818. \p{CI: *} \p{Case_Ignorable: *}
  819. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (256)
  820. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms} (32)
  821. X \p{CJK_Compatibility_Ideographs} \p{Block=
  822. CJK_Compatibility_Ideographs} (512)
  823. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  824. CJK_Compatibility_Ideographs_Supplement}
  825. (544)
  826. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement} (128)
  827. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  828. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  829. CJK_Symbols_And_Punctuation} (64)
  830. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  831. (20_992)
  832. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  833. CJK_Unified_Ideographs_Extension_A}
  834. (6592)
  835. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  836. CJK_Unified_Ideographs_Extension_B}
  837. (42_720)
  838. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  839. CJK_Unified_Ideographs_Extension_C}
  840. (4160)
  841. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  842. CJK_Unified_Ideographs_Extension_D} (224)
  843. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  844. (Short: \p{Pe}) (71)
  845. \p{Cn} \p{Unassigned} (= \p{General_Category=
  846. Unassigned}) (865_147)
  847. \p{Cntrl} \p{General_Category=Control} Control
  848. characters (Short: \p{Cc}) (65)
  849. \p{Co} \p{Private_Use} (= \p{General_Category=
  850. Private_Use}) (NOT \p{Private_Use_Area})
  851. (137_468)
  852. X \p{Combining_Diacritical_Marks} \p{Block=
  853. Combining_Diacritical_Marks} (112)
  854. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  855. Combining_Diacritical_Marks_For_Symbols}
  856. (Short: \p{InCombiningMarksForSymbols})
  857. (48)
  858. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  859. Combining_Diacritical_Marks_Supplement}
  860. (64)
  861. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (16)
  862. X \p{Combining_Marks_For_Symbols}
  863. \p{Combining_Diacritical_Marks_For_Symbols}
  864. (= \p{Block=
  865. Combining_Diacritical_Marks_For_Symbols})
  866. (48)
  867. \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (6379)
  868. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  869. (16)
  870. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  871. \p{Full_Composition_Exclusion=Y}) (1118)
  872. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  873. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  874. \p{CE}) (81)
  875. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031)
  876. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  877. \p{Connector_Punctuation} \p{General_Category=
  878. Connector_Punctuation} (Short: \p{Pc})
  879. (10)
  880. \p{Control} \p{Cntrl} (= \p{General_Category=Control})
  881. (65)
  882. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  883. \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
  884. \p{Block=Coptic}) (135)
  885. \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
  886. \p{Block=Coptic}) (135)
  887. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (32)
  888. \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
  889. \p{Cs} \p{Surrogate} (= \p{General_Category=
  890. Surrogate}) (2048)
  891. \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
  892. \p{Block=Cuneiform}) (982)
  893. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  894. Cuneiform_Numbers_And_Punctuation} (128)
  895. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  896. (Short: \p{Sc}) (47)
  897. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  898. \p{CWCF} \p{Changes_When_Casefolded} (=
  899. \p{Changes_When_Casefolded=Y}) (1102)
  900. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  901. \p{CWCM} \p{Changes_When_Casemapped} (=
  902. \p{Changes_When_Casemapped=Y}) (2128)
  903. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  904. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  905. \p{Changes_When_NFKC_Casefolded=Y})
  906. (9792)
  907. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  908. \p{CWL} \p{Changes_When_Lowercased} (=
  909. \p{Changes_When_Lowercased=Y}) (1038)
  910. \p{CWL: *} \p{Changes_When_Lowercased: *}
  911. \p{CWT} \p{Changes_When_Titlecased} (=
  912. \p{Changes_When_Titlecased=Y}) (1094)
  913. \p{CWT: *} \p{Changes_When_Titlecased: *}
  914. \p{CWU} \p{Changes_When_Uppercased} (=
  915. \p{Changes_When_Uppercased=Y}) (1121)
  916. \p{CWU: *} \p{Changes_When_Uppercased: *}
  917. \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
  918. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  919. \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
  920. \p{Block=Cyrillic}) (408)
  921. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (32)
  922. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (96)
  923. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (48)
  924. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  925. Cyrillic_Supplement}) (48)
  926. \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
  927. \p{Block=Cyrillic}) (408)
  928. \p{Dash} \p{Dash=Y} (25)
  929. \p{Dash: N*} (Single: \P{Dash}) (1_114_087)
  930. \p{Dash: Y*} (Single: \p{Dash}) (25)
  931. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  932. (Short: \p{Pd}) (21)
  933. \p{Decimal_Number} \p{Digit} (= \p{General_Category=
  934. Decimal_Number}) (420)
  935. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  936. (13_221)
  937. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_221)
  938. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  939. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  940. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  941. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  942. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  943. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  944. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1043)
  945. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  946. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  947. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  948. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  949. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  950. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  951. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  952. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  953. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  954. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  955. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  956. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  957. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  958. Non_Canonical} (Perl extension) (3510)
  959. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  960. decompositions (Short: \p{Dt=NonCanon})
  961. (Perl extension) (3510)
  962. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_381)
  963. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  964. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  965. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
  966. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
  967. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  968. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (142)
  969. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (142)
  970. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  971. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  972. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  973. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  974. Y} (Short: \p{DI}) (4167)
  975. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  976. (1_109_945)
  977. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  978. (4167)
  979. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (111)
  980. \p{Dep: *} \p{Deprecated: *}
  981. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (111)
  982. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_001)
  983. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (111)
  984. \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
  985. \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
  986. (NOT \p{Block=Devanagari}) (150)
  987. \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
  988. NOT \p{Block=Devanagari}) (150)
  989. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (32)
  990. \p{DI} \p{Default_Ignorable_Code_Point} (=
  991. \p{Default_Ignorable_Code_Point=Y})
  992. (4167)
  993. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  994. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (639)
  995. \p{Dia: *} \p{Diacritic: *}
  996. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (639)
  997. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_473)
  998. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (639)
  999. \p{Digit} \p{General_Category=Decimal_Number} [0-9]
  1000. + all other decimal digits (Short:
  1001. \p{Nd}) (420)
  1002. X \p{Dingbats} \p{Block=Dingbats} (192)
  1003. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (112)
  1004. \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
  1005. \p{Dt: *} \p{Decomposition_Type: *}
  1006. \p{Ea: *} \p{East_Asian_Width: *}
  1007. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
  1008. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
  1009. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1010. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1011. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1012. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1013. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_811)
  1014. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1015. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1016. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_811)
  1017. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_217)
  1018. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_217)
  1019. \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
  1020. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1021. Egyptian_Hieroglyphs}) (1071)
  1022. \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
  1023. \p{Egyp}; NOT \p{Block=
  1024. Egyptian_Hieroglyphs}) (1071)
  1025. X \p{Emoticons} \p{Block=Emoticons} (80)
  1026. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1027. Enclosed_Alphanumeric_Supplement} (256)
  1028. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics} (160)
  1029. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1030. Enclosed_CJK_Letters_And_Months} (256)
  1031. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1032. Enclosed_Ideographic_Supplement} (256)
  1033. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1034. (Short: \p{Me}) (12)
  1035. \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
  1036. \p{Block=Ethiopic}) (495)
  1037. \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
  1038. \p{Block=Ethiopic}) (495)
  1039. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (96)
  1040. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (48)
  1041. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (32)
  1042. \p{Ext} \p{Extender} (= \p{Extender=Y}) (28)
  1043. \p{Ext: *} \p{Extender: *}
  1044. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (28)
  1045. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_084)
  1046. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (28)
  1047. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1048. (Short: \p{Pf}) (10)
  1049. \p{Format} \p{General_Category=Format} (Short:
  1050. \p{Cf}) (140)
  1051. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1052. (Short: \p{CompEx}) (1118)
  1053. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1054. \P{CompEx}) (1_112_994)
  1055. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1056. \p{CompEx}) (1118)
  1057. \p{Gc: *} \p{General_Category: *}
  1058. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1059. \p{General_Category: C} \p{General_Category=Other} (1_004_868)
  1060. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1061. \p{Gc=LC}, \p{LC}) (3226)
  1062. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1063. \p{General_Category: Cf} \p{General_Category=Format} (140)
  1064. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1065. (71)
  1066. \p{General_Category: Cn} \p{General_Category=Unassigned} (865_147)
  1067. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1068. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1069. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1070. \p{Pc}) (10)
  1071. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1072. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1073. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1074. (47)
  1075. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1076. (21)
  1077. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1078. (420)
  1079. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1080. (420)
  1081. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1082. (12)
  1083. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1084. (10)
  1085. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (140)
  1086. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1087. \p{Pi}) (12)
  1088. \p{General_Category: L} \p{General_Category=Letter} (100_520)
  1089. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3226)
  1090. X \p{General_Category: L_} \p{General_Category=Cased_Letter} (3226)
  1091. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3226)
  1092. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (100_520)
  1093. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1094. (224)
  1095. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1096. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1097. (/i= General_Category=Cased_Letter)
  1098. (1759)
  1099. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (210)
  1100. \p{General_Category: Lo} \p{General_Category=Other_Letter} (97_084)
  1101. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1102. /i= General_Category=Cased_Letter) (1759)
  1103. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1104. (/i= General_Category=Cased_Letter) (31)
  1105. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1106. (/i= General_Category=Cased_Letter)
  1107. (1436)
  1108. \p{General_Category: M} \p{General_Category=Mark} (1498)
  1109. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1498)
  1110. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
  1111. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (287)
  1112. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (12)
  1113. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1114. (1199)
  1115. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1116. (210)
  1117. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1118. (115)
  1119. \p{General_Category: N} \p{General_Category=Number} (1100)
  1120. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (420)
  1121. \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
  1122. \p{General_Category: No} \p{General_Category=Other_Number} (456)
  1123. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  1124. (1199)
  1125. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1100)
  1126. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  1127. (72)
  1128. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_004_868)
  1129. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  1130. (97_084)
  1131. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (456)
  1132. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  1133. (402)
  1134. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  1135. (4398)
  1136. \p{General_Category: P} \p{General_Category=Punctuation} (598)
  1137. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  1138. \p{Zp}) (1)
  1139. \p{General_Category: Pc} \p{General_Category=
  1140. Connector_Punctuation} (10)
  1141. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (21)
  1142. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  1143. (71)
  1144. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  1145. (10)
  1146. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  1147. (12)
  1148. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  1149. (402)
  1150. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  1151. (137_468)
  1152. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (72)
  1153. \p{General_Category: Punct} \p{General_Category=Punctuation} (598)
  1154. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (598)
  1155. \p{General_Category: S} \p{General_Category=Symbol} (5508)
  1156. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (47)
  1157. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (20)
  1158. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (115)
  1159. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  1160. \p{General_Category: So} \p{General_Category=Other_Symbol} (4398)
  1161. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  1162. (18)
  1163. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (287)
  1164. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  1165. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (5508)
  1166. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  1167. /i= General_Category=Cased_Letter) (31)
  1168. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  1169. (865_147)
  1170. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  1171. /i= General_Category=Cased_Letter) (1436)
  1172. \p{General_Category: Z} \p{General_Category=Separator} (20)
  1173. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  1174. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  1175. (1)
  1176. \p{General_Category: Zs} \p{General_Category=Space_Separator} (18)
  1177. X \p{General_Punctuation} \p{Block=General_Punctuation} (112)
  1178. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  1179. \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
  1180. \p{Block=Georgian}) (120)
  1181. \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
  1182. \p{Block=Georgian}) (120)
  1183. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (48)
  1184. \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
  1185. (NOT \p{Block=Glagolitic}) (94)
  1186. \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
  1187. NOT \p{Block=Glagolitic}) (94)
  1188. \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
  1189. \p{Block=Gothic}) (27)
  1190. \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
  1191. \p{Block=Gothic}) (27)
  1192. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  1193. (108_010)
  1194. \p{Gr_Base: *} \p{Grapheme_Base: *}
  1195. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  1196. Y}) (1234)
  1197. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  1198. \p{Graph} Characters that are graphical (246_832)
  1199. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  1200. (108_010)
  1201. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase})
  1202. (1_006_102)
  1203. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (108_010)
  1204. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  1205. (203)
  1206. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (203)
  1207. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  1208. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  1209. (1234)
  1210. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1234)
  1211. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  1212. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  1213. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  1214. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  1215. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_100_854)
  1216. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  1217. (15)
  1218. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (15)
  1219. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  1220. SpacingMark} (275)
  1221. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (275)
  1222. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  1223. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  1224. \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
  1225. (1_100_854)
  1226. \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
  1227. (1234)
  1228. \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_878)
  1229. \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1234)
  1230. \p{Greek} \p{Script=Greek} (Short: \p{Grek}; NOT
  1231. \p{Greek_And_Coptic}) (511)
  1232. X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
  1233. \p{InGreek}) (144)
  1234. X \p{Greek_Extended} \p{Block=Greek_Extended} (256)
  1235. \p{Grek} \p{Greek} (= \p{Script=Greek}) (NOT
  1236. \p{Greek_And_Coptic}) (511)
  1237. \p{Gujarati} \p{Script=Gujarati} (Short: \p{Gujr}; NOT
  1238. \p{Block=Gujarati}) (83)
  1239. \p{Gujr} \p{Gujarati} (= \p{Script=Gujarati}) (NOT
  1240. \p{Block=Gujarati}) (83)
  1241. \p{Gurmukhi} \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
  1242. \p{Block=Gurmukhi}) (79)
  1243. \p{Guru} \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
  1244. \p{Block=Gurmukhi}) (79)
  1245. X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
  1246. Halfwidth_And_Fullwidth_Forms} (240)
  1247. \p{Han} \p{Script=Han} (75_960)
  1248. \p{Hang} \p{Hangul} (= \p{Script=Hangul}) (11_739)
  1249. \p{Hangul} \p{Script=Hangul} (Short: \p{Hang})
  1250. (11_739)
  1251. X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
  1252. (96)
  1253. X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (256)
  1254. X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  1255. X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  1256. \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
  1257. (125)
  1258. \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
  1259. \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
  1260. (399)
  1261. \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
  1262. \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
  1263. LVT_Syllable} (10_773)
  1264. \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
  1265. (10_773)
  1266. \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
  1267. Not_Applicable} (1_102_583)
  1268. \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
  1269. (1_102_583)
  1270. \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
  1271. (137)
  1272. \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
  1273. \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
  1274. (95)
  1275. \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
  1276. X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (11_184)
  1277. \p{Hani} \p{Han} (= \p{Script=Han}) (75_960)
  1278. \p{Hano} \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
  1279. \p{Block=Hanunoo}) (21)
  1280. \p{Hanunoo} \p{Script=Hanunoo} (Short: \p{Hano}; NOT
  1281. \p{Block=Hanunoo}) (21)
  1282. \p{Hebr} \p{Hebrew} (= \p{Script=Hebrew}) (NOT
  1283. \p{Block=Hebrew}) (133)
  1284. \p{Hebrew} \p{Script=Hebrew} (Short: \p{Hebr}; NOT
  1285. \p{Block=Hebrew}) (133)
  1286. \p{Hex} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  1287. \p{Hex: *} \p{Hex_Digit: *}
  1288. \p{Hex_Digit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  1289. \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068)
  1290. \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
  1291. X \p{High_Private_Use_Surrogates} \p{Block=
  1292. High_Private_Use_Surrogates} (128)
  1293. X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  1294. \p{Hira} \p{Hiragana} (= \p{Script=Hiragana}) (NOT
  1295. \p{Block=Hiragana}) (91)
  1296. \p{Hiragana} \p{Script=Hiragana} (Short: \p{Hira}; NOT
  1297. \p{Block=Hiragana}) (91)
  1298. \p{HorizSpace} \p{Blank} (19)
  1299. \p{Hst: *} \p{Hangul_Syllable_Type: *}
  1300. D \p{Hyphen} \p{Hyphen=Y} (11)
  1301. D \p{Hyphen: N*} Supplanted by Line_Break property values;
  1302. see http://www.unicode.org/reports/tr14/
  1303. (Single: \P{Hyphen}) (1_114_101)
  1304. D \p{Hyphen: Y*} Supplanted by Line_Break property values;
  1305. see http://www.unicode.org/reports/tr14/
  1306. (Single: \p{Hyphen}) (11)
  1307. \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC})
  1308. (102_675)
  1309. \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (1_011_437)
  1310. \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (102_675)
  1311. \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (100_747)
  1312. \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (1_013_365)
  1313. \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (100_747)
  1314. \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y})
  1315. (102_675)
  1316. \p{IDC: *} \p{ID_Continue: *}
  1317. \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
  1318. (75_630)
  1319. \p{Ideo: *} \p{Ideographic: *}
  1320. \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
  1321. (75_630)
  1322. \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_038_482)
  1323. \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (75_630)
  1324. X \p{Ideographic_Description_Characters} \p{Block=
  1325. Ideographic_Description_Characters} (16)
  1326. \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (100_747)
  1327. \p{IDS: *} \p{ID_Start: *}
  1328. \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
  1329. \p{IDSB}) (10)
  1330. \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
  1331. (1_114_102)
  1332. \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  1333. \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
  1334. \p{IDST}) (2)
  1335. \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
  1336. (1_114_110)
  1337. \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
  1338. \p{IDSB} \p{IDS_Binary_Operator} (=
  1339. \p{IDS_Binary_Operator=Y}) (10)
  1340. \p{IDSB: *} \p{IDS_Binary_Operator: *}
  1341. \p{IDST} \p{IDS_Trinary_Operator} (=
  1342. \p{IDS_Trinary_Operator=Y}) (2)
  1343. \p{IDST: *} \p{IDS_Trinary_Operator: *}
  1344. \p{Imperial_Aramaic} \p{Script=Imperial_Aramaic} (Short:
  1345. \p{Armi}; NOT \p{Block=
  1346. Imperial_Aramaic}) (31)
  1347. \p{In: *} \p{Present_In: *} (Perl extension)
  1348. \p{In_*} \p{Block: *}
  1349. \p{Inherited} \p{Script=Inherited} (Short: \p{Zinh})
  1350. (523)
  1351. \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
  1352. (Short: \p{Pi}) (12)
  1353. \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
  1354. \p{Phli}; NOT \p{Block=
  1355. Inscriptional_Pahlavi}) (27)
  1356. \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
  1357. (Short: \p{Prti}; NOT \p{Block=
  1358. Inscriptional_Parthian}) (30)
  1359. X \p{IPA_Extensions} \p{Block=IPA_Extensions} (96)
  1360. \p{Is_*} \p{*} (Any exceptions are individually
  1361. noted beginning with the word NOT.) If
  1362. an entry has flag(s) at its beginning,
  1363. like 'D', the 'Is_' form has the same
  1364. flag(s).
  1365. \p{Ital} \p{Old_Italic} (= \p{Script=Old_Italic})
  1366. (NOT \p{Block=Old_Italic}) (35)
  1367. \p{Java} \p{Javanese} (= \p{Script=Javanese}) (NOT
  1368. \p{Block=Javanese}) (91)
  1369. \p{Javanese} \p{Script=Javanese} (Short: \p{Java}; NOT
  1370. \p{Block=Javanese}) (91)
  1371. \p{Jg: *} \p{Joining_Group: *}
  1372. \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  1373. \p{Join_C: *} \p{Join_Control: *}
  1374. \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  1375. \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110)
  1376. \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  1377. \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (7)
  1378. \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
  1379. \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
  1380. \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (19)
  1381. \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
  1382. \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
  1383. BurushaskiYehBarree}) (2)
  1384. \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (14)
  1385. \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
  1386. \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
  1387. \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
  1388. \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
  1389. \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (9)
  1390. \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
  1391. \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (13)
  1392. \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
  1393. \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (17)
  1394. \p{Joining_Group: Hamza_On_Heh_Goal} \p{Joining_Group=
  1395. Teh_Marbuta_Goal} (1)
  1396. \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
  1397. \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
  1398. \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
  1399. \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
  1400. \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (5)
  1401. \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
  1402. \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
  1403. \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
  1404. \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (6)
  1405. \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
  1406. \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (3)
  1407. \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
  1408. \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
  1409. (1_113_882)
  1410. \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
  1411. \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
  1412. \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
  1413. \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
  1414. \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (4)
  1415. \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
  1416. \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (16)
  1417. \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
  1418. \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (5)
  1419. \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
  1420. \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
  1421. \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
  1422. \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
  1423. \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
  1424. \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
  1425. \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (3)
  1426. \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
  1427. \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
  1428. \p{Joining_Group: Teh_Marbuta_Goal} (Short: \p{Jg=TehMarbutaGoal})
  1429. (1)
  1430. \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
  1431. \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (15)
  1432. \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (8)
  1433. \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
  1434. \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
  1435. \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
  1436. \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
  1437. \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
  1438. \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
  1439. \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (3)
  1440. \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (189)
  1441. \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (189)
  1442. \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (3)
  1443. \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (0)
  1444. \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (0)
  1445. \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_502)
  1446. \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (74)
  1447. \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (74)
  1448. \p{Joining_Type: T} \p{Joining_Type=Transparent} (1344)
  1449. \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1344)
  1450. \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_112_502)
  1451. \p{Jt: *} \p{Joining_Type: *}
  1452. \p{Kaithi} \p{Script=Kaithi} (Short: \p{Kthi}; NOT
  1453. \p{Block=Kaithi}) (66)
  1454. \p{Kali} \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
  1455. \p{Kana} \p{Katakana} (= \p{Script=Katakana}) (NOT
  1456. \p{Block=Katakana}) (300)
  1457. X \p{Kana_Supplement} \p{Block=Kana_Supplement} (256)
  1458. X \p{Kanbun} \p{Block=Kanbun} (16)
  1459. X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (224)
  1460. \p{Kannada} \p{Script=Kannada} (Short: \p{Knda}; NOT
  1461. \p{Block=Kannada}) (86)
  1462. \p{Katakana} \p{Script=Katakana} (Short: \p{Kana}; NOT
  1463. \p{Block=Katakana}) (300)
  1464. X \p{Katakana_Phonetic_Extensions} \p{Block=
  1465. Katakana_Phonetic_Extensions} (16)
  1466. \p{Kayah_Li} \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
  1467. \p{Khar} \p{Kharoshthi} (= \p{Script=Kharoshthi})
  1468. (NOT \p{Block=Kharoshthi}) (65)
  1469. \p{Kharoshthi} \p{Script=Kharoshthi} (Short: \p{Khar};
  1470. NOT \p{Block=Kharoshthi}) (65)
  1471. \p{Khmer} \p{Script=Khmer} (Short: \p{Khmr}; NOT
  1472. \p{Block=Khmer}) (146)
  1473. X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  1474. \p{Khmr} \p{Khmer} (= \p{Script=Khmer}) (NOT
  1475. \p{Block=Khmer}) (146)
  1476. \p{Knda} \p{Kannada} (= \p{Script=Kannada}) (NOT
  1477. \p{Block=Kannada}) (86)
  1478. \p{Kthi} \p{Kaithi} (= \p{Script=Kaithi}) (NOT
  1479. \p{Block=Kaithi}) (66)
  1480. \p{L} \p{Letter} (= \p{General_Category=Letter})
  1481. (100_520)
  1482. \p{L&} \p{Cased_Letter} (= \p{General_Category=
  1483. Cased_Letter}) (3226)
  1484. \p{L_} \p{Cased_Letter} (= \p{General_Category=
  1485. Cased_Letter}) (3226)
  1486. \p{Lana} \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
  1487. \p{Block=Tai_Tham}) (127)
  1488. \p{Lao} \p{Script=Lao} (NOT \p{Block=Lao}) (65)
  1489. \p{Laoo} \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
  1490. Lao}) (65)
  1491. \p{Latin} \p{Script=Latin} (Short: \p{Latn}) (1267)
  1492. X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
  1493. Latin_1_Supplement}) (128)
  1494. X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
  1495. \p{InLatin1}) (128)
  1496. X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (128)
  1497. X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
  1498. (256)
  1499. X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (208)
  1500. X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (32)
  1501. X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (224)
  1502. \p{Latn} \p{Latin} (= \p{Script=Latin}) (1267)
  1503. \p{Lb: *} \p{Line_Break: *}
  1504. \p{LC} \p{Cased_Letter} (= \p{General_Category=
  1505. Cased_Letter}) (3226)
  1506. \p{Lepc} \p{Lepcha} (= \p{Script=Lepcha}) (NOT
  1507. \p{Block=Lepcha}) (74)
  1508. \p{Lepcha} \p{Script=Lepcha} (Short: \p{Lepc}; NOT
  1509. \p{Block=Lepcha}) (74)
  1510. \p{Letter} \p{General_Category=Letter} (Short: \p{L})
  1511. (100_520)
  1512. \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
  1513. \p{Nl}) (224)
  1514. X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  1515. \p{Limb} \p{Limbu} (= \p{Script=Limbu}) (NOT
  1516. \p{Block=Limbu}) (66)
  1517. \p{Limbu} \p{Script=Limbu} (Short: \p{Limb}; NOT
  1518. \p{Block=Limbu}) (66)
  1519. \p{Linb} \p{Linear_B} (= \p{Script=Linear_B}) (211)
  1520. \p{Line_Break: AI} \p{Line_Break=Ambiguous} (724)
  1521. \p{Line_Break: AL} \p{Line_Break=Alphabetic} (15_797)
  1522. \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (15_797)
  1523. \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (724)
  1524. \p{Line_Break: B2} \p{Line_Break=Break_Both} (1)
  1525. \p{Line_Break: BA} \p{Line_Break=Break_After} (140)
  1526. \p{Line_Break: BB} \p{Line_Break=Break_Before} (19)
  1527. \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
  1528. \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (140)
  1529. \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (19)
  1530. \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (1)
  1531. \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
  1532. \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
  1533. \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
  1534. \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (87)
  1535. \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
  1536. \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (87)
  1537. \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (1483)
  1538. \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1483)
  1539. \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (663)
  1540. \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
  1541. \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
  1542. \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
  1543. \p{Line_Break: EX} \p{Line_Break=Exclamation} (34)
  1544. \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (34)
  1545. \p{Line_Break: GL} \p{Line_Break=Glue} (18)
  1546. \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
  1547. \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
  1548. \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
  1549. \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
  1550. \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
  1551. \p{Line_Break: ID} \p{Line_Break=Ideographic} (161_793)
  1552. \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (161_793)
  1553. \p{Line_Break: IN} \p{Line_Break=Inseparable} (4)
  1554. \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
  1555. \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (4)
  1556. \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (4)
  1557. \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
  1558. \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
  1559. \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
  1560. \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
  1561. \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
  1562. \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
  1563. \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
  1564. \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
  1565. \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
  1566. \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (77)
  1567. \p{Line_Break: NS} \p{Line_Break=Nonstarter} (77)
  1568. \p{Line_Break: NU} \p{Line_Break=Numeric} (412)
  1569. \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (412)
  1570. \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (81)
  1571. \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (81)
  1572. \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (28)
  1573. \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (28)
  1574. \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (44)
  1575. \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (44)
  1576. \p{Line_Break: QU} \p{Line_Break=Quotation} (34)
  1577. \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (34)
  1578. \p{Line_Break: SA} \p{Line_Break=Complex_Context} (663)
  1579. D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
  1580. \p{Line_Break: SP} \p{Line_Break=Space} (1)
  1581. \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
  1582. D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
  1583. should never appear in well-formed text,
  1584. and therefore shouldn't be the basis for
  1585. line breaking (Short: \p{Lb=SG}) (2048)
  1586. \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
  1587. \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (919_067)
  1588. \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
  1589. \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
  1590. \p{Line_Break: XX} \p{Line_Break=Unknown} (919_067)
  1591. \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
  1592. \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
  1593. \p{Line_Separator} \p{General_Category=Line_Separator}
  1594. (Short: \p{Zl}) (1)
  1595. \p{Linear_B} \p{Script=Linear_B} (Short: \p{Linb}) (211)
  1596. X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
  1597. X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
  1598. \p{Lisu} \p{Script=Lisu} (48)
  1599. \p{Ll} \p{Lowercase_Letter} (=
  1600. \p{General_Category=Lowercase_Letter})
  1601. (/i= General_Category=Cased_Letter)
  1602. (1759)
  1603. \p{Lm} \p{Modifier_Letter} (=
  1604. \p{General_Category=Modifier_Letter})
  1605. (210)
  1606. \p{Lo} \p{Other_Letter} (= \p{General_Category=
  1607. Other_Letter}) (97_084)
  1608. \p{LOE} \p{Logical_Order_Exception} (=
  1609. \p{Logical_Order_Exception=Y}) (15)
  1610. \p{LOE: *} \p{Logical_Order_Exception: *}
  1611. \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
  1612. \p{LOE}) (15)
  1613. \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
  1614. (1_114_097)
  1615. \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
  1616. X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
  1617. \p{Lower} \p{Lowercase=Y} (/i= Cased=Yes) (1918)
  1618. \p{Lower: *} \p{Lowercase: *}
  1619. \p{Lowercase} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
  1620. Yes) (1918)
  1621. \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
  1622. No) (1_112_194)
  1623. \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
  1624. Yes) (1918)
  1625. \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
  1626. (Short: \p{Ll}; /i= General_Category=
  1627. Cased_Letter) (1759)
  1628. \p{Lt} \p{Titlecase_Letter} (=
  1629. \p{General_Category=Titlecase_Letter})
  1630. (/i= General_Category=Cased_Letter) (31)
  1631. \p{Lu} \p{Uppercase_Letter} (=
  1632. \p{General_Category=Uppercase_Letter})
  1633. (/i= General_Category=Cased_Letter)
  1634. (1436)
  1635. \p{Lyci} \p{Lycian} (= \p{Script=Lycian}) (NOT
  1636. \p{Block=Lycian}) (29)
  1637. \p{Lycian} \p{Script=Lycian} (Short: \p{Lyci}; NOT
  1638. \p{Block=Lycian}) (29)
  1639. \p{Lydi} \p{Lydian} (= \p{Script=Lydian}) (NOT
  1640. \p{Block=Lydian}) (27)
  1641. \p{Lydian} \p{Script=Lydian} (Short: \p{Lydi}; NOT
  1642. \p{Block=Lydian}) (27)
  1643. \p{M} \p{Mark} (= \p{General_Category=Mark})
  1644. (1498)
  1645. X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (48)
  1646. \p{Malayalam} \p{Script=Malayalam} (Short: \p{Mlym}; NOT
  1647. \p{Block=Malayalam}) (98)
  1648. \p{Mand} \p{Mandaic} (= \p{Script=Mandaic}) (NOT
  1649. \p{Block=Mandaic}) (29)
  1650. \p{Mandaic} \p{Script=Mandaic} (Short: \p{Mand}; NOT
  1651. \p{Block=Mandaic}) (29)
  1652. \p{Mark} \p{General_Category=Mark} (Short: \p{M})
  1653. (1498)
  1654. \p{Math} \p{Math=Y} (2165)
  1655. \p{Math: N*} (Single: \P{Math}) (1_111_947)
  1656. \p{Math: Y*} (Single: \p{Math}) (2165)
  1657. \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
  1658. \p{Sm}) (948)
  1659. X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
  1660. Mathematical_Alphanumeric_Symbols} (1024)
  1661. X \p{Mathematical_Operators} \p{Block=Mathematical_Operators} (256)
  1662. \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
  1663. Spacing_Mark}) (287)
  1664. \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
  1665. Enclosing_Mark}) (12)
  1666. \p{Meetei_Mayek} \p{Script=Meetei_Mayek} (Short: \p{Mtei};
  1667. NOT \p{Block=Meetei_Mayek}) (56)
  1668. X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
  1669. Miscellaneous_Mathematical_Symbols_A}
  1670. (48)
  1671. X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
  1672. Miscellaneous_Mathematical_Symbols_B}
  1673. (128)
  1674. X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  1675. X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
  1676. Miscellaneous_Symbols_And_Arrows} (256)
  1677. X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
  1678. Miscellaneous_Symbols_And_Pictographs}
  1679. (768)
  1680. X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical} (256)
  1681. \p{Mlym} \p{Malayalam} (= \p{Script=Malayalam})
  1682. (NOT \p{Block=Malayalam}) (98)
  1683. \p{Mn} \p{Nonspacing_Mark} (=
  1684. \p{General_Category=Nonspacing_Mark})
  1685. (1199)
  1686. \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
  1687. (Short: \p{Lm}) (210)
  1688. \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
  1689. (Short: \p{Sk}) (115)
  1690. X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  1691. \p{Mong} \p{Mongolian} (= \p{Script=Mongolian})
  1692. (NOT \p{Block=Mongolian}) (153)
  1693. \p{Mongolian} \p{Script=Mongolian} (Short: \p{Mong}; NOT
  1694. \p{Block=Mongolian}) (153)
  1695. \p{Mtei} \p{Meetei_Mayek} (= \p{Script=
  1696. Meetei_Mayek}) (NOT \p{Block=
  1697. Meetei_Mayek}) (56)
  1698. X \p{Musical_Symbols} \p{Block=Musical_Symbols} (256)
  1699. \p{Myanmar} \p{Script=Myanmar} (Short: \p{Mymr}; NOT
  1700. \p{Block=Myanmar}) (188)
  1701. X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (32)
  1702. \p{Mymr} \p{Myanmar} (= \p{Script=Myanmar}) (NOT
  1703. \p{Block=Myanmar}) (188)
  1704. \p{N} \p{Number} (= \p{General_Category=Number})
  1705. (1100)
  1706. \p{NChar} \p{Noncharacter_Code_Point} (=
  1707. \p{Noncharacter_Code_Point=Y}) (66)
  1708. \p{NChar: *} \p{Noncharacter_Code_Point: *}
  1709. \p{Nd} \p{Digit} (= \p{General_Category=
  1710. Decimal_Number}) (420)
  1711. \p{New_Tai_Lue} \p{Script=New_Tai_Lue} (Short: \p{Talu};
  1712. NOT \p{Block=New_Tai_Lue}) (83)
  1713. \p{NFC_QC: *} \p{NFC_Quick_Check: *}
  1714. \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (103)
  1715. \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (103)
  1716. \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
  1717. \P{NFC_Quick_Check} NOR \P{NFC_QC} NOR
  1718. \P{Is_NFC_Quick_Check} NOR
  1719. \P{Is_NFC_QC}) (1118)
  1720. \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
  1721. \P{NFC_Quick_Check} NOR \P{NFC_QC} NOR
  1722. \P{Is_NFC_Quick_Check} NOR
  1723. \P{Is_NFC_QC}) (1118)
  1724. \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
  1725. \p{NFC_Quick_Check} NOR \p{NFC_QC} NOR
  1726. \p{Is_NFC_Quick_Check} NOR
  1727. \p{Is_NFC_QC}) (1_112_891)
  1728. \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
  1729. \p{NFC_Quick_Check} NOR \p{NFC_QC} NOR
  1730. \p{Is_NFC_Quick_Check} NOR
  1731. \p{Is_NFC_QC}) (1_112_891)
  1732. \p{NFD_QC: *} \p{NFD_Quick_Check: *}
  1733. \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
  1734. \P{NFD_Quick_Check} NOR \P{NFD_QC} NOR
  1735. \P{Is_NFD_Quick_Check} NOR
  1736. \P{Is_NFD_QC}) (13_221)
  1737. \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
  1738. \P{NFD_Quick_Check} NOR \P{NFD_QC} NOR
  1739. \P{Is_NFD_Quick_Check} NOR
  1740. \P{Is_NFD_QC}) (13_221)
  1741. \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
  1742. \p{NFD_Quick_Check} NOR \p{NFD_QC} NOR
  1743. \p{Is_NFD_Quick_Check} NOR
  1744. \p{Is_NFD_QC}) (1_100_891)
  1745. \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
  1746. \p{NFD_Quick_Check} NOR \p{NFD_QC} NOR
  1747. \p{Is_NFD_Quick_Check} NOR
  1748. \p{Is_NFD_QC}) (1_100_891)
  1749. \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
  1750. \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (103)
  1751. \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (103)
  1752. \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
  1753. \P{NFKC_Quick_Check} NOR \P{NFKC_QC} NOR
  1754. \P{Is_NFKC_Quick_Check} NOR
  1755. \P{Is_NFKC_QC}) (4640)
  1756. \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
  1757. \P{NFKC_Quick_Check} NOR \P{NFKC_QC} NOR
  1758. \P{Is_NFKC_Quick_Check} NOR
  1759. \P{Is_NFKC_QC}) (4640)
  1760. \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
  1761. \p{NFKC_Quick_Check} NOR \p{NFKC_QC} NOR
  1762. \p{Is_NFKC_Quick_Check} NOR
  1763. \p{Is_NFKC_QC}) (1_109_369)
  1764. \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
  1765. \p{NFKC_Quick_Check} NOR \p{NFKC_QC} NOR
  1766. \p{Is_NFKC_Quick_Check} NOR
  1767. \p{Is_NFKC_QC}) (1_109_369)
  1768. \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
  1769. \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
  1770. \P{NFKD_Quick_Check} NOR \P{NFKD_QC} NOR
  1771. \P{Is_NFKD_Quick_Check} NOR
  1772. \P{Is_NFKD_QC}) (16_731)
  1773. \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
  1774. \P{NFKD_Quick_Check} NOR \P{NFKD_QC} NOR
  1775. \P{Is_NFKD_Quick_Check} NOR
  1776. \P{Is_NFKD_QC}) (16_731)
  1777. \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
  1778. \p{NFKD_Quick_Check} NOR \p{NFKD_QC} NOR
  1779. \p{Is_NFKD_Quick_Check} NOR
  1780. \p{Is_NFKD_QC}) (1_097_381)
  1781. \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
  1782. \p{NFKD_Quick_Check} NOR \p{NFKD_QC} NOR
  1783. \p{Is_NFKD_Quick_Check} NOR
  1784. \p{Is_NFKD_QC}) (1_097_381)
  1785. \p{Nko} \p{Script=Nko} (NOT \p{NKo}) (59)
  1786. \p{Nkoo} \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
  1787. (59)
  1788. \p{Nl} \p{Letter_Number} (=
  1789. \p{General_Category=Letter_Number}) (224)
  1790. \p{No} \p{Other_Number} (=
  1791. \p{General_Category=Other_Number}) (456)
  1792. X \p{No_Block} \p{Block=No_Block} (861_664)
  1793. \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
  1794. \p{NChar}) (66)
  1795. \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
  1796. (1_114_046)
  1797. \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
  1798. (66)
  1799. \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
  1800. (Short: \p{Mn}) (1199)
  1801. \p{Nt: *} \p{Numeric_Type: *}
  1802. \p{Number} \p{General_Category=Number} (Short: \p{N})
  1803. (1100)
  1804. X \p{Number_Forms} \p{Block=Number_Forms} (64)
  1805. \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (420)
  1806. \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (420)
  1807. \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
  1808. \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
  1809. \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_935)
  1810. \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (629)
  1811. \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (629)
  1812. T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
  1813. T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (56)
  1814. T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
  1815. T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
  1816. T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
  1817. T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (5)
  1818. T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
  1819. T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (2)
  1820. T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
  1821. T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (1)
  1822. T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (9)
  1823. T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (4)
  1824. T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
  1825. T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
  1826. T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (10)
  1827. T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
  1828. T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
  1829. T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (5)
  1830. T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (6)
  1831. T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
  1832. T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (2)
  1833. T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
  1834. T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (93)
  1835. T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
  1836. T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (96)
  1837. T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
  1838. T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (98)
  1839. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
  1840. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (89)
  1841. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
  1842. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (86)
  1843. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
  1844. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (78)
  1845. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
  1846. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (77)
  1847. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
  1848. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (73)
  1849. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
  1850. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (77)
  1851. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (39)
  1852. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
  1853. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
  1854. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
  1855. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
  1856. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
  1857. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
  1858. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
  1859. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
  1860. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
  1861. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (18)
  1862. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
  1863. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
  1864. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
  1865. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
  1866. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
  1867. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
  1868. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
  1869. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
  1870. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
  1871. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (10)
  1872. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
  1873. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
  1874. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
  1875. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
  1876. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
  1877. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
  1878. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
  1879. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
  1880. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
  1881. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (9)
  1882. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
  1883. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
  1884. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
  1885. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
  1886. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
  1887. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
  1888. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
  1889. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
  1890. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
  1891. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (19)
  1892. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (5)
  1893. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (5)
  1894. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (5)
  1895. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (6)
  1896. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (20)
  1897. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (2)
  1898. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (3)
  1899. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (2)
  1900. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (12)
  1901. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (2)
  1902. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (2)
  1903. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (2)
  1904. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (3)
  1905. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (17)
  1906. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
  1907. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
  1908. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
  1909. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
  1910. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
  1911. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
  1912. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
  1913. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
  1914. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
  1915. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
  1916. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
  1917. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
  1918. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
  1919. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
  1920. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
  1921. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
  1922. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
  1923. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
  1924. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  1925. (2)
  1926. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short:
  1927. \p{Nv=1000000000000}) (1)
  1928. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_935)
  1929. \p{Nv: *} \p{Numeric_Value: *}
  1930. D \p{OAlpha} \p{Other_Alphabetic} (=
  1931. \p{Other_Alphabetic=Y}) (795)
  1932. D \p{OAlpha: *} \p{Other_Alphabetic: *}
  1933. D \p{ODI} \p{Other_Default_Ignorable_Code_Point} (=
  1934. \p{Other_Default_Ignorable_Code_Point=Y})
  1935. (3778)
  1936. D \p{ODI: *} \p{Other_Default_Ignorable_Code_Point: *}
  1937. \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
  1938. \p{Block=Ogham}) (29)
  1939. \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
  1940. \p{Block=Ogham}) (29)
  1941. D \p{OGr_Ext} \p{Other_Grapheme_Extend} (=
  1942. \p{Other_Grapheme_Extend=Y}) (23)
  1943. D \p{OGr_Ext: *} \p{Other_Grapheme_Extend: *}
  1944. D \p{OIDC} \p{Other_ID_Continue} (=
  1945. \p{Other_ID_Continue=Y}) (12)
  1946. D \p{OIDC: *} \p{Other_ID_Continue: *}
  1947. D \p{OIDS} \p{Other_ID_Start} (=
  1948. \p{Other_ID_Start=Y}) (4)
  1949. D \p{OIDS: *} \p{Other_ID_Start: *}
  1950. \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
  1951. \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
  1952. \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
  1953. NOT \p{Block=Old_Italic}) (35)
  1954. \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
  1955. NOT \p{Block=Old_Persian}) (50)
  1956. \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
  1957. \p{Sarb}) (32)
  1958. \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
  1959. NOT \p{Block=Old_Turkic}) (73)
  1960. D \p{OLower} \p{Other_Lowercase} (=
  1961. \p{Other_Lowercase=Y}) (159)
  1962. D \p{OLower: *} \p{Other_Lowercase: *}
  1963. D \p{OMath} \p{Other_Math} (= \p{Other_Math=Y}) (1217)
  1964. D \p{OMath: *} \p{Other_Math: *}
  1965. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  1966. (Short: \p{Ps}) (72)
  1967. X \p{Optical_Character_Recognition}
  1968. \p{Block=Optical_Character_Recognition} (32)
  1969. \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
  1970. \p{Block=Oriya}) (90)
  1971. \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
  1972. (NOT \p{Block=Old_Turkic}) (73)
  1973. \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
  1974. \p{Block=Oriya}) (90)
  1975. \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
  1976. \p{Block=Osmanya}) (40)
  1977. \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
  1978. \p{Block=Osmanya}) (40)
  1979. \p{Other} \p{General_Category=Other} (Short: \p{C})
  1980. (1_004_868)
  1981. D \p{Other_Alphabetic} \p{Other_Alphabetic=Y} (Short: \p{OAlpha})
  1982. (795)
  1983. D \p{Other_Alphabetic: N*} Used by Unicode internally for generating
  1984. the Alphabetic property (which should be
  1985. used instead) and not intended to be
  1986. used stand-alone (Short: \p{OAlpha=N},
  1987. \P{OAlpha}) (1_113_317)
  1988. D \p{Other_Alphabetic: Y*} Used by Unicode internally for generating
  1989. the Alphabetic property (which should be
  1990. used instead) and not intended to be
  1991. used stand-alone (Short: \p{OAlpha=Y},
  1992. \p{OAlpha}) (795)
  1993. D \p{Other_Default_Ignorable_Code_Point}
  1994. \p{Other_Default_Ignorable_Code_Point=Y}
  1995. (Short: \p{ODI}) (3778)
  1996. D \p{Other_Default_Ignorable_Code_Point: N*} Used by Unicode
  1997. internally for generating the
  1998. Default_Ignorable_Code_Point property
  1999. (which should be used instead) and not
  2000. intended to be used stand-alone (Short:
  2001. \p{ODI=N}, \P{ODI}) (1_110_334)
  2002. D \p{Other_Default_Ignorable_Code_Point: Y*} Used by Unicode
  2003. internally for generating the
  2004. Default_Ignorable_Code_Point property
  2005. (which should be used instead) and not
  2006. intended to be used stand-alone (Short:
  2007. \p{ODI=Y}, \p{ODI}) (3778)
  2008. D \p{Other_Grapheme_Extend} \p{Other_Grapheme_Extend=Y} (Short:
  2009. \p{OGrExt}) (23)
  2010. D \p{Other_Grapheme_Extend: N*} Used by Unicode internally for
  2011. generating the Grapheme_Extend property
  2012. (which should be used instead) and not
  2013. intended to be used stand-alone (Short:
  2014. \p{OGrExt=N}, \P{OGrExt}) (1_114_089)
  2015. D \p{Other_Grapheme_Extend: Y*} Used by Unicode internally for
  2016. generating the Grapheme_Extend property
  2017. (which should be used instead) and not
  2018. intended to be used stand-alone (Short:
  2019. \p{OGrExt=Y}, \p{OGrExt}) (23)
  2020. D \p{Other_ID_Continue} \p{Other_ID_Continue=Y} (Short: \p{OIDC})
  2021. (12)
  2022. D \p{Other_ID_Continue: N*} Used by Unicode internally for
  2023. generating the ID_Continue property
  2024. (which should be used instead) and not
  2025. intended to be used stand-alone (Short:
  2026. \p{OIDC=N}, \P{OIDC}) (1_114_100)
  2027. D \p{Other_ID_Continue: Y*} Used by Unicode internally for
  2028. generating the ID_Continue property
  2029. (which should be used instead) and not
  2030. intended to be used stand-alone (Short:
  2031. \p{OIDC=Y}, \p{OIDC}) (12)
  2032. D \p{Other_ID_Start} \p{Other_ID_Start=Y} (Short: \p{OIDS}) (4)
  2033. D \p{Other_ID_Start: N*} Used by Unicode internally for generating
  2034. the ID_Start property (which should be
  2035. used instead) and not intended to be
  2036. used stand-alone (Short: \p{OIDS=N},
  2037. \P{OIDS}) (1_114_108)
  2038. D \p{Other_ID_Start: Y*} Used by Unicode internally for generating
  2039. the ID_Start property (which should be
  2040. used instead) and not intended to be
  2041. used stand-alone (Short: \p{OIDS=Y},
  2042. \p{OIDS}) (4)
  2043. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  2044. \p{Lo}) (97_084)
  2045. D \p{Other_Lowercase} \p{Other_Lowercase=Y} (Short: \p{OLower})
  2046. (159)
  2047. D \p{Other_Lowercase: N*} Used by Unicode internally for generating
  2048. the Lowercase property (which should be
  2049. used instead) and not intended to be
  2050. used stand-alone (Short: \p{OLower=N},
  2051. \P{OLower}) (1_113_953)
  2052. D \p{Other_Lowercase: Y*} Used by Unicode internally for generating
  2053. the Lowercase property (which should be
  2054. used instead) and not intended to be
  2055. used stand-alone (Short: \p{OLower=Y},
  2056. \p{OLower}) (159)
  2057. D \p{Other_Math} \p{Other_Math=Y} (Short: \p{OMath}) (1217)
  2058. D \p{Other_Math: N*} Used by Unicode internally for generating
  2059. the Math property (which should be used
  2060. instead) and not intended to be used
  2061. stand-alone (Short: \p{OMath=N},
  2062. \P{OMath}) (1_112_895)
  2063. D \p{Other_Math: Y*} Used by Unicode internally for generating
  2064. the Math property (which should be used
  2065. instead) and not intended to be used
  2066. stand-alone (Short: \p{OMath=Y},
  2067. \p{OMath}) (1217)
  2068. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  2069. \p{No}) (456)
  2070. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  2071. (Short: \p{Po}) (402)
  2072. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  2073. \p{So}) (4398)
  2074. D \p{Other_Uppercase} \p{Other_Uppercase=Y} (Short: \p{OUpper})
  2075. (42)
  2076. D \p{Other_Uppercase: N*} Used by Unicode internally for generating
  2077. the Uppercase property (which should be
  2078. used instead) and not intended to be
  2079. used stand-alone (Short: \p{OUpper=N},
  2080. \P{OUpper}) (1_114_070)
  2081. D \p{Other_Uppercase: Y*} Used by Unicode internally for generating
  2082. the Uppercase property (which should be
  2083. used instead) and not intended to be
  2084. used stand-alone (Short: \p{OUpper=Y},
  2085. \p{OUpper}) (42)
  2086. D \p{OUpper} \p{Other_Uppercase} (=
  2087. \p{Other_Uppercase=Y}) (42)
  2088. D \p{OUpper: *} \p{Other_Uppercase: *}
  2089. \p{P} \p{Punct} (=
  2090. \p{General_Category=Punctuation}) (598)
  2091. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  2092. (Short: \p{Zp}) (1)
  2093. \p{Pat_Syn} \p{Pattern_Syntax} (=
  2094. \p{Pattern_Syntax=Y}) (2760)
  2095. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  2096. \p{Pat_WS} \p{Pattern_White_Space} (=
  2097. \p{Pattern_White_Space=Y}) (11)
  2098. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  2099. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  2100. (2760)
  2101. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  2102. (1_111_352)
  2103. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
  2104. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  2105. \p{PatWS}) (11)
  2106. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  2107. (1_114_101)
  2108. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
  2109. \p{Pc} \p{Connector_Punctuation} (=
  2110. \p{General_Category=Connector_Punctuation})
  2111. (10)
  2112. \p{Pd} \p{Dash_Punctuation} (=
  2113. \p{General_Category=Dash_Punctuation})
  2114. (21)
  2115. \p{Pe} \p{Close_Punctuation} (=
  2116. \p{General_Category=Close_Punctuation})
  2117. (71)
  2118. \p{PerlSpace} \s, restricted to ASCII (5)
  2119. \p{PerlWord} \w, restricted to ASCII = [A-Za-z0-9_] (63)
  2120. \p{Pf} \p{Final_Punctuation} (=
  2121. \p{General_Category=Final_Punctuation})
  2122. (10)
  2123. \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
  2124. \p{Block=Phags_Pa}) (56)
  2125. \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
  2126. \p{Block=Phags_Pa}) (56)
  2127. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (48)
  2128. \p{Phli} \p{Inscriptional_Pahlavi} (=
  2129. \p{Script=Inscriptional_Pahlavi}) (NOT
  2130. \p{Block=Inscriptional_Pahlavi}) (27)
  2131. \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
  2132. (NOT \p{Block=Phoenician}) (29)
  2133. \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
  2134. NOT \p{Block=Phoenician}) (29)
  2135. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (128)
  2136. X \p{Phonetic_Extensions_Supplement}
  2137. \p{Block=Phonetic_Extensions_Supplement} (64)
  2138. \p{Pi} \p{Initial_Punctuation} (=
  2139. \p{General_Category=Initial_Punctuation})
  2140. (12)
  2141. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  2142. \p{Po} \p{Other_Punctuation} (=
  2143. \p{General_Category=Other_Punctuation})
  2144. (402)
  2145. \p{PosixAlnum} [A-Za-z0-9] (62)
  2146. \p{PosixAlpha} [A-Za-z] (52)
  2147. \p{PosixBlank} \t and ' ' (2)
  2148. \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
  2149. ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
  2150. FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
  2151. NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
  2152. GS, RS, US, and DEL (33)
  2153. \p{PosixDigit} [0-9] (10)
  2154. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
  2155. (94)
  2156. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  2157. \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  2158. (95)
  2159. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  2160. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  2161. vertical tab) (6)
  2162. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  2163. \p{PosixWord} \p{PerlWord} (63)
  2164. \p{PosixXDigit} [0-9A-Fa-f] (22)
  2165. T \p{Present_In: 1.1} \p{Age=1.1} (Short: \p{In=1.1}) (Perl
  2166. extension) (33_979)
  2167. T \p{Present_In: 2.0} Code point's usage introduced in version
  2168. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  2169. extension) (178_500)
  2170. T \p{Present_In: 2.1} Code point's usage introduced in version
  2171. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  2172. extension) (178_502)
  2173. T \p{Present_In: 3.0} Code point's usage introduced in version
  2174. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  2175. extension) (188_809)
  2176. T \p{Present_In: 3.1} Code point's usage introduced in version
  2177. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  2178. extension) (233_787)
  2179. T \p{Present_In: 3.2} Code point's usage introduced in version
  2180. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  2181. extension) (234_803)
  2182. T \p{Present_In: 4.0} Code point's usage introduced in version
  2183. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  2184. extension) (236_029)
  2185. T \p{Present_In: 4.1} Code point's usage introduced in version
  2186. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  2187. extension) (237_302)
  2188. T \p{Present_In: 5.0} Code point's usage introduced in version
  2189. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  2190. extension) (238_671)
  2191. T \p{Present_In: 5.1} Code point's usage introduced in version
  2192. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  2193. extension) (240_295)
  2194. T \p{Present_In: 5.2} Code point's usage introduced in version
  2195. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  2196. extension) (246_943)
  2197. T \p{Present_In: 6.0} Code point's usage introduced in version
  2198. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  2199. extension) (249_031)
  2200. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short:
  2201. \p{In=Unassigned}) (Perl extension) (865_081)
  2202. \p{Print} Characters that are graphical plus space
  2203. characters (but no controls) (246_850)
  2204. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  2205. \p{Co}; NOT \p{Private_Use_Area})
  2206. (137_468)
  2207. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  2208. \p{InPrivateUse}) (6400)
  2209. \p{Prti} \p{Inscriptional_Parthian} (=
  2210. \p{Script=Inscriptional_Parthian}) (NOT
  2211. \p{Block=Inscriptional_Parthian}) (30)
  2212. \p{Ps} \p{Open_Punctuation} (=
  2213. \p{General_Category=Open_Punctuation})
  2214. (72)
  2215. \p{Punct} \p{General_Category=Punctuation} (Short:
  2216. \p{P}) (598)
  2217. \p{Punctuation} \p{Punct} (=
  2218. \p{General_Category=Punctuation}) (598)
  2219. \p{Qaac} \p{Coptic} (= \p{Script=Coptic}) (NOT
  2220. \p{Block=Coptic}) (135)
  2221. \p{Qaai} \p{Inherited} (= \p{Script=Inherited})
  2222. (523)
  2223. \p{QMark} \p{Quotation_Mark} (=
  2224. \p{Quotation_Mark=Y}) (29)
  2225. \p{QMark: *} \p{Quotation_Mark: *}
  2226. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  2227. (29)
  2228. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_083)
  2229. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (29)
  2230. \p{Radical} \p{Radical=Y} (329)
  2231. \p{Radical: N*} (Single: \P{Radical}) (1_113_783)
  2232. \p{Radical: Y*} (Single: \p{Radical}) (329)
  2233. \p{Rejang} \p{Script=Rejang} (Short: \p{Rjng}; NOT
  2234. \p{Block=Rejang}) (37)
  2235. \p{Rjng} \p{Rejang} (= \p{Script=Rejang}) (NOT
  2236. \p{Block=Rejang}) (37)
  2237. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (32)
  2238. \p{Runic} \p{Script=Runic} (Short: \p{Runr}; NOT
  2239. \p{Block=Runic}) (78)
  2240. \p{Runr} \p{Runic} (= \p{Script=Runic}) (NOT
  2241. \p{Block=Runic}) (78)
  2242. \p{S} \p{Symbol} (= \p{General_Category=Symbol})
  2243. (5508)
  2244. \p{Samaritan} \p{Script=Samaritan} (Short: \p{Samr}; NOT
  2245. \p{Block=Samaritan}) (61)
  2246. \p{Samr} \p{Samaritan} (= \p{Script=Samaritan})
  2247. (NOT \p{Block=Samaritan}) (61)
  2248. \p{Sarb} \p{Old_South_Arabian} (=
  2249. \p{Script=Old_South_Arabian}) (32)
  2250. \p{Saur} \p{Saurashtra} (= \p{Script=Saurashtra})
  2251. (NOT \p{Block=Saurashtra}) (81)
  2252. \p{Saurashtra} \p{Script=Saurashtra} (Short: \p{Saur};
  2253. NOT \p{Block=Saurashtra}) (81)
  2254. \p{SB: *} \p{Sentence_Break: *}
  2255. \p{Sc} \p{Currency_Symbol} (=
  2256. \p{General_Category=Currency_Symbol})
  2257. (47)
  2258. \p{Sc: *} \p{Script: *}
  2259. \p{Script: Arab} \p{Script=Arabic} (1051)
  2260. \p{Script: Arabic} (Short: \p{Sc=Arab}, \p{Arab}) (1051)
  2261. \p{Script: Armenian} (Short: \p{Sc=Armn}, \p{Armn}) (90)
  2262. \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
  2263. \p{Script: Armn} \p{Script=Armenian} (90)
  2264. \p{Script: Avestan} (Short: \p{Sc=Avst}, \p{Avst}) (61)
  2265. \p{Script: Avst} \p{Script=Avestan} (61)
  2266. \p{Script: Bali} \p{Script=Balinese} (121)
  2267. \p{Script: Balinese} (Short: \p{Sc=Bali}, \p{Bali}) (121)
  2268. \p{Script: Bamu} \p{Script=Bamum} (657)
  2269. \p{Script: Bamum} (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
  2270. \p{Script: Batak} (Short: \p{Sc=Batk}, \p{Batk}) (56)
  2271. \p{Script: Batk} \p{Script=Batak} (56)
  2272. \p{Script: Beng} \p{Script=Bengali} (92)
  2273. \p{Script: Bengali} (Short: \p{Sc=Beng}, \p{Beng}) (92)
  2274. \p{Script: Bopo} \p{Script=Bopomofo} (70)
  2275. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
  2276. \p{Script: Brah} \p{Script=Brahmi} (108)
  2277. \p{Script: Brahmi} (Short: \p{Sc=Brah}, \p{Brah}) (108)
  2278. \p{Script: Brai} \p{Script=Braille} (256)
  2279. \p{Script: Braille} (Short: \p{Sc=Brai}, \p{Brai}) (256)
  2280. \p{Script: Bugi} \p{Script=Buginese} (30)
  2281. \p{Script: Buginese} (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
  2282. \p{Script: Buhd} \p{Script=Buhid} (20)
  2283. \p{Script: Buhid} (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
  2284. \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
  2285. (710)
  2286. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
  2287. \p{Script: Cari} \p{Script=Carian} (49)
  2288. \p{Script: Carian} (Short: \p{Sc=Cari}, \p{Cari}) (49)
  2289. \p{Script: Cham} (Short: \p{Sc=Cham}, \p{Cham}) (83)
  2290. \p{Script: Cher} \p{Script=Cherokee} (85)
  2291. \p{Script: Cherokee} (Short: \p{Sc=Cher}, \p{Cher}) (85)
  2292. \p{Script: Common} (Short: \p{Sc=Zyyy}, \p{Zyyy}) (6379)
  2293. \p{Script: Copt} \p{Script=Coptic} (135)
  2294. \p{Script: Coptic} (Short: \p{Sc=Copt}, \p{Copt}) (135)
  2295. \p{Script: Cprt} \p{Script=Cypriot} (55)
  2296. \p{Script: Cuneiform} (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
  2297. \p{Script: Cypriot} (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
  2298. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}, \p{Cyrl}) (408)
  2299. \p{Script: Cyrl} \p{Script=Cyrillic} (408)
  2300. \p{Script: Deseret} (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
  2301. \p{Script: Deva} \p{Script=Devanagari} (150)
  2302. \p{Script: Devanagari} (Short: \p{Sc=Deva}, \p{Deva}) (150)
  2303. \p{Script: Dsrt} \p{Script=Deseret} (80)
  2304. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
  2305. \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
  2306. (1071)
  2307. \p{Script: Ethi} \p{Script=Ethiopic} (495)
  2308. \p{Script: Ethiopic} (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
  2309. \p{Script: Geor} \p{Script=Georgian} (120)
  2310. \p{Script: Georgian} (Short: \p{Sc=Geor}, \p{Geor}) (120)
  2311. \p{Script: Glag} \p{Script=Glagolitic} (94)
  2312. \p{Script: Glagolitic} (Short: \p{Sc=Glag}, \p{Glag}) (94)
  2313. \p{Script: Goth} \p{Script=Gothic} (27)
  2314. \p{Script: Gothic} (Short: \p{Sc=Goth}, \p{Goth}) (27)
  2315. \p{Script: Greek} (Short: \p{Sc=Grek}, \p{Grek}) (511)
  2316. \p{Script: Grek} \p{Script=Greek} (511)
  2317. \p{Script: Gujarati} (Short: \p{Sc=Gujr}, \p{Gujr}) (83)
  2318. \p{Script: Gujr} \p{Script=Gujarati} (83)
  2319. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}, \p{Guru}) (79)
  2320. \p{Script: Guru} \p{Script=Gurmukhi} (79)
  2321. \p{Script: Han} (Short: \p{Sc=Han}, \p{Han}) (75_960)
  2322. \p{Script: Hang} \p{Script=Hangul} (11_739)
  2323. \p{Script: Hangul} (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
  2324. \p{Script: Hani} \p{Script=Han} (75_960)
  2325. \p{Script: Hano} \p{Script=Hanunoo} (21)
  2326. \p{Script: Hanunoo} (Short: \p{Sc=Hano}, \p{Hano}) (21)
  2327. \p{Script: Hebr} \p{Script=Hebrew} (133)
  2328. \p{Script: Hebrew} (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
  2329. \p{Script: Hira} \p{Script=Hiragana} (91)
  2330. \p{Script: Hiragana} (Short: \p{Sc=Hira}, \p{Hira}) (91)
  2331. \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
  2332. \p{Script: Inherited} (Short: \p{Sc=Zinh}, \p{Zinh}) (523)
  2333. \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
  2334. (27)
  2335. \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
  2336. (30)
  2337. \p{Script: Ital} \p{Script=Old_Italic} (35)
  2338. \p{Script: Java} \p{Script=Javanese} (91)
  2339. \p{Script: Javanese} (Short: \p{Sc=Java}, \p{Java}) (91)
  2340. \p{Script: Kaithi} (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
  2341. \p{Script: Kali} \p{Script=Kayah_Li} (48)
  2342. \p{Script: Kana} \p{Script=Katakana} (300)
  2343. \p{Script: Kannada} (Short: \p{Sc=Knda}, \p{Knda}) (86)
  2344. \p{Script: Katakana} (Short: \p{Sc=Kana}, \p{Kana}) (300)
  2345. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}, \p{Kali}) (48)
  2346. \p{Script: Khar} \p{Script=Kharoshthi} (65)
  2347. \p{Script: Kharoshthi} (Short: \p{Sc=Khar}, \p{Khar}) (65)
  2348. \p{Script: Khmer} (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
  2349. \p{Script: Khmr} \p{Script=Khmer} (146)
  2350. \p{Script: Knda} \p{Script=Kannada} (86)
  2351. \p{Script: Kthi} \p{Script=Kaithi} (66)
  2352. \p{Script: Lana} \p{Script=Tai_Tham} (127)
  2353. \p{Script: Lao} (Short: \p{Sc=Lao}, \p{Lao}) (65)
  2354. \p{Script: Laoo} \p{Script=Lao} (65)
  2355. \p{Script: Latin} (Short: \p{Sc=Latn}, \p{Latn}) (1267)
  2356. \p{Script: Latn} \p{Script=Latin} (1267)
  2357. \p{Script: Lepc} \p{Script=Lepcha} (74)
  2358. \p{Script: Lepcha} (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
  2359. \p{Script: Limb} \p{Script=Limbu} (66)
  2360. \p{Script: Limbu} (Short: \p{Sc=Limb}, \p{Limb}) (66)
  2361. \p{Script: Linb} \p{Script=Linear_B} (211)
  2362. \p{Script: Linear_B} (Short: \p{Sc=Linb}, \p{Linb}) (211)
  2363. \p{Script: Lisu} (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
  2364. \p{Script: Lyci} \p{Script=Lycian} (29)
  2365. \p{Script: Lycian} (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
  2366. \p{Script: Lydi} \p{Script=Lydian} (27)
  2367. \p{Script: Lydian} (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
  2368. \p{Script: Malayalam} (Short: \p{Sc=Mlym}, \p{Mlym}) (98)
  2369. \p{Script: Mand} \p{Script=Mandaic} (29)
  2370. \p{Script: Mandaic} (Short: \p{Sc=Mand}, \p{Mand}) (29)
  2371. \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (56)
  2372. \p{Script: Mlym} \p{Script=Malayalam} (98)
  2373. \p{Script: Mong} \p{Script=Mongolian} (153)
  2374. \p{Script: Mongolian} (Short: \p{Sc=Mong}, \p{Mong}) (153)
  2375. \p{Script: Mtei} \p{Script=Meetei_Mayek} (56)
  2376. \p{Script: Myanmar} (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
  2377. \p{Script: Mymr} \p{Script=Myanmar} (188)
  2378. \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
  2379. \p{Script: Nko} (Short: \p{Sc=Nko}, \p{Nko}) (59)
  2380. \p{Script: Nkoo} \p{Script=Nko} (59)
  2381. \p{Script: Ogam} \p{Script=Ogham} (29)
  2382. \p{Script: Ogham} (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
  2383. \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}, \p{Olck}) (48)
  2384. \p{Script: Olck} \p{Script=Ol_Chiki} (48)
  2385. \p{Script: Old_Italic} (Short: \p{Sc=Ital}, \p{Ital}) (35)
  2386. \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
  2387. \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
  2388. \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
  2389. \p{Script: Oriya} (Short: \p{Sc=Orya}, \p{Orya}) (90)
  2390. \p{Script: Orkh} \p{Script=Old_Turkic} (73)
  2391. \p{Script: Orya} \p{Script=Oriya} (90)
  2392. \p{Script: Osma} \p{Script=Osmanya} (40)
  2393. \p{Script: Osmanya} (Short: \p{Sc=Osma}, \p{Osma}) (40)
  2394. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  2395. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}, \p{Phag}) (56)
  2396. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
  2397. \p{Script: Phnx} \p{Script=Phoenician} (29)
  2398. \p{Script: Phoenician} (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
  2399. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
  2400. \p{Script: Qaac} \p{Script=Coptic} (135)
  2401. \p{Script: Qaai} \p{Script=Inherited} (523)
  2402. \p{Script: Rejang} (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
  2403. \p{Script: Rjng} \p{Script=Rejang} (37)
  2404. \p{Script: Runic} (Short: \p{Sc=Runr}, \p{Runr}) (78)
  2405. \p{Script: Runr} \p{Script=Runic} (78)
  2406. \p{Script: Samaritan} (Short: \p{Sc=Samr}, \p{Samr}) (61)
  2407. \p{Script: Samr} \p{Script=Samaritan} (61)
  2408. \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
  2409. \p{Script: Saur} \p{Script=Saurashtra} (81)
  2410. \p{Script: Saurashtra} (Short: \p{Sc=Saur}, \p{Saur}) (81)
  2411. \p{Script: Shavian} (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
  2412. \p{Script: Shaw} \p{Script=Shavian} (48)
  2413. \p{Script: Sinh} \p{Script=Sinhala} (80)
  2414. \p{Script: Sinhala} (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
  2415. \p{Script: Sund} \p{Script=Sundanese} (55)
  2416. \p{Script: Sundanese} (Short: \p{Sc=Sund}, \p{Sund}) (55)
  2417. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  2418. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
  2419. \p{Script: Syrc} \p{Script=Syriac} (77)
  2420. \p{Script: Syriac} (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
  2421. \p{Script: Tagalog} (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
  2422. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  2423. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
  2424. \p{Script: Tai_Le} (Short: \p{Sc=Tale}, \p{Tale}) (35)
  2425. \p{Script: Tai_Tham} (Short: \p{Sc=Lana}, \p{Lana}) (127)
  2426. \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
  2427. \p{Script: Tale} \p{Script=Tai_Le} (35)
  2428. \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
  2429. \p{Script: Tamil} (Short: \p{Sc=Taml}, \p{Taml}) (72)
  2430. \p{Script: Taml} \p{Script=Tamil} (72)
  2431. \p{Script: Tavt} \p{Script=Tai_Viet} (72)
  2432. \p{Script: Telu} \p{Script=Telugu} (93)
  2433. \p{Script: Telugu} (Short: \p{Sc=Telu}, \p{Telu}) (93)
  2434. \p{Script: Tfng} \p{Script=Tifinagh} (57)
  2435. \p{Script: Tglg} \p{Script=Tagalog} (20)
  2436. \p{Script: Thaa} \p{Script=Thaana} (50)
  2437. \p{Script: Thaana} (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
  2438. \p{Script: Thai} (Short: \p{Sc=Thai}, \p{Thai}) (86)
  2439. \p{Script: Tibetan} (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
  2440. \p{Script: Tibt} \p{Script=Tibetan} (207)
  2441. \p{Script: Tifinagh} (Short: \p{Sc=Tfng}, \p{Tfng}) (57)
  2442. \p{Script: Ugar} \p{Script=Ugaritic} (31)
  2443. \p{Script: Ugaritic} (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
  2444. \p{Script: Unknown} (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_004_663)
  2445. \p{Script: Vai} (Short: \p{Sc=Vai}, \p{Vai}) (300)
  2446. \p{Script: Vaii} \p{Script=Vai} (300)
  2447. \p{Script: Xpeo} \p{Script=Old_Persian} (50)
  2448. \p{Script: Xsux} \p{Script=Cuneiform} (982)
  2449. \p{Script: Yi} (Short: \p{Sc=Yi}, \p{Yi}) (1220)
  2450. \p{Script: Yiii} \p{Script=Yi} (1220)
  2451. \p{Script: Zinh} \p{Script=Inherited} (523)
  2452. \p{Script: Zyyy} \p{Script=Common} (6379)
  2453. \p{Script: Zzzz} \p{Script=Unknown} (1_004_663)
  2454. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  2455. \p{SD: *} \p{Soft_Dotted: *}
  2456. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  2457. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  2458. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (177)
  2459. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (177)
  2460. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  2461. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1502)
  2462. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1502)
  2463. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (138)
  2464. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (138)
  2465. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (97_369)
  2466. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  2467. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (1917)
  2468. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1917)
  2469. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (412)
  2470. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (412)
  2471. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (97_369)
  2472. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_010_959)
  2473. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  2474. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  2475. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  2476. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  2477. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (21)
  2478. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (73)
  2479. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (73)
  2480. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1509)
  2481. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1509)
  2482. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_010_959)
  2483. \p{Separator} \p{General_Category=Separator} (Short:
  2484. \p{Z}) (20)
  2485. \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
  2486. \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
  2487. \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
  2488. \p{Block=Sinhala}) (80)
  2489. \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
  2490. \p{Block=Sinhala}) (80)
  2491. \p{Sk} \p{Modifier_Symbol} (=
  2492. \p{General_Category=Modifier_Symbol})
  2493. (115)
  2494. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  2495. Math_Symbol}) (948)
  2496. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (32)
  2497. \p{So} \p{Other_Symbol} (= \p{General_Category=
  2498. Other_Symbol}) (4398)
  2499. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  2500. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066)
  2501. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  2502. \p{Space} \p{White_Space=Y} \s including beyond
  2503. ASCII plus vertical tab (26)
  2504. \p{Space: *} \p{White_Space: *}
  2505. \p{Space_Separator} \p{General_Category=Space_Separator}
  2506. (Short: \p{Zs}) (18)
  2507. \p{SpacePerl} \p{XPerlSpace} (25)
  2508. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  2509. \p{Mc}) (287)
  2510. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  2511. (80)
  2512. X \p{Specials} \p{Block=Specials} (16)
  2513. \p{STerm} \p{STerm=Y} (76)
  2514. \p{STerm: N*} (Single: \P{STerm}) (1_114_036)
  2515. \p{STerm: Y*} (Single: \p{STerm}) (76)
  2516. \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
  2517. (NOT \p{Block=Sundanese}) (55)
  2518. \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
  2519. \p{Block=Sundanese}) (55)
  2520. X \p{Superscripts_And_Subscripts} \p{Block=
  2521. Superscripts_And_Subscripts} (48)
  2522. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  2523. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  2524. X \p{Supplemental_Mathematical_Operators} \p{Block=
  2525. Supplemental_Mathematical_Operators}
  2526. (256)
  2527. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  2528. (128)
  2529. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  2530. Supplementary_Private_Use_Area_A}
  2531. (65_536)
  2532. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  2533. Supplementary_Private_Use_Area_B}
  2534. (65_536)
  2535. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  2536. \p{Cs}) (2048)
  2537. \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
  2538. Syloti_Nagri}) (NOT \p{Block=
  2539. Syloti_Nagri}) (44)
  2540. \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
  2541. NOT \p{Block=Syloti_Nagri}) (44)
  2542. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  2543. (5508)
  2544. \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
  2545. \p{Block=Syriac}) (77)
  2546. \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
  2547. \p{Block=Syriac}) (77)
  2548. \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
  2549. \p{Block=Tagalog}) (20)
  2550. \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
  2551. \p{Block=Tagbanwa}) (18)
  2552. \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
  2553. \p{Block=Tagbanwa}) (18)
  2554. X \p{Tags} \p{Block=Tags} (128)
  2555. \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
  2556. \p{Block=Tai_Le}) (35)
  2557. \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
  2558. \p{Block=Tai_Tham}) (127)
  2559. \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
  2560. \p{Block=Tai_Viet}) (72)
  2561. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  2562. \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
  2563. \p{Block=Tai_Le}) (35)
  2564. \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
  2565. (NOT \p{Block=New_Tai_Lue}) (83)
  2566. \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
  2567. \p{Block=Tamil}) (72)
  2568. \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
  2569. \p{Block=Tamil}) (72)
  2570. \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
  2571. \p{Block=Tai_Viet}) (72)
  2572. \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
  2573. \p{Block=Telugu}) (93)
  2574. \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
  2575. \p{Block=Telugu}) (93)
  2576. \p{Term} \p{Terminal_Punctuation} (=
  2577. \p{Terminal_Punctuation=Y}) (169)
  2578. \p{Term: *} \p{Terminal_Punctuation: *}
  2579. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  2580. \p{Term}) (169)
  2581. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  2582. (1_113_943)
  2583. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (169)
  2584. \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
  2585. \p{Block=Tifinagh}) (57)
  2586. \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
  2587. \p{Block=Tagalog}) (20)
  2588. \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
  2589. \p{Block=Thaana}) (50)
  2590. \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
  2591. \p{Block=Thaana}) (50)
  2592. \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
  2593. \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
  2594. \p{Block=Tibetan}) (207)
  2595. \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
  2596. \p{Block=Tibetan}) (207)
  2597. \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
  2598. \p{Block=Tifinagh}) (57)
  2599. \p{Title} (/i= Cased=Yes) (31)
  2600. \p{Titlecase} \p{Title} (/i= Cased=Yes) (31)
  2601. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  2602. (Short: \p{Lt}; /i= General_Category=
  2603. Cased_Letter) (31)
  2604. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  2605. (128)
  2606. \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
  2607. \p{Block=Ugaritic}) (31)
  2608. \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
  2609. \p{Block=Ugaritic}) (31)
  2610. \p{UIdeo} \p{Unified_Ideograph} (=
  2611. \p{Unified_Ideograph=Y}) (74_616)
  2612. \p{UIdeo: *} \p{Unified_Ideograph: *}
  2613. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  2614. \p{Cn}) (865_147)
  2615. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  2616. Unified_Canadian_Aboriginal_Syllabics}
  2617. (Short: \p{InCanadianSyllabics}) (640)
  2618. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  2619. Unified_Canadian_Aboriginal_Syllabics_-
  2620. Extended} (80)
  2621. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  2622. (74_616)
  2623. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  2624. (1_039_496)
  2625. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_616)
  2626. \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
  2627. (1_004_663)
  2628. \p{Upper} \p{Uppercase=Y} (/i= Cased=Yes) (1478)
  2629. \p{Upper: *} \p{Uppercase: *}
  2630. \p{Uppercase} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  2631. Yes) (1478)
  2632. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  2633. No) (1_112_634)
  2634. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  2635. Yes) (1478)
  2636. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  2637. (Short: \p{Lu}; /i= General_Category=
  2638. Cased_Letter) (1436)
  2639. \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
  2640. \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
  2641. Vai}) (300)
  2642. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS})
  2643. (259)
  2644. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853)
  2645. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  2646. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (16)
  2647. X \p{Variation_Selectors_Supplement} \p{Block=
  2648. Variation_Selectors_Supplement} (240)
  2649. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (48)
  2650. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  2651. \p{VertSpace} \v (7)
  2652. \p{VS} \p{Variation_Selector} (=
  2653. \p{Variation_Selector=Y}) (259)
  2654. \p{VS: *} \p{Variation_Selector: *}
  2655. \p{WB: *} \p{Word_Break: *}
  2656. \p{White_Space} \p{White_Space=Y} (Short: \p{WSpace}) (26)
  2657. \p{White_Space: N*} (Short: \p{Space=N}, \P{WSpace})
  2658. (1_114_086)
  2659. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{WSpace}) (26)
  2660. \p{Word} \w, including beyond ASCII; = \p{Alnum} +
  2661. \pM + \p{Pc} (102_724)
  2662. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (24_453)
  2663. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  2664. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
  2665. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1502)
  2666. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
  2667. \p{Word_Break: FO} \p{Word_Break=Format} (137)
  2668. \p{Word_Break: Format} (Short: \p{WB=FO}) (137)
  2669. \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  2670. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  2671. \p{Word_Break: LE} \p{Word_Break=ALetter} (24_453)
  2672. \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  2673. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (8)
  2674. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8)
  2675. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  2676. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (8)
  2677. \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
  2678. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  2679. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  2680. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  2681. \p{Word_Break: NU} \p{Word_Break=Numeric} (411)
  2682. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (411)
  2683. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_087_251)
  2684. \p{Word_Break: XX} \p{Word_Break=Other} (1_087_251)
  2685. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (26)
  2686. \p{WSpace: *} \p{White_Space: *}
  2687. \p{XDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  2688. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  2689. (102_656)
  2690. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_011_456)
  2691. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (102_656)
  2692. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (100_724)
  2693. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_013_388)
  2694. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (100_724)
  2695. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  2696. (102_656)
  2697. \p{XIDC: *} \p{XID_Continue: *}
  2698. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (100_724)
  2699. \p{XIDS: *} \p{XID_Start: *}
  2700. \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
  2701. (NOT \p{Block=Old_Persian}) (50)
  2702. \p{XPerlSpace} \s, including beyond ASCII (Short:
  2703. \p{SpacePerl}) (25)
  2704. \p{XPosixAlnum} \p{Alnum} (101_959)
  2705. \p{XPosixAlpha} \p{Alpha} (= \p{Alphabetic=Y}) (101_539)
  2706. \p{XPosixBlank} \p{Blank} (19)
  2707. \p{XPosixCntrl} \p{Cntrl} (= \p{General_Category=Control})
  2708. (65)
  2709. \p{XPosixDigit} \p{Digit} (= \p{General_Category=
  2710. Decimal_Number}) (420)
  2711. \p{XPosixGraph} \p{Graph} (246_832)
  2712. \p{XPosixLower} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
  2713. Yes) (1918)
  2714. \p{XPosixPrint} \p{Print} (246_850)
  2715. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (607)
  2716. \p{XPosixSpace} \p{Space} (= \p{White_Space=Y}) (26)
  2717. \p{XPosixUpper} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  2718. Yes) (1478)
  2719. \p{XPosixWord} \p{Word} (102_724)
  2720. \p{XPosixXDigit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  2721. \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
  2722. (NOT \p{Block=Cuneiform}) (982)
  2723. \p{Yi} \p{Script=Yi} (1220)
  2724. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  2725. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  2726. \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
  2727. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols} (64)
  2728. \p{Z} \p{Separator} (= \p{General_Category=
  2729. Separator}) (20)
  2730. \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
  2731. (523)
  2732. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  2733. Line_Separator}) (1)
  2734. \p{Zp} \p{Paragraph_Separator} (=
  2735. \p{General_Category=
  2736. Paragraph_Separator}) (1)
  2737. \p{Zs} \p{Space_Separator} (=
  2738. \p{General_Category=Space_Separator})
  2739. (18)
  2740. \p{Zyyy} \p{Common} (= \p{Script=Common}) (6379)
  2741. \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
  2742. (1_004_663)
  2743. T \p{_CanonDCIJ} (For internal use by Perl, not necessarily
  2744. stable) (= \p{Soft_Dotted=Y}) (46)
  2745. T \p{_Case_Ignorable} (For internal use by Perl, not necessarily
  2746. stable) (= \p{Case_Ignorable=Y}) (1692)
  2747. T \p{_CombAbove} (For internal use by Perl, not necessarily
  2748. stable) (= \p{Canonical_Combining_Class=
  2749. Above}) (320)
  2750. T \p{_X_Begin} (For internal use by Perl, not necessarily
  2751. stable) (1_113_907)
  2752. T \p{_X_Extend} (For internal use by Perl, not necessarily
  2753. stable) (1509)
  2754. T \p{_X_LV_LVT_V} (For internal use by Perl, not necessarily
  2755. stable) (11_267)
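
As a brief illustration of how entries in the table above are used, the following sketch exercises a few of them. The sample characters (U+4E2D and U+0E3F) are chosen here for illustration and do not come from the table. Later Perl releases are understood to resolve single-form script names such as \p{Thai} against Script_Extensions rather than Script, so the sketch spells out the compound forms wherever that distinction could matter.

```perl
use strict;
use warnings;

# Sample characters picked for illustration (not taken from the table).
my $han  = "\x{4E2D}";   # a CJK unified ideograph
my $baht = "\x{0E3F}";   # THAI CURRENCY SYMBOL BAHT

# Compound forms: a colon and an equals sign are interchangeable, and
# long and short property/value names select the same characters.
my $han_colon  = $han =~ /\p{Script: Han}/ ? 1 : 0;
my $han_equals = $han =~ /\p{Script=Han}/  ? 1 : 0;
my $han_short  = $han =~ /\p{Sc=Hani}/     ? 1 : 0;

# \P{...} is the complement of \p{...}.
my $han_negated = $han =~ /\P{Script=Han}/ ? 1 : 0;

# Script and Block are different properties: the baht sign lies in the
# Thai block, but its Script is Common. This is what the
# "NOT \p{Block=...}" remarks in the table warn about.
my $baht_in_script = $baht =~ /\p{Script=Thai}/ ? 1 : 0;
my $baht_in_block  = $baht =~ /\p{Block=Thai}/  ? 1 : 0;

# Entries marked "/i= Cased=Yes" match more under the /i modifier.
my $a_plain = "a" =~ /\p{Upper}/  ? 1 : 0;
my $a_fold  = "a" =~ /\p{Upper}/i ? 1 : 0;

print "Han (colon/equals/short/negated): ",
      "$han_colon $han_equals $han_short $han_negated\n";
print "Baht in Script=Thai: $baht_in_script, in Block=Thai: $baht_in_block\n";
print "'a' vs \\p{Upper}: plain $a_plain, with /i $a_fold\n";
```

Writing the property name explicitly, as in \p{Script=Thai} or \p{Block=Thai}, avoids any doubt about which property a single form refers to.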

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This generally happens either because they are obsolete, or because they exist for symmetry with other forms but no language that uses them has yet been encoded. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Joining_Type=Left_Joining}
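
One way to check such a claim against the Unicode version bundled with a given Perl is to scan the whole code point range. The sketch below does that; the helper name count_matches is made up for this example, and the counts it reports depend on the installed Unicode version, so they may differ from the zero stated here for Unicode 6.0.0.

```perl
use strict;
use warnings;
no warnings 'utf8';   # the scan also visits surrogates and noncharacters

# Count the code points in U+0000..U+10FFFF matched by a compiled regex.
# (Hypothetical helper written for this example.)
sub count_matches {
    my ($re) = @_;
    my $count = 0;
    for my $cp (0 .. 0x10FFFF) {
        $count++ if chr($cp) =~ $re;
    }
    return $count;
}

my %candidates = (
    'Canonical_Combining_Class=Attached_Below_Left'
        => qr/\p{Canonical_Combining_Class=Attached_Below_Left}/,
    'Joining_Type=Left_Joining'
        => qr/\p{Joining_Type=Left_Joining}/,
);

my %counts;
for my $name (sort keys %candidates) {
    $counts{$name} = count_matches($candidates{$name});
    print "\\p{$name} matches $counts{$name} code point(s)\n";
}
```

The scan is brute force and takes a moment to run, but it needs no knowledge of where Perl stores its property tables.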

Properties not accessible through \p{} and \P{}

A few properties are accessible in Perl via various function calls only. These are:

  Lowercase_Mapping    lc() and lcfirst()
  Titlecase_Mapping    ucfirst()
  Uppercase_Mapping    uc()

Case_Folding is accessible through the /i modifier in regular expressions.
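
A minimal sketch of these functions and of /i follows. The sample characters are chosen for illustration; they all lie above U+00FF so that Unicode rules apply whether or not the unicode_strings feature is enabled.

```perl
use strict;
use warnings;

my $dz_small    = "\x{1C6}";          # LATIN SMALL LETTER DZ WITH CARON
my $greek_upper = "\x{391}\x{392}";   # GREEK CAPITAL LETTER ALPHA, BETA

# Uppercase_Mapping and Titlecase_Mapping differ for this digraph.
my $upper = uc $dz_small;             # U+01C4
my $title = ucfirst $dz_small;        # U+01C5

# Lowercase_Mapping, for the whole string and for the first character.
my $lower       = lc $greek_upper;       # U+03B1 U+03B2
my $lower_first = lcfirst $greek_upper;  # U+03B1 U+0392

# Case_Folding: capital sigma matches small sigma under /i.
my $folds = "\x{3A3}" =~ /\x{3C3}/i ? 1 : 0;

printf "uc: U+%04X  ucfirst: U+%04X\n", ord $upper, ord $title;
printf "lc: %s\n", join ' ', map { sprintf 'U+%04X', ord } split //, $lower;
printf "lcfirst: %s\n",
    join ' ', map { sprintf 'U+%04X', ord } split //, $lower_first;
print  "sigma folds under /i: $folds\n";
```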

The Name property is accessible through the \N{} interpolation in double-quoted strings and regular expressions, but both usages require "use charnames;" to be specified. The charnames module also provides the related functions viacode(), vianame(), and string_vianame().
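
The following sketch shows \N{} in a string and in a pattern, together with the three charnames functions. The character used, GREEK SMALL LETTER SIGMA, is simply an example.

```perl
use strict;
use warnings;
use charnames ':full';

# \N{} in a double-quoted string and in a regular expression.
my $sigma   = "\N{GREEK SMALL LETTER SIGMA}";
my $matches = "\x{3C3}" =~ /\N{GREEK SMALL LETTER SIGMA}/ ? 1 : 0;

# Code point to name, name to code point, and name to one-character string.
my $name = charnames::viacode(0x3C3);
my $ord  = charnames::vianame('GREEK SMALL LETTER SIGMA');
my $char = charnames::string_vianame('GREEK SMALL LETTER SIGMA');

printf "U+%04X is %s\n", $ord, $name;
```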

Unicode regular expression properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses).

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Jamo_Short_Name (JSN)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common"

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib}/unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib}/unicore/mktables, and then re-compiling and installing. (%Config is available from the Config module.)

Files in the To directory (for serious hackers only)

All Unicode properties are really mappings (in the mathematical sense) from code points to their respective values. As part of its build process, Perl constructs tables containing these mappings for all properties that it deals with. Some, but not all, of these are written out into files. Those written out are in the directory $Config{privlib}/unicore/To/. (%Config is available from the Config module.)
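
A small sketch for locating that directory on a given installation follows. It builds the path from %Config and reports what it finds. Whether the directory exists, and which files it holds, depends on the Perl version and on how it was packaged.

```perl
use strict;
use warnings;
use Config;
use File::Spec;

# Build $Config{privlib}/unicore/To in a portable way.
my $to_dir = File::Spec->catdir($Config{privlib}, 'unicore', 'To');

if (opendir my $dh, $to_dir) {
    my @files = sort grep { /\.pl\z/ } readdir $dh;
    closedir $dh;
    print "$to_dir contains ", scalar(@files), " .pl file(s)\n";
}
else {
    print "$to_dir is not readable on this installation: $!\n";
}
```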

Perl reserves the right to change the format and even the existence of any of those files without notice, except the ones that were in existence prior to release 5.13. If those change, a deprecation cycle will be done first. These are:

  Bmg.pl       Bidi_Mirroring_Glyph (bmg)
  Digit.pl     Perl_Decimal_Digit
  Fold.pl      Case_Folding (cf)
  Lower.pl     Lowercase_Mapping (lc)
  NFKCCF.pl    NFKC_Casefold (NFKC_CF)
  Title.pl     Titlecase_Mapping (tc)
  Upper.pl     Uppercase_Mapping (uc)

Each of the files in this directory defines two hash entries to help reading programs decipher it. One of them looks like this:

  $utf8::SwashInfo{'ToNAME'}{'format'} = 's';

where 'NAME' is a name to indicate the property. For backwards compatibility, this is not necessarily the property's official Unicode name. (The 'To' is also for backwards compatibility.) The hash entry gives the format of the mapping fields of the table, currently one of the following:

  b    binary
  c    Perl's internal (Normalize.pm) decomposition mapping
  d    single decimal digit
  f    floating point number
  i    integer
  r    rational: an integer or a fraction
  s    string
  x    positive hex whole number; a code point

This format applies only to the entries in the main body of the table. Entries defined in hashes or ones that are missing from the list can have a different format.

The value that the missing entries have is given by the other SwashInfo hash entry line; it looks like this:

  $utf8::SwashInfo{'ToNAME'}{'missing'} = 'NaN';

This example line says that any Unicode code points not explicitly listed in the file have the value 'NaN' under the property indicated by NAME. If the value is the special string <code point>, it means that the value for any missing code point is the code point itself. This happens, for example, in the file for Uppercase_Mapping (To/Upper.pl), in which code points like the character 'A' are missing because the uppercase of 'A' is itself.
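
The fallback rule described above can be sketched as follows. Everything in the sketch is hypothetical: the %info_* hashes stand in for the two SwashInfo entries of a file, %explicit_upper stands in for entries read from a table body, and value_for is a helper invented for the example. It does not parse a real file, since the file format may change as noted earlier.

```perl
use strict;
use warnings;

# Hypothetical stand-ins for the 'format' and 'missing' entries.
my %info_upper = (format => 'x', missing => '<code point>');
my %info_nan   = (format => 'f', missing => 'NaN');

# Hypothetical explicit entries (code point => value); a real reading
# program would fill this from the main body of the table.
my %explicit_upper = (0x61 => 0x41, 0x62 => 0x42);

# Return the explicit value if there is one, else the 'missing' value,
# treating the special string '<code point>' as "maps to itself".
sub value_for {
    my ($cp, $info, $explicit) = @_;
    return $explicit->{$cp} if exists $explicit->{$cp};
    return $info->{missing} eq '<code point>' ? $cp : $info->{missing};
}

printf "U+%04X -> U+%04X\n", $_, value_for($_, \%info_upper, \%explicit_upper)
    for 0x61, 0x41;
print "unlisted value under 'NaN' default: ",
      value_for(0x41, \%info_nan, {}), "\n";
```

Here 'a' (U+0061) is listed explicitly and maps to U+0041, while 'A' (U+0041) is unlisted and so maps to itself.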

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode