Perl 5 version 30.0 documentation

perluniprops

NAME

perluniprops - Index of Unicode Version 12.1.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is the portion that is defined on single code points. (Other information in the Unicode data base below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See Unicode character properties that are NOT accepted by Perl.)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section Properties accessible through Unicode::UCD.

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see Other Properties in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against, for example, \p{Script_Extensions: Greek} and \p{Script_Extensions=Greek} both mean to match characters whose Script_Extensions property value is Greek. (Script_Extensions is an improved version of the Script property.)

Single forms, like \p{Greek} , are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is a just a shortcut for \p{Script_Extensions=Greek} ). There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word} . These are also listed in the table.

In parsing these constructs, Perl always ignores Upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek} . But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{} ) to "doesn't match" (for \P{} ). Casing in this document is for improved legibility.

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But in the table below a 'T' at the beginning of an entry means that tighter (stricter) rules are used for that entry:

Some properties are considered obsolete by Unicode, but still available. There are several varieties of obsolescence:

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. The table shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., \pL ), the left column contains both forms, with and without braces.

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points matched by the property. For the entries that give the longest, most descriptive version of the property, the count is followed by a list of some of the code points matched by it. The list includes all the matched characters in the 0-255 range, enclosed in the familiar [brackets] the same as a regular expression bracketed character class. Following that, the next few higher matching ranges are also given. To avoid visual ambiguity, the SPACE character is represented as \x20 .

For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation (/i= other_property) in the second column. Under case-insensitive matching they match the same code pode points as the property other_property.

There is no description given for most non-Perl defined properties (See http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  1. \p{Gc: *} \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  1. \p{Is_*} \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar} . "*" here is restricted to something not beginning with an underscore.

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary, and they have all their values spelled out without using this wild card, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card
  • (\d+) in the info column gives the number of Unicode code points matched by this property.
  • D means this is deprecated.
  • O means this is obsolete.
  • S means this is stabilized.
  • T means tighter (stricter) name matching applies.
  • X means use of this form is discouraged, and may not be stable.
  1. NAME INFO
  2. \p{Adlam} \p{Script_Extensions=Adlam} (Short:
  3. \p{Adlm}; NOT \p{Block=Adlam}) (89)
  4. \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
  5. (NOT \p{Block=Adlam}) (89)
  6. X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
  7. T \p{Age: 1.1} \p{Age=V1_1} (33_979)
  8. \p{Age: V1_1} Code point's usage introduced in version
  9. 1.1 (33_979: U+0000..01F5, U+01FA..0217,
  10. U+0250..02A8, U+02B0..02DE,
  11. U+02E0..02E9, U+0300..0345 ...)
  12. T \p{Age: 2.0} \p{Age=V2_0} (144_521)
  13. \p{Age: V2_0} Code point's usage was introduced in
  14. version 2.0; See also Property
  15. 'Present_In' (144_521: U+0591..05A1,
  16. U+05A3..05AF, U+05C4, U+0F00..0F47,
  17. U+0F49..0F69, U+0F71..0F8B ...)
  18. T \p{Age: 2.1} \p{Age=V2_1} (2)
  19. \p{Age: V2_1} Code point's usage was introduced in
  20. version 2.1; See also Property
  21. 'Present_In' (2: U+20AC, U+FFFC)
  22. T \p{Age: 3.0} \p{Age=V3_0} (10_307)
  23. \p{Age: V3_0} Code point's usage was introduced in
  24. version 3.0; See also Property
  25. 'Present_In' (10_307: U+01F6..01F9,
  26. U+0218..021F, U+0222..0233,
  27. U+02A9..02AD, U+02DF, U+02EA..02EE ...)
  28. T \p{Age: 3.1} \p{Age=V3_1} (44_978)
  29. \p{Age: V3_1} Code point's usage was introduced in
  30. version 3.1; See also Property
  31. 'Present_In' (44_978: U+03F4..03F5,
  32. U+FDD0..FDEF, U+10300..1031E,
  33. U+10320..10323, U+10330..1034A,
  34. U+10400..10425 ...)
  35. T \p{Age: 3.2} \p{Age=V3_2} (1016)
  36. \p{Age: V3_2} Code point's usage was introduced in
  37. version 3.2; See also Property
  38. 'Present_In' (1016: U+0220, U+034F,
  39. U+0363..036F, U+03D8..03D9, U+03F6,
  40. U+048A..048B ...)
  41. T \p{Age: 4.0} \p{Age=V4_0} (1226)
  42. \p{Age: V4_0} Code point's usage was introduced in
  43. version 4.0; See also Property
  44. 'Present_In' (1226: U+0221,
  45. U+0234..0236, U+02AE..02AF,
  46. U+02EF..02FF, U+0350..0357, U+035D..035F
  47. ...)
  48. T \p{Age: 4.1} \p{Age=V4_1} (1273)
  49. \p{Age: V4_1} Code point's usage was introduced in
  50. version 4.1; See also Property
  51. 'Present_In' (1273: U+0237..0241,
  52. U+0358..035C, U+03FC..03FF,
  53. U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
  54. T \p{Age: 5.0} \p{Age=V5_0} (1369)
  55. \p{Age: V5_0} Code point's usage was introduced in
  56. version 5.0; See also Property
  57. 'Present_In' (1369: U+0242..024F,
  58. U+037B..037D, U+04CF, U+04FA..04FF,
  59. U+0510..0513, U+05BA ...)
  60. T \p{Age: 5.1} \p{Age=V5_1} (1624)
  61. \p{Age: V5_1} Code point's usage was introduced in
  62. version 5.1; See also Property
  63. 'Present_In' (1624: U+0370..0373,
  64. U+0376..0377, U+03CF, U+0487,
  65. U+0514..0523, U+0606..060A ...)
  66. T \p{Age: 5.2} \p{Age=V5_2} (6648)
  67. \p{Age: V5_2} Code point's usage was introduced in
  68. version 5.2; See also Property
  69. 'Present_In' (6648: U+0524..0525,
  70. U+0800..082D, U+0830..083E, U+0900,
  71. U+094E, U+0955 ...)
  72. T \p{Age: 6.0} \p{Age=V6_0} (2088)
  73. \p{Age: V6_0} Code point's usage was introduced in
  74. version 6.0; See also Property
  75. 'Present_In' (2088: U+0526..0527,
  76. U+0620, U+065F, U+0840..085B, U+085E,
  77. U+093A..093B ...)
  78. T \p{Age: 6.1} \p{Age=V6_1} (732)
  79. \p{Age: V6_1} Code point's usage was introduced in
  80. version 6.1; See also Property
  81. 'Present_In' (732: U+058F, U+0604,
  82. U+08A0, U+08A2..08AC, U+08E4..08FE,
  83. U+0AF0 ...)
  84. T \p{Age: 6.2} \p{Age=V6_2} (1)
  85. \p{Age: V6_2} Code point's usage was introduced in
  86. version 6.2; See also Property
  87. 'Present_In' (1: U+20BA)
  88. T \p{Age: 6.3} \p{Age=V6_3} (5)
  89. \p{Age: V6_3} Code point's usage was introduced in
  90. version 6.3; See also Property
  91. 'Present_In' (5: U+061C, U+2066..2069)
  92. T \p{Age: 7.0} \p{Age=V7_0} (2834)
  93. \p{Age: V7_0} Code point's usage was introduced in
  94. version 7.0; See also Property
  95. 'Present_In' (2834: U+037F,
  96. U+0528..052F, U+058D..058E, U+0605,
  97. U+08A1, U+08AD..08B2 ...)
  98. T \p{Age: 8.0} \p{Age=V8_0} (7716)
  99. \p{Age: V8_0} Code point's usage was introduced in
  100. version 8.0; See also Property
  101. 'Present_In' (7716: U+08B3..08B4,
  102. U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
  103. ...)
  104. T \p{Age: 9.0} \p{Age=V9_0} (7500)
  105. \p{Age: V9_0} Code point's usage was introduced in
  106. version 9.0; See also Property
  107. 'Present_In' (7500: U+08B6..08BD,
  108. U+08D4..08E2, U+0C80, U+0D4F,
  109. U+0D54..0D56, U+0D58..0D5E ...)
  110. T \p{Age: 10.0} \p{Age=V10_0} (8518)
  111. \p{Age: V10_0} Code point's usage was introduced in
  112. version 10.0; See also Property
  113. 'Present_In' (8518: U+0860..086A,
  114. U+09FC..09FD, U+0AFA..0AFF, U+0D00,
  115. U+0D3B..0D3C, U+1CF7 ...)
  116. T \p{Age: 11.0} \p{Age=V11_0} (684)
  117. \p{Age: V11_0} Code point's usage was introduced in
  118. version 11.0; See also Property
  119. 'Present_In' (684: U+0560, U+0588,
  120. U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
  121. T \p{Age: 12.0} \p{Age=V12_0} (554)
  122. \p{Age: V12_0} Code point's usage was introduced in
  123. version 12.0; See also Property
  124. 'Present_In' (554: U+0C77, U+0E86,
  125. U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
  126. T \p{Age: 12.1} \p{Age=V12_1} (1)
  127. \p{Age: V12_1} Code point's usage was introduced in
  128. version 12.1; See also Property
  129. 'Present_In' (1: U+32FF)
  130. \p{Age: NA} \p{Age=Unassigned} (836_536 plus all
  131. above-Unicode code points)
  132. \p{Age: Unassigned} Code point's usage has not been assigned
  133. in any Unicode release thus far. (Short:
  134. \p{Age=NA}) (836_536 plus all above-
  135. Unicode code points: U+0378..0379,
  136. U+0380..0383, U+038B, U+038D, U+03A2,
  137. U+0530 ...)
  138. \p{Aghb} \p{Caucasian_Albanian} (=
  139. \p{Script_Extensions=
  140. Caucasian_Albanian}) (NOT \p{Block=
  141. Caucasian_Albanian}) (53)
  142. \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  143. (22)
  144. \p{AHex: *} \p{ASCII_Hex_Digit: *}
  145. \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
  146. Ahom}) (58)
  147. X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
  148. Alchemical_Symbols}) (128)
  149. X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
  150. \p{InAlchemical}) (128)
  151. \p{All} All code points, including those above
  152. Unicode. Same as qr/./s (1_114_112 plus
  153. all above-Unicode code points:
  154. U+0000..infinity)
  155. \p{Alnum} \p{XPosixAlnum} (127_886)
  156. \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
  157. (127_256)
  158. \p{Alpha: *} \p{Alphabetic: *}
  159. \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
  160. (127_256)
  161. \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (986_856
  162. plus all above-Unicode code points:
  163. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  164. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  165. \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
  166. U+02C2..02C5, U+02D2..02DF,
  167. U+02E5..02EB, U+02ED, U+02EF..0344 ...)
  168. \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (127_256:
  169. [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
  170. \xf6\xf8-\xff], U+0100..02C1,
  171. U+02C6..02D1, U+02E0..02E4, U+02EC,
  172. U+02EE ...)
  173. X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
  174. \p{Block=Alphabetic_Presentation_Forms})
  175. (80)
  176. X \p{Alphabetic_Presentation_Forms} \p{Block=
  177. Alphabetic_Presentation_Forms} (Short:
  178. \p{InAlphabeticPF}) (80)
  179. \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
  180. Anatolian_Hieroglyphs} (Short: \p{Hluw};
  181. NOT \p{Block=Anatolian_Hieroglyphs})
  182. (583)
  183. X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
  184. \p{Block=
  185. Ancient_Greek_Musical_Notation}) (80)
  186. X \p{Ancient_Greek_Musical_Notation} \p{Block=
  187. Ancient_Greek_Musical_Notation} (Short:
  188. \p{InAncientGreekMusic}) (80)
  189. X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
  190. X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
  191. \p{Any} All Unicode code points (1_114_112:
  192. U+0000..10FFFF)
  193. \p{Arab} \p{Arabic} (= \p{Script_Extensions=
  194. Arabic}) (NOT \p{Block=Arabic}) (1325)
  195. \p{Arabic} \p{Script_Extensions=Arabic} (Short:
  196. \p{Arab}; NOT \p{Block=Arabic}) (1325)
  197. X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
  198. Arabic_Extended_A}) (96)
  199. X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
  200. \p{InArabicExtA}) (96)
  201. X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
  202. (= \p{Block=
  203. Arabic_Mathematical_Alphabetic_Symbols})
  204. (256)
  205. X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
  206. Arabic_Mathematical_Alphabetic_Symbols}
  207. (Short: \p{InArabicMath}) (256)
  208. X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
  209. \p{Block=Arabic_Presentation_Forms_A})
  210. (688)
  211. X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
  212. \p{Block=Arabic_Presentation_Forms_B})
  213. (144)
  214. X \p{Arabic_Presentation_Forms_A} \p{Block=
  215. Arabic_Presentation_Forms_A} (Short:
  216. \p{InArabicPFA}) (688)
  217. X \p{Arabic_Presentation_Forms_B} \p{Block=
  218. Arabic_Presentation_Forms_B} (Short:
  219. \p{InArabicPFB}) (144)
  220. X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
  221. Arabic_Supplement}) (48)
  222. X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
  223. \p{InArabicSup}) (48)
  224. \p{Armenian} \p{Script_Extensions=Armenian} (Short:
  225. \p{Armn}; NOT \p{Block=Armenian}) (96)
  226. \p{Armi} \p{Imperial_Aramaic} (=
  227. \p{Script_Extensions=Imperial_Aramaic})
  228. (NOT \p{Block=Imperial_Aramaic}) (31)
  229. \p{Armn} \p{Armenian} (= \p{Script_Extensions=
  230. Armenian}) (NOT \p{Block=Armenian}) (96)
  231. X \p{Arrows} \p{Block=Arrows} (112)
  232. \p{ASCII} \p{Block=Basic_Latin} (128)
  233. \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  234. (22)
  235. \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
  236. plus all above-Unicode code points:
  237. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
  238. >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
  239. U+0100..infinity)
  240. \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
  241. Fa-f])
  242. \p{Assigned} All assigned code points (277_510:
  243. U+0000..0377, U+037A..037F,
  244. U+0384..038A, U+038C, U+038E..03A1,
  245. U+03A3..052F ...)
  246. \p{Avestan} \p{Script_Extensions=Avestan} (Short:
  247. \p{Avst}; NOT \p{Block=Avestan}) (61)
  248. \p{Avst} \p{Avestan} (= \p{Script_Extensions=
  249. Avestan}) (NOT \p{Block=Avestan}) (61)
  250. \p{Bali} \p{Balinese} (= \p{Script_Extensions=
  251. Balinese}) (NOT \p{Block=Balinese}) (121)
  252. \p{Balinese} \p{Script_Extensions=Balinese} (Short:
  253. \p{Bali}; NOT \p{Block=Balinese}) (121)
  254. \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
  255. (NOT \p{Block=Bamum}) (657)
  256. \p{Bamum} \p{Script_Extensions=Bamum} (Short:
  257. \p{Bamu}; NOT \p{Block=Bamum}) (657)
  258. X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
  259. Bamum_Supplement}) (576)
  260. X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
  261. \p{InBamumSup}) (576)
  262. X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
  263. \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
  264. Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
  265. (36)
  266. \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
  267. \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
  268. \p{Batak} \p{Script_Extensions=Batak} (Short:
  269. \p{Batk}; NOT \p{Block=Batak}) (56)
  270. \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
  271. (NOT \p{Block=Batak}) (56)
  272. \p{Bc: *} \p{Bidi_Class: *}
  273. \p{Beng} \p{Bengali} (= \p{Script_Extensions=
  274. Bengali}) (NOT \p{Block=Bengali}) (113)
  275. \p{Bengali} \p{Script_Extensions=Bengali} (Short:
  276. \p{Beng}; NOT \p{Block=Bengali}) (113)
  277. \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
  278. \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
  279. \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
  280. Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
  281. (97)
  282. \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
  283. (12)
  284. \p{Bidi_C: *} \p{Bidi_Control: *}
  285. \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1698)
  286. \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (61)
  287. \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608,
  288. U+060B, U+060D, U+061B..064A,
  289. U+066D..066F, U+0671..06D5 ...)
  290. \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61:
  291. U+0600..0605, U+0660..0669,
  292. U+066B..066C, U+06DD, U+08E2,
  293. U+10D30..10D39 ...)
  294. \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
  295. \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
  296. \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
  297. [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
  298. \xff], U+180E, U+200B..200D,
  299. U+2060..2065, U+206A..206F, U+FDD0..FDEF
  300. ...)
  301. \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
  302. [,.\/:\xa0], U+060C, U+202F, U+2044,
  303. U+FE50, U+FE52 ...)
  304. \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
  305. \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158)
  306. \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
  307. \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92)
  308. \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
  309. \xb3\xb9], U+06F0..06F9, U+2070,
  310. U+2074..2079, U+2080..2089, U+2488..249B
  311. ...)
  312. \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
  313. U+207A..207B, U+208A..208B, U+2212,
  314. U+FB29, U+FE62..FE63 ...)
  315. \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92:
  316. [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
  317. U+0609..060A, U+066A, U+09F2..09F3,
  318. U+09FB ...)
  319. \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
  320. U+2068)
  321. \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
  322. \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_767
  323. plus all above-Unicode code points)
  324. \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_767 plus
  325. all above-Unicode code points: [A-Za-
  326. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  327. \xff], U+0100..02B8, U+02BB..02C1,
  328. U+02D0..02D1, U+02E0..02E4, U+02EE ...)
  329. \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
  330. U+202A)
  331. \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
  332. U+2066)
  333. \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
  334. U+202D)
  335. \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
  336. \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
  337. \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
  338. \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1834:
  339. U+0300..036F, U+0483..0489,
  340. U+0591..05BD, U+05BF, U+05C1..05C2,
  341. U+05C4..05C5 ...)
  342. \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1834)
  343. \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5658)
  344. \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5658:
  345. [!\"&\'\(\)*;<=
  346. >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
  347. \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
  348. \xbf\xd7\xf7], U+02B9..02BA,
  349. U+02C2..02CF, U+02D2..02DF,
  350. U+02E5..02ED, U+02EF..02FF ...)
  351. \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
  352. [\n\r\x1c-\x1e\x85], U+2029)
  353. \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
  354. \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
  355. \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
  356. U+202C)
  357. \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
  358. U+2069)
  359. \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3765)
  360. \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3765: U+0590,
  361. U+05BE, U+05C0, U+05C3, U+05C6,
  362. U+05C8..05FF ...)
  363. \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
  364. U+202B)
  365. \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
  366. U+2067)
  367. \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
  368. U+202E)
  369. \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
  370. \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
  371. \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
  372. \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
  373. \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
  374. [\t\cK\x1f])
  375. \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
  376. U+1680, U+2000..200A, U+2028, U+205F,
  377. U+3000)
  378. \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
  379. \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
  380. \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
  381. plus all above-Unicode code points:
  382. U+0000..061B, U+061D..200D,
  383. U+2010..2029, U+202F..2065,
  384. U+206A..infinity)
  385. \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
  386. U+061C, U+200E..200F, U+202A..202E,
  387. U+2066..2069)
  388. \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
  389. (545)
  390. \p{Bidi_M: *} \p{Bidi_Mirrored: *}
  391. \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
  392. (545)
  393. \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
  394. plus all above-Unicode code points:
  395. [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
  396. Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
  397. \xff], U+0100..0F39, U+0F3E..169A,
  398. U+169D..2038, U+203B..2044, U+2047..207C
  399. ...)
  400. \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545:
  401. [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
  402. U+169B..169C, U+2039..203A,
  403. U+2045..2046, U+207D..207E ...)
  404. \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
  405. (60)
  406. \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
  407. [\)\]\}], U+0F3B, U+0F3D, U+169C,
  408. U+2046, U+207E ...)
  409. \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
  410. (1_113_992 plus all above-Unicode code
  411. points)
  412. \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
  413. plus all above-Unicode code points:
  414. [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
  415. >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
  416. U+0100..0F39, U+0F3E..169A,
  417. U+169D..2044, U+2047..207C, U+207F..208C
  418. ...)
  419. \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
  420. (60)
  421. \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
  422. [\(\[\{], U+0F3A, U+0F3C, U+169B,
  423. U+2045, U+207D ...)
  424. \p{Blank} \p{XPosixBlank} (18)
  425. \p{Blk: *} \p{Block: *}
  426. \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
  427. U+1E900..1E95F)
  428. \p{Block: Aegean_Numbers} (64: U+10100..1013F)
  429. \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
  430. U+11700..1173F)
  431. \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
  432. \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
  433. U+1F700..1F77F)
  434. \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
  435. (80)
  436. \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
  437. AlphabeticPF}) (80: U+FB00..FB4F)
  438. \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
  439. NOR \p{Is_Anatolian_Hieroglyphs}) (640:
  440. U+14400..1467F)
  441. \p{Block: Ancient_Greek_Music} \p{Block=
  442. Ancient_Greek_Musical_Notation} (80)
  443. \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
  444. AncientGreekMusic}) (80: U+1D200..1D24F)
  445. \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
  446. \p{Block: Ancient_Symbols} (64: U+10190..101CF)
  447. \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
  448. U+0600..06FF)
  449. \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
  450. \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
  451. U+08A0..08FF)
  452. \p{Block: Arabic_Math} \p{Block=
  453. Arabic_Mathematical_Alphabetic_Symbols}
  454. (256)
  455. \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
  456. ArabicMath}) (256: U+1EE00..1EEFF)
  457. \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
  458. \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
  459. \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
  460. (688: U+FB50..FDFF)
  461. \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
  462. (144: U+FE70..FEFF)
  463. \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
  464. \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
  465. U+0750..077F)
  466. \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
  467. (96: U+0530..058F)
  468. \p{Block: Arrows} (112: U+2190..21FF)
  469. \p{Block: ASCII} \p{Block=Basic_Latin} (128)
  470. \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
  471. U+10B00..10B3F)
  472. \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
  473. (128: U+1B00..1B7F)
  474. \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
  475. U+A6A0..A6FF)
  476. \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
  477. \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
  478. U+16800..16A3F)
  479. \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
  480. \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
  481. (48: U+16AD0..16AFF)
  482. \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
  483. U+1BC0..1BFF)
  484. \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
  485. U+0980..09FF)
  486. \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
  487. (112: U+11C00..11C6F)
  488. \p{Block: Block_Elements} (32: U+2580..259F)
  489. \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
  490. (48: U+3100..312F)
  491. \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
  492. \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
  493. U+31A0..31BF)
  494. \p{Block: Box_Drawing} (128: U+2500..257F)
  495. \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
  496. U+11000..1107F)
  497. \p{Block: Braille} \p{Block=Braille_Patterns} (256)
  498. \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
  499. U+2800..28FF)
  500. \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
  501. (32: U+1A00..1A1F)
  502. \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
  503. U+1740..175F)
  504. \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
  505. (256)
  506. \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
  507. ByzantineMusic}) (256: U+1D000..1D0FF)
  508. \p{Block: Canadian_Syllabics} \p{Block=
  509. Unified_Canadian_Aboriginal_Syllabics}
  510. (640)
  511. \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
  512. U+102A0..102DF)
  513. \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
  514. \p{Is_Caucasian_Albanian}) (64:
  515. U+10530..1056F)
  516. \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
  517. U+11100..1114F)
  518. \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
  519. U+AA00..AA5F)
  520. \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
  521. (96: U+13A0..13FF)
  522. \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
  523. \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
  524. U+AB70..ABBF)
  525. \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
  526. \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
  527. \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
  528. \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
  529. \p{Block: CJK_Compat_Ideographs} \p{Block=
  530. CJK_Compatibility_Ideographs} (512)
  531. \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
  532. CJK_Compatibility_Ideographs_Supplement}
  533. (544)
  534. \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
  535. U+3300..33FF)
  536. \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
  537. (32: U+FE30..FE4F)
  538. \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
  539. CJKCompatIdeographs}) (512: U+F900..FAFF)
  540. \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
  541. CJKCompatIdeographsSup}) (544:
  542. U+2F800..2FA1F)
  543. \p{Block: CJK_Ext_A} \p{Block=
  544. CJK_Unified_Ideographs_Extension_A}
  545. (6592)
  546. \p{Block: CJK_Ext_B} \p{Block=
  547. CJK_Unified_Ideographs_Extension_B}
  548. (42_720)
  549. \p{Block: CJK_Ext_C} \p{Block=
  550. CJK_Unified_Ideographs_Extension_C}
  551. (4160)
  552. \p{Block: CJK_Ext_D} \p{Block=
  553. CJK_Unified_Ideographs_Extension_D} (224)
  554. \p{Block: CJK_Ext_E} \p{Block=
  555. CJK_Unified_Ideographs_Extension_E}
  556. (5776)
  557. \p{Block: CJK_Ext_F} \p{Block=
  558. CJK_Unified_Ideographs_Extension_F}
  559. (7488)
  560. \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
  561. \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
  562. (128: U+2E80..2EFF)
  563. \p{Block: CJK_Strokes} (48: U+31C0..31EF)
  564. \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
  565. \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
  566. (64: U+3000..303F)
  567. \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
  568. U+4E00..9FFF)
  569. \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
  570. CJKExtA}) (6592: U+3400..4DBF)
  571. \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
  572. CJKExtB}) (42_720: U+20000..2A6DF)
  573. \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
  574. CJKExtC}) (4160: U+2A700..2B73F)
  575. \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
  576. CJKExtD}) (224: U+2B740..2B81F)
  577. \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
  578. CJKExtE}) (5776: U+2B820..2CEAF)
  579. \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
  580. CJKExtF}) (7488: U+2CEB0..2EBEF)
  581. \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
  582. Diacriticals}) (112: U+0300..036F)
  583. \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
  584. DiacriticalsExt}) (80: U+1AB0..1AFF)
  585. \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
  586. DiacriticalsForSymbols}) (48:
  587. U+20D0..20FF)
  588. \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
  589. DiacriticalsSup}) (64: U+1DC0..1DFF)
  590. \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
  591. U+FE20..FE2F)
  592. \p{Block: Combining_Marks_For_Symbols} \p{Block=
  593. Combining_Diacritical_Marks_For_Symbols}
  594. (48)
  595. \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
  596. IndicNumberForms}) (16: U+A830..A83F)
  597. \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
  598. \p{Block: Control_Pictures} (64: U+2400..243F)
  599. \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
  600. U+2C80..2CFF)
  601. \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
  602. \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
  603. \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
  604. U+1D360..1D37F)
  605. \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
  606. (1024: U+12000..123FF)
  607. \p{Block: Cuneiform_Numbers} \p{Block=
  608. Cuneiform_Numbers_And_Punctuation} (128)
  609. \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
  610. CuneiformNumbers}) (128: U+12400..1247F)
  611. \p{Block: Currency_Symbols} (48: U+20A0..20CF)
  612. \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
  613. \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
  614. (256: U+0400..04FF)
  615. \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
  616. \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
  617. \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
  618. \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
  619. U+2DE0..2DFF)
  620. \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
  621. U+A640..A69F)
  622. \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
  623. U+1C80..1C8F)
  624. \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
  625. \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
  626. U+0500..052F)
  627. \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
  628. (48)
  629. \p{Block: Deseret} (80: U+10400..1044F)
  630. \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
  631. (128: U+0900..097F)
  632. \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
  633. \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
  634. U+A8E0..A8FF)
  635. \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
  636. \p{Block: Diacriticals_Ext} \p{Block=
  637. Combining_Diacritical_Marks_Extended}
  638. (80)
  639. \p{Block: Diacriticals_For_Symbols} \p{Block=
  640. Combining_Diacritical_Marks_For_Symbols}
  641. (48)
  642. \p{Block: Diacriticals_Sup} \p{Block=
  643. Combining_Diacritical_Marks_Supplement}
  644. (64)
  645. \p{Block: Dingbats} (192: U+2700..27BF)
  646. \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
  647. U+11800..1184F)
  648. \p{Block: Domino} \p{Block=Domino_Tiles} (112)
  649. \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
  650. U+1F030..1F09F)
  651. \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
  652. (160: U+1BC00..1BC9F)
  653. \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
  654. \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
  655. \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
  656. \p{Is_Egyptian_Hieroglyphs}) (1072:
  657. U+13000..1342F)
  658. \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
  659. U+10500..1052F)
  660. \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
  661. U+10FE0..10FFF)
  662. \p{Block: Emoticons} (80: U+1F600..1F64F)
  663. \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
  664. \p{Block: Enclosed_Alphanum_Sup} \p{Block=
  665. Enclosed_Alphanumeric_Supplement} (256)
  666. \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
  667. EnclosedAlphanumSup}) (256:
  668. U+1F100..1F1FF)
  669. \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
  670. EnclosedAlphanum}) (160: U+2460..24FF)
  671. \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
  672. (256)
  673. \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
  674. EnclosedCJK}) (256: U+3200..32FF)
  675. \p{Block: Enclosed_Ideographic_Sup} \p{Block=
  676. Enclosed_Ideographic_Supplement} (256)
  677. \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
  678. EnclosedIdeographicSup}) (256:
  679. U+1F200..1F2FF)
  680. \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
  681. (384: U+1200..137F)
  682. \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
  683. \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
  684. \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
  685. U+2D80..2DDF)
  686. \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
  687. U+AB00..AB2F)
  688. \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
  689. \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
  690. U+1380..139F)
  691. \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
  692. \p{Punct} NOR \p{Is_Punctuation}) (112:
  693. U+2000..206F)
  694. \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
  695. \p{Block: Geometric_Shapes_Ext} \p{Block=
  696. Geometric_Shapes_Extended} (128)
  697. \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
  698. GeometricShapesExt}) (128:
  699. U+1F780..1F7FF)
  700. \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
  701. (96: U+10A0..10FF)
  702. \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
  703. \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
  704. U+1C90..1CBF)
  705. \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
  706. \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
  707. U+2D00..2D2F)
  708. \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
  709. (96: U+2C00..2C5F)
  710. \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
  711. \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
  712. (48: U+1E000..1E02F)
  713. \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
  714. U+10330..1034F)
  715. \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
  716. U+11300..1137F)
  717. \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
  718. NOR \p{Is_Greek}) (144)
  719. \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
  720. NOR \p{Is_Greek}) (144: U+0370..03FF)
  721. \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
  722. \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
  723. U+1F00..1FFF)
  724. \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
  725. (128: U+0A80..0AFF)
  726. \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
  727. \p{Is_Gunjala_Gondi}) (80:
  728. U+11D60..11DAF)
  729. \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
  730. (128: U+0A00..0A7F)
  731. \p{Block: Half_And_Full_Forms} \p{Block=
  732. Halfwidth_And_Fullwidth_Forms} (240)
  733. \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
  734. \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
  735. HalfAndFullForms}) (240: U+FF00..FFEF)
  736. \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
  737. NOR \p{Is_Hangul}) (11_184)
  738. \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
  739. (96: U+3130..318F)
  740. \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
  741. \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
  742. U+A960..A97F)
  743. \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
  744. U+D7B0..D7FF)
  745. \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
  746. NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
  747. \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
  748. \p{Is_Hanifi_Rohingya}) (64:
  749. U+10D00..10D3F)
  750. \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
  751. U+1720..173F)
  752. \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
  753. U+108E0..108FF)
  754. \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
  755. U+0590..05FF)
  756. \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
  757. HighPUSurrogates}) (128: U+DB80..DBFF)
  758. \p{Block: High_PU_Surrogates} \p{Block=
  759. High_Private_Use_Surrogates} (128)
  760. \p{Block: High_Surrogates} (896: U+D800..DB7F)
  761. \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
  762. (96: U+3040..309F)
  763. \p{Block: IDC} \p{Block=
  764. Ideographic_Description_Characters} (NOT
  765. \p{ID_Continue} NOR \p{Is_IDC}) (16)
  766. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
  767. NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
  768. U+2FF0..2FFF)
  769. \p{Block: Ideographic_Symbols} \p{Block=
  770. Ideographic_Symbols_And_Punctuation} (32)
  771. \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
  772. IdeographicSymbols}) (32: U+16FE0..16FFF)
  773. \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
  774. \p{Is_Imperial_Aramaic}) (32:
  775. U+10840..1085F)
  776. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  777. (16)
  778. \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
  779. \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
  780. NOR \p{Is_Inscriptional_Pahlavi}) (32:
  781. U+10B60..10B7F)
  782. \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
  783. NOR \p{Is_Inscriptional_Parthian}) (32:
  784. U+10B40..10B5F)
  785. \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  786. \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
  787. U+0250..02AF)
  788. \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  789. \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  790. \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  791. \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
  792. (96: U+A980..A9DF)
  793. \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
  794. U+11080..110CF)
  795. \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
  796. \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
  797. U+1B100..1B12F)
  798. \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  799. \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
  800. U+1B000..1B0FF)
  801. \p{Block: Kanbun} (16: U+3190..319F)
  802. \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  803. \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
  804. U+2F00..2FDF)
  805. \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
  806. U+0C80..0CFF)
  807. \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
  808. (96: U+30A0..30FF)
  809. \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  810. \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
  811. KatakanaExt}) (16: U+31F0..31FF)
  812. \p{Block: Kayah_Li} (48: U+A900..A92F)
  813. \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
  814. (96: U+10A00..10A5F)
  815. \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
  816. U+1780..17FF)
  817. \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
  818. \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
  819. U+11200..1124F)
  820. \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
  821. (80: U+112B0..112FF)
  822. \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
  823. U+0E80..0EFF)
  824. \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  825. \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  826. \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
  827. \xff])
  828. \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  829. \p{Block: Latin_Ext_Additional} \p{Block=
  830. Latin_Extended_Additional} (256)
  831. \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  832. \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  833. \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  834. \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
  835. \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
  836. U+0100..017F)
  837. \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
  838. LatinExtAdditional}) (256: U+1E00..1EFF)
  839. \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
  840. U+0180..024F)
  841. \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
  842. U+2C60..2C7F)
  843. \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
  844. U+A720..A7FF)
  845. \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
  846. U+AB30..AB6F)
  847. \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
  848. U+1C00..1C4F)
  849. \p{Block: Letterlike_Symbols} (80: U+2100..214F)
  850. \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
  851. U+1900..194F)
  852. \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
  853. (384: U+10600..1077F)
  854. \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
  855. \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
  856. \p{Block: Lisu} (48: U+A4D0..A4FF)
  857. \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
  858. \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
  859. U+10280..1029F)
  860. \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
  861. U+10920..1093F)
  862. \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
  863. (48: U+11150..1117F)
  864. \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  865. \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
  866. U+1F000..1F02F)
  867. \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
  868. U+11EE0..11EFF)
  869. \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
  870. (128: U+0D00..0D7F)
  871. \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
  872. U+0840..085F)
  873. \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
  874. (64: U+10AC0..10AFF)
  875. \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
  876. U+11C70..11CBF)
  877. \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
  878. \p{Is_Masaram_Gondi}) (96:
  879. U+11D00..11D5F)
  880. \p{Block: Math_Alphanum} \p{Block=
  881. Mathematical_Alphanumeric_Symbols} (1024)
  882. \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  883. \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
  884. MathAlphanum}) (1024: U+1D400..1D7FF)
  885. \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
  886. (256: U+2200..22FF)
  887. \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
  888. \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR
  889. \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
  890. \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
  891. \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
  892. \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  893. \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
  894. (32: U+AAE0..AAFF)
  895. \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
  896. \p{Is_Mende_Kikakui}) (224:
  897. U+1E800..1E8DF)
  898. \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
  899. \p{Is_Meroitic_Cursive}) (96:
  900. U+109A0..109FF)
  901. \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
  902. \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
  903. U+16F00..16F9F)
  904. \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
  905. (256)
  906. \p{Block: Misc_Math_Symbols_A} \p{Block=
  907. Miscellaneous_Mathematical_Symbols_A}
  908. (48)
  909. \p{Block: Misc_Math_Symbols_B} \p{Block=
  910. Miscellaneous_Mathematical_Symbols_B}
  911. (128)
  912. \p{Block: Misc_Pictographs} \p{Block=
  913. Miscellaneous_Symbols_And_Pictographs}
  914. (768)
  915. \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  916. \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  917. \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
  918. MiscMathSymbolsA}) (48: U+27C0..27EF)
  919. \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
  920. MiscMathSymbolsB}) (128: U+2980..29FF)
  921. \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
  922. (256: U+2600..26FF)
  923. \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
  924. MiscArrows}) (256: U+2B00..2BFF)
  925. \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
  926. MiscPictographs}) (768: U+1F300..1F5FF)
  927. \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
  928. (256: U+2300..23FF)
  929. \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
  930. U+11600..1165F)
  931. \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  932. \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
  933. \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
  934. (176: U+1800..18AF)
  935. \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
  936. \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
  937. U+11660..1167F)
  938. \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
  939. U+16A40..16A6F)
  940. \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
  941. U+11280..112AF)
  942. \p{Block: Music} \p{Block=Musical_Symbols} (256)
  943. \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
  944. U+1D100..1D1FF)
  945. \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
  946. U+1000..109F)
  947. \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  948. \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
  949. \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
  950. U+AA60..AA7F)
  951. \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
  952. U+A9E0..A9FF)
  953. \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
  954. (48: U+10880..108AF)
  955. \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR
  956. \p{Is_Nandinagari}) (96: U+119A0..119FF)
  957. \p{Block: NB} \p{Block=No_Block} (832_720 plus all
  958. above-Unicode code points)
  959. \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
  960. \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
  961. \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
  962. U+11400..1147F)
  963. \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
  964. U+07C0..07FF)
  965. \p{Block: No_Block} (Short: \p{Blk=NB}) (832_720 plus all
  966. above-Unicode code points: U+0870..089F,
  967. U+2FE0..2FEF, U+10200..1027F,
  968. U+103E0..103FF, U+10570..105FF,
  969. U+10780..107FF ...)
  970. \p{Block: Number_Forms} (64: U+2150..218F)
  971. \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
  972. U+1B170..1B2FF)
  973. \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
  974. NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
  975. U+1E100..1E14F)
  976. \p{Block: OCR} \p{Block=Optical_Character_Recognition}
  977. (32)
  978. \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
  979. U+1680..169F)
  980. \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
  981. \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
  982. \p{Is_Old_Hungarian}) (128:
  983. U+10C80..10CFF)
  984. \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
  985. (48: U+10300..1032F)
  986. \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
  987. \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
  988. (48: U+10350..1037F)
  989. \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
  990. \p{Is_Old_Persian}) (64: U+103A0..103DF)
  991. \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR
  992. \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
  993. \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
  994. \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
  995. (80: U+10C00..10C4F)
  996. \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
  997. U+2440..245F)
  998. \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
  999. U+0B00..0B7F)
  1000. \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
  1001. \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
  1002. U+104B0..104FF)
  1003. \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
  1004. U+10480..104AF)
  1005. \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
  1006. \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
  1007. \p{Is_Pahawh_Hmong}) (144:
  1008. U+16B00..16B8F)
  1009. \p{Block: Palmyrene} (32: U+10860..1087F)
  1010. \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
  1011. \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
  1012. \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
  1013. (64: U+A840..A87F)
  1014. \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  1015. \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
  1016. U+101D0..101FF)
  1017. \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
  1018. (32: U+10900..1091F)
  1019. \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  1020. \p{Block: Phonetic_Ext_Sup} \p{Block=
  1021. Phonetic_Extensions_Supplement} (64)
  1022. \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
  1023. U+1D00..1D7F)
  1024. \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
  1025. PhoneticExtSup}) (64: U+1D80..1DBF)
  1026. \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
  1027. \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
  1028. \p{Private_Use} NOR \p{Is_Private_Use})
  1029. (6400)
  1030. \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
  1031. \p{Private_Use} NOR \p{Is_Private_Use})
  1032. (6400: U+E000..F8FF)
  1033. \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
  1034. \p{Is_Psalter_Pahlavi}) (48:
  1035. U+10B80..10BAF)
  1036. \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
  1037. \p{Private_Use} NOR \p{Is_Private_Use})
  1038. (6400)
  1039. \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
  1040. \p{Punct} NOR \p{Is_Punctuation}) (112)
  1041. \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
  1042. U+A930..A95F)
  1043. \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  1044. \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
  1045. U+10E60..10E7F)
  1046. \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
  1047. U+16A0..16FF)
  1048. \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
  1049. (64: U+0800..083F)
  1050. \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
  1051. (96: U+A880..A8DF)
  1052. \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
  1053. U+11180..111DF)
  1054. \p{Block: Shavian} (48: U+10450..1047F)
  1055. \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
  1056. \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
  1057. U+11580..115FF)
  1058. \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
  1059. U+0D80..0DFF)
  1060. \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
  1061. \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
  1062. U+FE50..FE6F)
  1063. \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  1064. \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
  1065. \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
  1066. U+1B130..1B16F)
  1067. \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
  1068. U+10F30..10F6F)
  1069. \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
  1070. \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
  1071. \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
  1072. U+11A50..11AAF)
  1073. \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
  1074. ModifierLetters}) (80: U+02B0..02FF)
  1075. \p{Block: Specials} (16: U+FFF0..FFFF)
  1076. \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
  1077. (64: U+1B80..1BBF)
  1078. \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  1079. \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
  1080. U+1CC0..1CCF)
  1081. \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  1082. \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  1083. \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
  1084. \p{Block: Sup_Math_Operators} \p{Block=
  1085. Supplemental_Mathematical_Operators}
  1086. (256)
  1087. \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
  1088. (65_536)
  1089. \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
  1090. (65_536)
  1091. \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  1092. \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
  1093. Supplemental_Symbols_And_Pictographs}
  1094. (256)
  1095. \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  1096. \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
  1097. SuperAndSub}) (48: U+2070..209F)
  1098. \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
  1099. U+27F0..27FF)
  1100. \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
  1101. U+2900..297F)
  1102. \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
  1103. U+1F800..1F8FF)
  1104. \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
  1105. SupMathOperators}) (256: U+2A00..2AFF)
  1106. \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
  1107. SupPunctuation}) (128: U+2E00..2E7F)
  1108. \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
  1109. SupSymbolsAndPictographs}) (256:
  1110. U+1F900..1F9FF)
  1111. \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
  1112. SupPUAA}) (65_536: U+F0000..FFFFF)
  1113. \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
  1114. SupPUAB}) (65_536: U+100000..10FFFF)
  1115. \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
  1116. \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
  1117. \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
  1118. \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
  1119. Symbols_And_Pictographs_Extended_A} (144)
  1120. \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
  1121. SymbolsAndPictographsExtA}) (144:
  1122. U+1FA70..1FAFF)
  1123. \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
  1124. U+0700..074F)
  1125. \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
  1126. \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
  1127. U+0860..086F)
  1128. \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
  1129. U+1700..171F)
  1130. \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
  1131. (32: U+1760..177F)
  1132. \p{Block: Tags} (128: U+E0000..E007F)
  1133. \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
  1134. U+1950..197F)
  1135. \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
  1136. (144: U+1A20..1AAF)
  1137. \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
  1138. (96: U+AA80..AADF)
  1139. \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  1140. \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
  1141. U+1D300..1D35F)
  1142. \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
  1143. U+11680..116CF)
  1144. \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
  1145. U+0B80..0BFF)
  1146. \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
  1147. \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
  1148. U+11FC0..11FFF)
  1149. \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
  1150. U+17000..187FF)
  1151. \p{Block: Tangut_Components} (768: U+18800..18AFF)
  1152. \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
  1153. U+0C00..0C7F)
  1154. \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
  1155. U+0780..07BF)
  1156. \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
  1157. U+0E00..0E7F)
  1158. \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
  1159. U+0F00..0FFF)
  1160. \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
  1161. (80: U+2D30..2D7F)
  1162. \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
  1163. U+11480..114DF)
  1164. \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
  1165. (128)
  1166. \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
  1167. TransportAndMap}) (128: U+1F680..1F6FF)
  1168. \p{Block: UCAS} \p{Block=
  1169. Unified_Canadian_Aboriginal_Syllabics}
  1170. (640)
  1171. \p{Block: UCAS_Ext} \p{Block=
  1172. Unified_Canadian_Aboriginal_Syllabics_-
  1173. Extended} (80)
  1174. \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
  1175. (32: U+10380..1039F)
  1176. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  1177. UCAS}) (640: U+1400..167F)
  1178. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
  1179. \p{Blk=UCASExt}) (80: U+18B0..18FF)
  1180. \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
  1181. U+A500..A63F)
  1182. \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
  1183. \p{Variation_Selector} NOR \p{Is_VS})
  1184. (16: U+FE00..FE0F)
  1185. \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
  1186. (240: U+E0100..E01EF)
  1187. \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  1188. \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
  1189. U+1CD0..1CFF)
  1190. \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
  1191. \p{Block: VS} \p{Block=Variation_Selectors} (NOT
  1192. \p{Variation_Selector} NOR \p{Is_VS})
  1193. (16)
  1194. \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
  1195. (240)
  1196. \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
  1197. U+1E2C0..1E2FF)
  1198. \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
  1199. \p{Is_Warang_Citi}) (96: U+118A0..118FF)
  1200. \p{Block: Yi_Radicals} (64: U+A490..A4CF)
  1201. \p{Block: Yi_Syllables} (1168: U+A000..A48F)
  1202. \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  1203. \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
  1204. U+4DC0..4DFF)
  1205. \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
  1206. \p{Is_Zanabazar_Square}) (80:
  1207. U+11A00..11A4F)
  1208. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  1209. \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
  1210. Bopomofo}) (NOT \p{Block=Bopomofo}) (112)
  1211. \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
  1212. \p{Bopo}; NOT \p{Block=Bopomofo}) (112)
  1213. X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
  1214. Bopomofo_Extended}) (32)
  1215. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
  1216. \p{InBopomofoExt}) (32)
  1217. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  1218. \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  1219. \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
  1220. Brahmi}) (NOT \p{Block=Brahmi}) (109)
  1221. \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
  1222. \p{Brah}; NOT \p{Block=Brahmi}) (109)
  1223. \p{Brai} \p{Braille} (= \p{Script_Extensions=
  1224. Braille}) (256)
  1225. \p{Braille} \p{Script_Extensions=Braille} (Short:
  1226. \p{Brai}) (256)
  1227. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
  1228. \p{InBraille}) (256)
  1229. \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
  1230. Buginese}) (NOT \p{Block=Buginese}) (31)
  1231. \p{Buginese} \p{Script_Extensions=Buginese} (Short:
  1232. \p{Bugi}; NOT \p{Block=Buginese}) (31)
  1233. \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
  1234. (NOT \p{Block=Buhid}) (22)
  1235. \p{Buhid} \p{Script_Extensions=Buhid} (Short:
  1236. \p{Buhd}; NOT \p{Block=Buhid}) (22)
  1237. X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
  1238. Byzantine_Musical_Symbols}) (256)
  1239. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  1240. (Short: \p{InByzantineMusic}) (256)
  1241. \p{C} \pC \p{Other} (= \p{General_Category=Other})
  1242. (976_344 plus all above-Unicode code
  1243. points)
  1244. \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
  1245. Chakma}) (NOT \p{Block=Chakma}) (90)
  1246. \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
  1247. (Short: \p{Cans}) (710)
  1248. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  1249. (= \p{Block=
  1250. Unified_Canadian_Aboriginal_Syllabics})
  1251. (640)
  1252. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  1253. Not_Reordered} (1_113_250 plus all
  1254. above-Unicode code points)
  1255. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  1256. Overlay} (32)
  1257. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  1258. Nukta} (25)
  1259. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  1260. Kana_Voicing} (2)
  1261. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  1262. Virama} (58)
  1263. T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
  1264. CCC10} (1)
  1265. \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
  1266. U+05B0)
  1267. T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
  1268. CCC11} (1)
  1269. \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
  1270. U+05B1)
  1271. T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
  1272. CCC12} (1)
  1273. \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
  1274. U+05B2)
  1275. T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
  1276. CCC13} (1)
  1277. \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
  1278. U+05B3)
  1279. T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
  1280. CCC14} (1)
  1281. \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
  1282. U+05B4)
  1283. T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
  1284. CCC15} (1)
  1285. \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
  1286. U+05B5)
  1287. T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
  1288. CCC16} (1)
  1289. \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
  1290. U+05B6)
  1291. T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
  1292. CCC17} (1)
  1293. \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
  1294. U+05B7)
  1295. T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
  1296. CCC18} (2)
  1297. \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
  1298. U+05B8, U+05C7)
  1299. T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
  1300. CCC19} (2)
  1301. \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
  1302. U+05B9..05BA)
  1303. T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
  1304. CCC20} (1)
  1305. \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
  1306. U+05BB)
  1307. T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
  1308. CCC21} (1)
  1309. \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
  1310. U+05BC)
  1311. T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
  1312. CCC22} (1)
  1313. \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
  1314. U+05BD)
  1315. T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
  1316. CCC23} (1)
  1317. \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
  1318. U+05BF)
  1319. T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
  1320. CCC24} (1)
  1321. \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
  1322. U+05C1)
  1323. T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
  1324. CCC25} (1)
  1325. \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
  1326. U+05C2)
  1327. T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
  1328. CCC26} (1)
  1329. \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
  1330. U+FB1E)
  1331. T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
  1332. CCC27} (2)
  1333. \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
  1334. U+064B, U+08F0)
  1335. T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
  1336. CCC28} (2)
  1337. \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
  1338. U+064C, U+08F1)
  1339. T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
  1340. CCC29} (2)
  1341. \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
  1342. U+064D, U+08F2)
  1343. T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
  1344. CCC30} (2)
  1345. \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
  1346. U+0618, U+064E)
  1347. T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
  1348. CCC31} (2)
  1349. \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
  1350. U+0619, U+064F)
  1351. T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
  1352. CCC32} (2)
  1353. \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
  1354. U+061A, U+0650)
  1355. T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
  1356. CCC33} (1)
  1357. \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
  1358. U+0651)
  1359. T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
  1360. CCC34} (1)
  1361. \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
  1362. U+0652)
  1363. T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
  1364. CCC35} (1)
  1365. \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
  1366. U+0670)
  1367. T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
  1368. CCC36} (1)
  1369. \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
  1370. U+0711)
  1371. T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
  1372. CCC84} (1)
  1373. \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
  1374. U+0C55)
  1375. T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
  1376. CCC91} (1)
  1377. \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
  1378. U+0C56)
  1379. T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
  1380. CCC103} (2)
  1381. \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
  1382. U+0E38..0E39)
  1383. T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
  1384. CCC107} (4)
  1385. \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
  1386. U+0E48..0E4B)
  1387. T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
  1388. CCC118} (2)
  1389. \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
  1390. U+0EB8..0EB9)
  1391. T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
  1392. CCC122} (4)
  1393. \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
  1394. U+0EC8..0ECB)
  1395. T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
  1396. CCC129} (1)
  1397. \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
  1398. U+0F71)
  1399. T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
  1400. CCC130} (6)
  1401. \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
  1402. U+0F72, U+0F7A..0F7D, U+0F80)
  1403. T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
  1404. CCC132} (1)
  1405. \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
  1406. U+0F74)
  1407. T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
  1408. CCC133} (0)
  1409. \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  1410. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  1411. Attached_Below_Left} (0)
  1412. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  1413. Attached_Below} (5)
  1414. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  1415. Attached_Above} (1)
  1416. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  1417. Attached_Above_Right} (9)
  1418. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  1419. Below_Left} (1)
  1420. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  1421. Below} (163)
  1422. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  1423. Below_Right} (4)
  1424. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  1425. Left} (2)
  1426. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  1427. Right} (1)
  1428. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  1429. Above_Left} (5)
  1430. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  1431. Above} (482)
  1432. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  1433. Above_Right} (5)
  1434. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  1435. Double_Below} (4)
  1436. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  1437. Double_Above} (5)
  1438. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  1439. Iota_Subscript} (1)
  1440. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  1441. Above} (482)
  1442. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (482:
  1443. U+0300..0314, U+033D..0344, U+0346,
  1444. U+034A..034C, U+0350..0352, U+0357 ...)
  1445. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
  1446. U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
  1447. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
  1448. U+0315, U+031A, U+0358, U+1DF6, U+302C)
  1449. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  1450. Above_Left} (5)
  1451. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  1452. Above_Right} (5)
  1453. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  1454. Attached_Above} (1)
  1455. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  1456. Attached_Above_Right} (9)
  1457. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  1458. Attached_Below} (5)
  1459. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  1460. Attached_Below_Left} (0)
  1461. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  1462. (1: U+1DCE)
  1463. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  1464. \p{Ccc=ATAR}) (9: U+031B, U+0F39,
  1465. U+1D165..1D166, U+1D16E..1D172)
  1466. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  1467. (5: U+0321..0322, U+0327..0328, U+1DD0)
  1468. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  1469. ATBL}) (0)
  1470. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  1471. Below} (163)
  1472. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (163:
  1473. U+0316..0319, U+031C..0320,
  1474. U+0323..0326, U+0329..0333,
  1475. U+0339..033C, U+0347..0349 ...)
  1476. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
  1477. U+302A)
  1478. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
  1479. U+059A, U+05AD, U+1939, U+302D)
  1480. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  1481. Below_Left} (1)
  1482. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  1483. Below_Right} (4)
  1484. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  1485. Double_Above} (5)
  1486. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  1487. Double_Below} (4)
  1488. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
  1489. (5: U+035D..035E, U+0360..0361, U+1DCD)
  1490. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
  1491. (4: U+035C, U+035F, U+0362, U+1DFC)
  1492. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  1493. (1: U+0345)
  1494. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  1495. Iota_Subscript} (1)
  1496. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
  1497. (2: U+3099..309A)
  1498. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  1499. Kana_Voicing} (2)
  1500. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  1501. Left} (2)
  1502. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
  1503. U+302E..302F)
  1504. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  1505. Nukta} (25)
  1506. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  1507. (1_113_250 plus all above-Unicode code
  1508. points: U+0000..02FF, U+034F,
  1509. U+0370..0482, U+0488..0590, U+05BE,
  1510. U+05C0 ...)
  1511. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  1512. Not_Reordered} (1_113_250 plus all
  1513. above-Unicode code points)
  1514. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (25:
  1515. U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
  1516. U+0CBC ...)
  1517. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  1518. Overlay} (32)
  1519. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
  1520. U+0334..0338, U+1CD4, U+1CE2..1CE8,
  1521. U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
  1522. ...)
  1523. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  1524. Right} (1)
  1525. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
  1526. U+1D16D)
  1527. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (58:
  1528. U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
  1529. U+0BCD ...)
  1530. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  1531. Virama} (58)
  1532. \p{Cans} \p{Canadian_Aboriginal} (=
  1533. \p{Script_Extensions=
  1534. Canadian_Aboriginal}) (710)
  1535. \p{Cari} \p{Carian} (= \p{Script_Extensions=
  1536. Carian}) (NOT \p{Block=Carian}) (49)
  1537. \p{Carian} \p{Script_Extensions=Carian} (Short:
  1538. \p{Cari}; NOT \p{Block=Carian}) (49)
  1539. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2396)
  1540. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_716 plus
  1541. all above-Unicode code points: [\x00-
  1542. \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
  1543. Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
  1544. \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
  1545. U+0100..02AF, U+0370..0373,
  1546. U+0376..0379, U+037B..0383, U+0386 ...)
  1547. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2396:
  1548. [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
  1549. U+02B0..036F, U+0374..0375, U+037A,
  1550. U+0384..0385, U+0387 ...)
  1551. \p{Cased} \p{Cased=Y} (4279)
  1552. \p{Cased: N*} (Single: \P{Cased}) (1_109_833 plus all
  1553. above-Unicode code points: [\x00-
  1554. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  1555. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  1556. \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
  1557. U+01C0..01C3, U+0294, U+02B9..02BF,
  1558. U+02C2..02DF ...)
  1559. \p{Cased: Y*} (Single: \p{Cased}) (4279: [A-Za-
  1560. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  1561. \xff], U+0100..01BA, U+01BC..01BF,
  1562. U+01C4..0293, U+0295..02B8, U+02C0..02C1
  1563. ...)
  1564. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1565. \p{LC}) (3970)
  1566. \p{Category: *} \p{General_Category: *}
  1567. \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
  1568. (Short: \p{Aghb}; NOT \p{Block=
  1569. Caucasian_Albanian}) (53)
  1570. \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
  1571. Control}) (65)
  1572. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1573. \p{CE} \p{Composition_Exclusion} (=
  1574. \p{Composition_Exclusion=Y}) (81)
  1575. \p{CE: *} \p{Composition_Exclusion: *}
  1576. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1577. (161)
  1578. \p{Chakma} \p{Script_Extensions=Chakma} (Short:
  1579. \p{Cakm}; NOT \p{Block=Chakma}) (90)
  1580. \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
  1581. Cham}) (83)
  1582. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1583. \p{CWCF}) (1463)
  1584. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1585. (1_112_649 plus all above-Unicode code
  1586. points: [\x00-
  1587. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  1588. >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
  1589. \xbf\xd7\xe0-\xff], U+0101, U+0103,
  1590. U+0105, U+0107, U+0109 ...)
  1591. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1592. (1463: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
  1593. U+0100, U+0102, U+0104, U+0106, U+0108
  1594. ...)
  1595. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1596. \p{CWCM}) (2841)
  1597. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1598. (1_111_271 plus all above-Unicode code
  1599. points: [\x00-
  1600. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  1601. >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
  1602. \xbf\xd7\xf7], U+0138, U+018D, U+019B,
  1603. U+01AA..01AB, U+01BA..01BB ...)
  1604. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1605. (2841: [A-Za-z\xb5\xc0-\xd6\xd8-
  1606. \xf6\xf8-\xff], U+0100..0137,
  1607. U+0139..018C, U+018E..019A,
  1608. U+019C..01A9, U+01AC..01B9 ...)
  1609. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1610. \p{CWL}) (1390)
  1611. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1612. (1_112_722 plus all above-Unicode code
  1613. points: [\x00-
  1614. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  1615. >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
  1616. \xbf\xd7\xdf-\xff], U+0101, U+0103,
  1617. U+0105, U+0107, U+0109 ...)
  1618. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1390:
  1619. [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
  1620. U+0104, U+0106, U+0108 ...)
  1621. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1622. Y} (Short: \p{CWKCF}) (10_315)
  1623. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1624. \P{CWKCF}) (1_103_797 plus all above-
  1625. Unicode code points: [\x00-
  1626. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  1627. >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
  1628. \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
  1629. \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
  1630. U+0103, U+0105, U+0107, U+0109 ...)
  1631. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1632. \p{CWKCF}) (10_315: [A-
  1633. Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
  1634. \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
  1635. U+0100, U+0102, U+0104, U+0106, U+0108
  1636. ...)
  1637. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1638. \p{CWT}) (1409)
  1639. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1640. (1_112_703 plus all above-Unicode code
  1641. points: [\x00-
  1642. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  1643. Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
  1644. \xde\xf7], U+0100, U+0102, U+0104,
  1645. U+0106, U+0108 ...)
  1646. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1409:
  1647. [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
  1648. U+0103, U+0105, U+0107, U+0109 ...)
  1649. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1650. \p{CWU}) (1482)
  1651. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1652. (1_112_630 plus all above-Unicode code
  1653. points: [\x00-
  1654. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  1655. Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
  1656. \xde\xf7], U+0100, U+0102, U+0104,
  1657. U+0106, U+0108 ...)
  1658. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1482:
  1659. [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
  1660. U+0103, U+0105, U+0107, U+0109 ...)
  1661. \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
  1662. Cherokee}) (NOT \p{Block=Cherokee}) (172)
  1663. \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
  1664. \p{Cher}; NOT \p{Block=Cherokee}) (172)
  1665. X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
  1666. Cherokee_Supplement}) (80)
  1667. X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
  1668. \p{InCherokeeSup}) (80)
  1669. X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
  1670. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1671. Y}) (2396)
  1672. \p{CI: *} \p{Case_Ignorable: *}
  1673. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1674. CJK_Unified_Ideographs}) (20_992)
  1675. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1676. CJK_Compatibility}) (256)
  1677. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1678. CJK_Compatibility_Forms}) (32)
  1679. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1680. \p{Block=CJK_Compatibility_Ideographs})
  1681. (512)
  1682. X \p{CJK_Compat_Ideographs_Sup}
  1683. \p{CJK_Compatibility_Ideographs_-
  1684. Supplement} (= \p{Block=
  1685. CJK_Compatibility_Ideographs_-
  1686. Supplement}) (544)
  1687. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1688. \p{InCJKCompat}) (256)
  1689. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1690. (Short: \p{InCJKCompatForms}) (32)
  1691. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1692. CJK_Compatibility_Ideographs} (Short:
  1693. \p{InCJKCompatIdeographs}) (512)
  1694. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1695. CJK_Compatibility_Ideographs_Supplement}
  1696. (Short: \p{InCJKCompatIdeographsSup})
  1697. (544)
  1698. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1699. \p{Block=
  1700. CJK_Unified_Ideographs_Extension_A})
  1701. (6592)
  1702. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1703. \p{Block=
  1704. CJK_Unified_Ideographs_Extension_B})
  1705. (42_720)
  1706. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1707. \p{Block=
  1708. CJK_Unified_Ideographs_Extension_C})
  1709. (4160)
  1710. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1711. \p{Block=
  1712. CJK_Unified_Ideographs_Extension_D})
  1713. (224)
  1714. X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
  1715. \p{Block=
  1716. CJK_Unified_Ideographs_Extension_E})
  1717. (5776)
  1718. X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
  1719. \p{Block=
  1720. CJK_Unified_Ideographs_Extension_F})
  1721. (7488)
  1722. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1723. CJK_Radicals_Supplement}) (128)
  1724. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1725. (Short: \p{InCJKRadicalsSup}) (128)
  1726. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1727. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1728. \p{Block=CJK_Symbols_And_Punctuation})
  1729. (64)
  1730. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1731. CJK_Symbols_And_Punctuation} (Short:
  1732. \p{InCJKSymbols}) (64)
  1733. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1734. (Short: \p{InCJK}) (20_992)
  1735. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1736. CJK_Unified_Ideographs_Extension_A}
  1737. (Short: \p{InCJKExtA}) (6592)
  1738. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1739. CJK_Unified_Ideographs_Extension_B}
  1740. (Short: \p{InCJKExtB}) (42_720)
  1741. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1742. CJK_Unified_Ideographs_Extension_C}
  1743. (Short: \p{InCJKExtC}) (4160)
  1744. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1745. CJK_Unified_Ideographs_Extension_D}
  1746. (Short: \p{InCJKExtD}) (224)
  1747. X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
  1748. CJK_Unified_Ideographs_Extension_E}
  1749. (Short: \p{InCJKExtE}) (5776)
  1750. X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
  1751. CJK_Unified_Ideographs_Extension_F}
  1752. (Short: \p{InCJKExtF}) (7488)
  1753. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1754. (Short: \p{Pe}) (73)
  1755. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1756. Unassigned}) (836_602 plus all above-
  1757. Unicode code points)
  1758. \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
  1759. Control}) (65)
  1760. \p{Co} \p{Private_Use} (= \p{General_Category=
  1761. Private_Use}) (NOT \p{Private_Use_Area})
  1762. (137_468)
  1763. X \p{Combining_Diacritical_Marks} \p{Block=
  1764. Combining_Diacritical_Marks} (Short:
  1765. \p{InDiacriticals}) (112)
  1766. X \p{Combining_Diacritical_Marks_Extended} \p{Block=
  1767. Combining_Diacritical_Marks_Extended}
  1768. (Short: \p{InDiacriticalsExt}) (80)
  1769. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1770. Combining_Diacritical_Marks_For_Symbols}
  1771. (Short: \p{InDiacriticalsForSymbols})
  1772. (48)
  1773. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1774. Combining_Diacritical_Marks_Supplement}
  1775. (Short: \p{InDiacriticalsSup}) (64)
  1776. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1777. \p{InHalfMarks}) (16)
  1778. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1779. (2268)
  1780. X \p{Combining_Marks_For_Symbols}
  1781. \p{Combining_Diacritical_Marks_For_-
  1782. Symbols} (= \p{Block=
  1783. Combining_Diacritical_Marks_For_-
  1784. Symbols}) (48)
  1785. \p{Common} \p{Script_Extensions=Common} (Short:
  1786. \p{Zyyy}) (7386)
  1787. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1788. (Short: \p{InIndicNumberForms}) (16)
  1789. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1790. \p{Full_Composition_Exclusion=Y}) (1120)
  1791. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1792. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1793. Hangul_Compatibility_Jamo}) (96)
  1794. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1795. \p{CE}) (81)
  1796. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
  1797. plus all above-Unicode code points:
  1798. U+0000..0957, U+0960..09DB, U+09DE,
  1799. U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
  1800. ...)
  1801. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
  1802. U+0958..095F, U+09DC..09DD, U+09DF,
  1803. U+0A33, U+0A36, U+0A59..0A5B ...)
  1804. \p{Connector_Punctuation} \p{General_Category=
  1805. Connector_Punctuation} (Short: \p{Pc})
  1806. (10)
  1807. \p{Control} \p{XPosixCntrl} (= \p{General_Category=
  1808. Control}) (65)
  1809. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1810. \p{Copt} \p{Coptic} (= \p{Script_Extensions=
  1811. Coptic}) (NOT \p{Block=Coptic}) (165)
  1812. \p{Coptic} \p{Script_Extensions=Coptic} (Short:
  1813. \p{Copt}; NOT \p{Block=Coptic}) (165)
  1814. X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
  1815. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1816. Counting_Rod_Numerals}) (32)
  1817. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1818. \p{InCountingRod}) (32)
  1819. \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
  1820. Cypriot}) (112)
  1821. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1822. Surrogate}) (2048)
  1823. \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
  1824. \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
  1825. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1826. \p{Block=
  1827. Cuneiform_Numbers_And_Punctuation}) (128)
  1828. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1829. Cuneiform_Numbers_And_Punctuation}
  1830. (Short: \p{InCuneiformNumbers}) (128)
  1831. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1832. (Short: \p{Sc}) (62)
  1833. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1834. \p{CWCF} \p{Changes_When_Casefolded} (=
  1835. \p{Changes_When_Casefolded=Y}) (1463)
  1836. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1837. \p{CWCM} \p{Changes_When_Casemapped} (=
  1838. \p{Changes_When_Casemapped=Y}) (2841)
  1839. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1840. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1841. \p{Changes_When_NFKC_Casefolded=Y})
  1842. (10_315)
  1843. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1844. \p{CWL} \p{Changes_When_Lowercased} (=
  1845. \p{Changes_When_Lowercased=Y}) (1390)
  1846. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1847. \p{CWT} \p{Changes_When_Titlecased} (=
  1848. \p{Changes_When_Titlecased=Y}) (1409)
  1849. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1850. \p{CWU} \p{Changes_When_Uppercased} (=
  1851. \p{Changes_When_Uppercased=Y}) (1482)
  1852. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1853. \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
  1854. \p{Cprt}) (112)
  1855. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1856. \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
  1857. \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
  1858. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1859. Cyrillic_Extended_A}) (32)
  1860. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1861. Cyrillic_Extended_B}) (96)
  1862. X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
  1863. Cyrillic_Extended_C}) (16)
  1864. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1865. \p{InCyrillicExtA}) (32)
  1866. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1867. \p{InCyrillicExtB}) (96)
  1868. X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
  1869. \p{InCyrillicExtC}) (16)
  1870. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1871. Cyrillic_Supplement}) (48)
  1872. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1873. \p{InCyrillicSup}) (48)
  1874. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1875. Cyrillic_Supplement}) (48)
  1876. \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
  1877. Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
  1878. \p{Dash} \p{Dash=Y} (28)
  1879. \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
  1880. above-Unicode code points: [\x00-
  1881. \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
  1882. Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
  1883. U+0100..0589, U+058B..05BD,
  1884. U+05BF..13FF, U+1401..1805, U+1807..200F
  1885. ...)
  1886. \p{Dash: Y*} (Single: \p{Dash}) (28: [\-], U+058A,
  1887. U+05BE, U+1400, U+1806, U+2010..2015 ...)
  1888. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1889. (Short: \p{Pd}) (24)
  1890. \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
  1891. Decimal_Number}) (630)
  1892. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1893. (13_232)
  1894. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
  1895. [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
  1896. \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
  1897. \xfd\xff], U+0100..010F, U+0112..0125,
  1898. U+0128..0130, U+0134..0137, U+0139..013E
  1899. ...)
  1900. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
  1901. U+2460..2473, U+24B6..24EA,
  1902. U+3244..3247, U+3251..327E,
  1903. U+3280..32BF, U+32D0..32FE ...)
  1904. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1905. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
  1906. [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
  1907. U+013F..0140, U+0149, U+017F,
  1908. U+01C4..01CC ...)
  1909. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1910. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1911. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
  1912. U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
  1913. ...)
  1914. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
  1915. U+210A..2113, U+2115, U+2119..211D,
  1916. U+2124, U+2128 ...)
  1917. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1918. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
  1919. \xbe], U+2150..215F, U+2189)
  1920. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1921. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
  1922. U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
  1923. ...)
  1924. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1925. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
  1926. U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
  1927. ...)
  1928. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1929. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
  1930. U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
  1931. ...)
  1932. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1933. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
  1934. U+FF61..FFBE, U+FFC2..FFC7,
  1935. U+FFCA..FFCF, U+FFD2..FFD7,
  1936. U+FFDA..FFDC, U+FFE8..FFEE)
  1937. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1938. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
  1939. U+0F0C, U+2007, U+2011, U+202F)
  1940. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1941. Non_Canonical} (Perl extension) (3664)
  1942. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1943. decompositions (Short: \p{Dt=NonCanon})
  1944. (Perl extension) (3664:
  1945. [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
  1946. \xbe], U+0132..0133, U+013F..0140,
  1947. U+0149, U+017F, U+01C4..01CC ...)
  1948. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_216 plus
  1949. all above-Unicode code points: [\x00-
  1950. \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
  1951. \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
  1952. \xdf\xe6\xf0\xf7-\xf8\xfe],
  1953. U+0110..0111, U+0126..0127, U+0131,
  1954. U+0138, U+0141..0142 ...)
  1955. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
  1956. U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
  1957. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1958. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
  1959. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
  1960. U+32CC..32CF, U+32FF..3357,
  1961. U+3371..33DF, U+33FF, U+1F130..1F14F ...)
  1962. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
  1963. U+2080..208E, U+2090..209C, U+2C7C)
  1964. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (153)
  1965. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (153: [\xaa\xb2-
  1966. \xb3\xb9-\xba], U+02B0..02B8,
  1967. U+02E0..02E4, U+10FC, U+1D2C..1D2E,
  1968. U+1D30..1D3A ...)
  1969. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1970. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
  1971. U+30FF, U+FE10..FE19, U+FE30..FE44,
  1972. U+FE47..FE48)
  1973. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
  1974. U+FF01..FF60, U+FFE0..FFE6)
  1975. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1976. Y} (Short: \p{DI}) (4173)
  1977. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1978. (1_109_939 plus all above-Unicode code
  1979. points: [\x00-\xac\xae-\xff],
  1980. U+0100..034E, U+0350..061B,
  1981. U+061D..115E, U+1161..17B3, U+17B6..180A
  1982. ...)
  1983. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1984. (4173: [\xad], U+034F, U+061C,
  1985. U+115F..1160, U+17B4..17B5, U+180B..180E
  1986. ...)
  1987. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
  1988. \p{Dep: *} \p{Deprecated: *}
  1989. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
  1990. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
  1991. plus all above-Unicode code points:
  1992. U+0000..0148, U+014A..0672,
  1993. U+0674..0F76, U+0F78, U+0F7A..17A2,
  1994. U+17A5..2069 ...)
  1995. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
  1996. U+0673, U+0F77, U+0F79, U+17A3..17A4,
  1997. U+206A..206F ...)
  1998. \p{Deseret} \p{Script_Extensions=Deseret} (Short:
  1999. \p{Dsrt}) (80)
  2000. \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
  2001. Devanagari}) (NOT \p{Block=Devanagari})
  2002. (210)
  2003. \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
  2004. \p{Deva}; NOT \p{Block=Devanagari}) (210)
  2005. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  2006. Devanagari_Extended}) (32)
  2007. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  2008. \p{InDevanagariExt}) (32)
  2009. \p{DI} \p{Default_Ignorable_Code_Point} (=
  2010. \p{Default_Ignorable_Code_Point=Y})
  2011. (4173)
  2012. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  2013. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (873)
  2014. \p{Dia: *} \p{Diacritic: *}
  2015. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (873)
  2016. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_239
  2017. plus all above-Unicode code points:
  2018. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  2019. >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
  2020. \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
  2021. U+0100..02AF, U+034F, U+0358..035C,
  2022. U+0363..0373, U+0376..0379 ...)
  2023. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (873:
  2024. [\^`\xa8\xaf\xb4\xb7-\xb8],
  2025. U+02B0..034E, U+0350..0357,
  2026. U+035D..0362, U+0374..0375, U+037A ...)
  2027. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  2028. \p{Block=Combining_Diacritical_Marks})
  2029. (112)
  2030. X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
  2031. (= \p{Block=
  2032. Combining_Diacritical_Marks_Extended})
  2033. (80)
  2034. X \p{Diacriticals_For_Symbols}
  2035. \p{Combining_Diacritical_Marks_For_-
  2036. Symbols} (= \p{Block=
  2037. Combining_Diacritical_Marks_For_-
  2038. Symbols}) (48)
  2039. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  2040. (= \p{Block=
  2041. Combining_Diacritical_Marks_Supplement})
  2042. (64)
  2043. \p{Digit} \p{XPosixDigit} (= \p{General_Category=
  2044. Decimal_Number}) (630)
  2045. X \p{Dingbats} \p{Block=Dingbats} (192)
  2046. \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra})
  2047. (NOT \p{Block=Dogra}) (82)
  2048. \p{Dogra} \p{Script_Extensions=Dogra} (Short:
  2049. \p{Dogr}; NOT \p{Block=Dogra}) (82)
  2050. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  2051. Domino_Tiles}) (112)
  2052. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  2053. \p{InDomino}) (112)
  2054. \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
  2055. Deseret}) (80)
  2056. \p{Dt: *} \p{Decomposition_Type: *}
  2057. \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
  2058. Duployan}) (NOT \p{Block=Duployan}) (147)
  2059. \p{Duployan} \p{Script_Extensions=Duployan} (Short:
  2060. \p{Dupl}; NOT \p{Block=Duployan}) (147)
  2061. \p{Ea: *} \p{East_Asian_Width: *}
  2062. X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
  2063. (208)
  2064. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
  2065. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
  2066. [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
  2067. \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
  2068. \xd8\xde-\xe1\xe6\xe8-\xea\xec-
  2069. \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
  2070. U+0101, U+0111, U+0113, U+011B,
  2071. U+0126..0127 ...)
  2072. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  2073. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
  2074. U+FF01..FF60, U+FFE0..FFE6)
  2075. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  2076. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
  2077. U+FF61..FFBE, U+FFC2..FFC7,
  2078. U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
  2079. ...)
  2080. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_252 plus
  2081. all above-Unicode code points)
  2082. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  2083. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
  2084. \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
  2085. U+27E6..27ED, U+2985..2986)
  2086. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_252 plus all
  2087. above-Unicode code points: [\x00-
  2088. \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
  2089. \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
  2090. \xe5\xe7\xeb\xee-\xef\xf1\xf4-
  2091. \xf6\xfb\xfd\xff], U+00FF..0100,
  2092. U+0102..0110, U+0112, U+0114..011A,
  2093. U+011C..0125 ...)
  2094. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_783)
  2095. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_783:
  2096. U+1100..115F, U+231A..231B,
  2097. U+2329..232A, U+23E9..23EC, U+23F0,
  2098. U+23F3 ...)
  2099. \p{Egyp} \p{Egyptian_Hieroglyphs} (=
  2100. \p{Script_Extensions=
  2101. Egyptian_Hieroglyphs}) (NOT \p{Block=
  2102. Egyptian_Hieroglyphs}) (1080)
  2103. X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
  2104. Egyptian_Hieroglyph_Format_Controls} (16)
  2105. \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
  2106. Egyptian_Hieroglyphs} (Short: \p{Egyp};
  2107. NOT \p{Block=Egyptian_Hieroglyphs})
  2108. (1080)
  2109. \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
  2110. Elbasan}) (NOT \p{Block=Elbasan}) (40)
  2111. \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
  2112. \p{Elba}; NOT \p{Block=Elbasan}) (40)
  2113. \p{Elym} \p{Elymaic} (= \p{Script_Extensions=
  2114. Elymaic}) (NOT \p{Block=Elymaic}) (23)
  2115. \p{Elymaic} \p{Script_Extensions=Elymaic} (Short:
  2116. \p{Elym}; NOT \p{Block=Elymaic}) (23)
  2117. X \p{Emoticons} \p{Block=Emoticons} (80)
  2118. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  2119. Enclosed_Alphanumerics}) (160)
  2120. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  2121. \p{Block=
  2122. Enclosed_Alphanumeric_Supplement}) (256)
  2123. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  2124. Enclosed_Alphanumeric_Supplement}
  2125. (Short: \p{InEnclosedAlphanumSup}) (256)
  2126. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  2127. (Short: \p{InEnclosedAlphanum}) (160)
  2128. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  2129. \p{Block=
  2130. Enclosed_CJK_Letters_And_Months}) (256)
  2131. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  2132. Enclosed_CJK_Letters_And_Months} (Short:
  2133. \p{InEnclosedCJK}) (256)
  2134. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  2135. (= \p{Block=
  2136. Enclosed_Ideographic_Supplement}) (256)
  2137. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  2138. Enclosed_Ideographic_Supplement} (Short:
  2139. \p{InEnclosedIdeographicSup}) (256)
  2140. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  2141. (Short: \p{Me}) (13)
  2142. \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
  2143. Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
  2144. \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
  2145. \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
  2146. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  2147. Ethiopic_Extended}) (96)
  2148. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  2149. Ethiopic_Extended_A}) (48)
  2150. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  2151. \p{InEthiopicExt}) (96)
  2152. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  2153. \p{InEthiopicExtA}) (48)
  2154. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  2155. Ethiopic_Supplement}) (32)
  2156. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  2157. \p{InEthiopicSup}) (32)
  2158. \p{Ext} \p{Extender} (= \p{Extender=Y}) (47)
  2159. \p{Ext: *} \p{Extender: *}
  2160. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (47)
  2161. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_065
  2162. plus all above-Unicode code points:
  2163. [\x00-\xb6\xb8-\xff], U+0100..02CF,
  2164. U+02D2..063F, U+0641..07F9,
  2165. U+07FB..0E45, U+0E47..0EC5 ...)
  2166. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (47: [\xb7],
  2167. U+02D0..02D1, U+0640, U+07FA, U+0E46,
  2168. U+0EC6 ...)
  2169. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  2170. (Short: \p{Pf}) (10)
  2171. \p{Format} \p{General_Category=Format} (Short:
  2172. \p{Cf}) (161)
  2173. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  2174. (Short: \p{CompEx}) (1120)
  2175. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  2176. \P{CompEx}) (1_112_992 plus all above-
  2177. Unicode code points: U+0000..033F,
  2178. U+0342, U+0345..0373, U+0375..037D,
  2179. U+037F..0386, U+0388..0957 ...)
  2180. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  2181. \p{CompEx}) (1120: U+0340..0341,
  2182. U+0343..0344, U+0374, U+037E, U+0387,
  2183. U+0958..095F ...)
  2184. \p{Gc: *} \p{General_Category: *}
  2185. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  2186. \p{General_Category: C} \p{General_Category=Other} (976_344 plus
  2187. all above-Unicode code points)
  2188. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  2189. \p{Gc=LC}, \p{LC}) (3970: [A-Za-
  2190. z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
  2191. U+0100..01BA, U+01BC..01BF,
  2192. U+01C4..0293, U+0295..02AF, U+0370..0373
  2193. ...)
  2194. \p{General_Category: Cc} \p{General_Category=Control} (65)
  2195. \p{General_Category: Cf} \p{General_Category=Format} (161)
  2196. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  2197. (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
  2198. U+2046, U+207E ...)
  2199. \p{General_Category: Cn} \p{General_Category=Unassigned} (836_602
  2200. plus all above-Unicode code points)
  2201. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  2202. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  2203. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  2204. (2268)
  2205. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  2206. \p{Pc}) (10: [_], U+203F..2040, U+2054,
  2207. U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
  2208. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
  2209. [\x00-\x1f\x7f-\x9f])
  2210. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  2211. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  2212. (62: [\$\xa2-\xa5], U+058F, U+060B,
  2213. U+07FE..07FF, U+09F2..09F3, U+09FB ...)
  2214. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  2215. (24: [\-], U+058A, U+05BE, U+1400,
  2216. U+1806, U+2010..2015 ...)
  2217. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  2218. (630: [0-9], U+0660..0669, U+06F0..06F9,
  2219. U+07C0..07C9, U+0966..096F, U+09E6..09EF
  2220. ...)
  2221. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  2222. (630)
  2223. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  2224. (13: U+0488..0489, U+1ABE, U+20DD..20E0,
  2225. U+20E2..20E4, U+A670..A672)
  2226. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  2227. (10: [\xbb], U+2019, U+201D, U+203A,
  2228. U+2E03, U+2E05 ...)
  2229. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161:
  2230. [\xad], U+0600..0605, U+061C, U+06DD,
  2231. U+070F, U+08E2 ...)
  2232. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  2233. \p{Pi}) (12: [\xab], U+2018,
  2234. U+201B..201C, U+201F, U+2039, U+2E02 ...)
  2235. \p{General_Category: L} \p{General_Category=Letter} (125_643)
  2236. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3970)
  2237. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  2238. the trailing '_' matters in spite of
  2239. loose matching rules. (3970)
  2240. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3970)
  2241. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_643:
  2242. [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
  2243. \xf6\xf8-\xff], U+0100..02C1,
  2244. U+02C6..02D1, U+02E0..02E4, U+02EC,
  2245. U+02EE ...)
  2246. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  2247. (236: U+16EE..16F0, U+2160..2182,
  2248. U+2185..2188, U+3007, U+3021..3029,
  2249. U+3038..303A ...)
  2250. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
  2251. (1: U+2028)
  2252. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  2253. (/i= General_Category=Cased_Letter)
  2254. (2151)
  2255. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (259)
  2256. \p{General_Category: Lo} \p{General_Category=Other_Letter}
  2257. (121_414)
  2258. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  2259. /i= General_Category=Cased_Letter)
  2260. (2151: [a-z\xb5\xdf-\xf6\xf8-\xff],
  2261. U+0101, U+0103, U+0105, U+0107, U+0109
  2262. ...)
  2263. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  2264. (/i= General_Category=Cased_Letter) (31)
  2265. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  2266. (/i= General_Category=Cased_Letter)
  2267. (1788)
  2268. \p{General_Category: M} \p{General_Category=Mark} (2268)
  2269. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2268:
  2270. U+0300..036F, U+0483..0489,
  2271. U+0591..05BD, U+05BF, U+05C1..05C2,
  2272. U+05C4..05C5 ...)
  2273. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
  2274. [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
  2275. U+0606..0608, U+2044, U+2052,
  2276. U+207A..207C ...)
  2277. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (429)
  2278. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
  2279. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  2280. (1826)
  2281. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  2282. (259: U+02B0..02C1, U+02C6..02D1,
  2283. U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
  2284. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  2285. (121: [\^`\xa8\xaf\xb4\xb8],
  2286. U+02C2..02C5, U+02D2..02DF,
  2287. U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  2288. \p{General_Category: N} \p{General_Category=Number} (1754)
  2289. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (630)
  2290. \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
  2291. \p{General_Category: No} \p{General_Category=Other_Number} (888)
  2292. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  2293. (1826: U+0300..036F, U+0483..0487,
  2294. U+0591..05BD, U+05BF, U+05C1..05C2,
  2295. U+05C4..05C5 ...)
  2296. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1754:
  2297. [0-9\xb2-\xb3\xb9\xbc-\xbe],
  2298. U+0660..0669, U+06F0..06F9,
  2299. U+07C0..07C9, U+0966..096F, U+09E6..09EF
  2300. ...)
  2301. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  2302. (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
  2303. U+201A, U+201E ...)
  2304. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (976_344 plus
  2305. all above-Unicode code points: [\x00-
  2306. \x1f\x7f-\x9f\xad], U+0378..0379,
  2307. U+0380..0383, U+038B, U+038D, U+03A2 ...)
  2308. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  2309. (121_414: [\xaa\xba], U+01BB,
  2310. U+01C0..01C3, U+0294, U+05D0..05EA,
  2311. U+05EF..05F2 ...)
  2312. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
  2313. (888: [\xb2-\xb3\xb9\xbc-\xbe],
  2314. U+09F4..09F9, U+0B72..0B77,
  2315. U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
  2316. ...)
  2317. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  2318. (588:
  2319. [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
  2320. \xb7\xbf], U+037E, U+0387, U+055A..055F,
  2321. U+0589, U+05C0 ...)
  2322. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  2323. (6161: [\xa6\xa9\xae\xb0], U+0482,
  2324. U+058D..058E, U+060E..060F, U+06DE,
  2325. U+06E9 ...)
  2326. \p{General_Category: P} \p{General_Category=Punctuation} (792)
  2327. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  2328. \p{Zp}) (1: U+2029)
  2329. \p{General_Category: Pc} \p{General_Category=
  2330. Connector_Punctuation} (10)
  2331. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
  2332. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  2333. (73)
  2334. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  2335. (10)
  2336. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  2337. (12)
  2338. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  2339. (588)
  2340. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  2341. (137_468: U+E000..F8FF, U+F0000..FFFFD,
  2342. U+100000..10FFFD)
  2343. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
  2344. \p{General_Category: Punct} \p{General_Category=Punctuation} (792)
  2345. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (792:
  2346. [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
  2347. \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
  2348. U+037E, U+0387, U+055A..055F,
  2349. U+0589..058A, U+05BE ...)
  2350. \p{General_Category: S} \p{General_Category=Symbol} (7292)
  2351. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62)
  2352. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
  2353. [\x20\xa0], U+1680, U+2000..200A,
  2354. U+2028..2029, U+202F, U+205F ...)
  2355. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
  2356. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  2357. \p{General_Category: So} \p{General_Category=Other_Symbol} (6161)
  2358. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  2359. (17: [\x20\xa0], U+1680, U+2000..200A,
  2360. U+202F, U+205F, U+3000)
  2361. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
  2362. (429: U+0903, U+093B, U+093E..0940,
  2363. U+0949..094C, U+094E..094F, U+0982..0983
  2364. ...)
  2365. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
  2366. U+D800..DFFF)
  2367. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7292:
  2368. [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
  2369. \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
  2370. U+02D2..02DF, U+02E5..02EB, U+02ED,
  2371. U+02EF..02FF ...)
  2372. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  2373. /i= General_Category=Cased_Letter) (31:
  2374. U+01C5, U+01C8, U+01CB, U+01F2,
  2375. U+1F88..1F8F, U+1F98..1F9F ...)
  2376. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  2377. (836_602 plus all above-Unicode code
  2378. points: U+0378..0379, U+0380..0383,
  2379. U+038B, U+038D, U+03A2, U+0530 ...)
  2380. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  2381. /i= General_Category=Cased_Letter)
  2382. (1788: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
  2383. U+0102, U+0104, U+0106, U+0108 ...)
  2384. \p{General_Category: Z} \p{General_Category=Separator} (19)
  2385. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  2386. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  2387. (1)
  2388. \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
  2389. X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
  2390. \p{InPunctuation}) (112)
  2391. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  2392. X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
  2393. \p{Block=Geometric_Shapes_Extended})
  2394. (128)
  2395. X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
  2396. (Short: \p{InGeometricShapesExt}) (128)
  2397. \p{Geor} \p{Georgian} (= \p{Script_Extensions=
  2398. Georgian}) (NOT \p{Block=Georgian}) (175)
  2399. \p{Georgian} \p{Script_Extensions=Georgian} (Short:
  2400. \p{Geor}; NOT \p{Block=Georgian}) (175)
  2401. X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block=
  2402. Georgian_Extended}) (48)
  2403. X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short:
  2404. \p{InGeorgianExt}) (48)
  2405. X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
  2406. Georgian_Supplement}) (48)
  2407. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
  2408. \p{InGeorgianSup}) (48)
  2409. \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
  2410. Glagolitic}) (NOT \p{Block=Glagolitic})
  2411. (136)
  2412. \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
  2413. \p{Glag}; NOT \p{Block=Glagolitic}) (136)
  2414. X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
  2415. Glagolitic_Supplement}) (48)
  2416. X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
  2417. \p{InGlagoliticSup}) (48)
  2418. \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions=
  2419. Gunjala_Gondi}) (NOT \p{Block=
  2420. Gunjala_Gondi}) (65)
  2421. \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
  2422. Masaram_Gondi}) (NOT \p{Block=
  2423. Masaram_Gondi}) (77)
  2424. \p{Goth} \p{Gothic} (= \p{Script_Extensions=
  2425. Gothic}) (NOT \p{Block=Gothic}) (27)
  2426. \p{Gothic} \p{Script_Extensions=Gothic} (Short:
  2427. \p{Goth}; NOT \p{Block=Gothic}) (27)
  2428. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  2429. (135_898)
  2430. \p{Gr_Base: *} \p{Grapheme_Base: *}
  2431. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  2432. Y}) (1965)
  2433. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  2434. \p{Gran} \p{Grantha} (= \p{Script_Extensions=
  2435. Grantha}) (NOT \p{Block=Grantha}) (116)
  2436. \p{Grantha} \p{Script_Extensions=Grantha} (Short:
  2437. \p{Gran}; NOT \p{Block=Grantha}) (116)
  2438. \p{Graph} \p{XPosixGraph} (275_378)
  2439. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  2440. (135_898)
  2441. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (978_214
  2442. plus all above-Unicode code points:
  2443. [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
  2444. U+0378..0379, U+0380..0383, U+038B,
  2445. U+038D ...)
  2446. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
  2447. (135_898: [\x20-\x7e\xa0-\xac\xae-\xff],
  2448. U+0100..02FF, U+0370..0377,
  2449. U+037A..037F, U+0384..038A, U+038C ...)
  2450. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  2451. (3886)
  2452. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886:
  2453. [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
  2454. U+061C, U+180E, U+200B, U+200E..200F,
  2455. U+2028..202E ...)
  2456. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
  2457. \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
  2458. \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
  2459. \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
  2460. \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
  2461. (0)
  2462. \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
  2463. E_Base_GAZ} (0)
  2464. \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
  2465. E_Modifier} (0)
  2466. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  2467. (1970)
  2468. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1970:
  2469. U+0300..036F, U+0483..0489,
  2470. U+0591..05BD, U+05BF, U+05C1..05C2,
  2471. U+05C4..05C5 ...)
  2472. \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
  2473. Glue_After_Zwj} (0)
  2474. \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
  2475. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
  2476. U+1100..115F, U+A960..A97C)
  2477. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
  2478. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
  2479. U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
  2480. ...)
  2481. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
  2482. U+AC01..AC1B, U+AC1D..AC37,
  2483. U+AC39..AC53, U+AC55..AC6F,
  2484. U+AC71..AC8B, U+AC8D..ACA7 ...)
  2485. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_301
  2486. plus all above-Unicode code points:
  2487. [\x20-\x7e\xa0-\xac\xae-\xff],
  2488. U+0100..02FF, U+0370..0482,
  2489. U+048A..0590, U+05BE, U+05C0 ...)
  2490. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  2491. (22)
  2492. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (22:
  2493. U+0600..0605, U+06DD, U+070F, U+08E2,
  2494. U+0D4E, U+110BD ...)
  2495. \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
  2496. (26: U+1F1E6..1F1FF)
  2497. \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
  2498. Regional_Indicator} (26)
  2499. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  2500. SpacingMark} (375)
  2501. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (375:
  2502. U+0903, U+093B, U+093E..0940,
  2503. U+0949..094C, U+094E..094F, U+0982..0983
  2504. ...)
  2505. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
  2506. U+11A8..11FF, U+D7CB..D7FB)
  2507. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
  2508. U+1160..11A7, U+D7B0..D7C6)
  2509. \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
  2510. (1_096_301 plus all above-Unicode code
  2511. points)
  2512. \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
  2513. \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
  2514. (1965)
  2515. \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_147
  2516. plus all above-Unicode code points:
  2517. U+0000..02FF, U+0370..0482,
  2518. U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
  2519. \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1965:
  2520. U+0300..036F, U+0483..0489,
  2521. U+0591..05BD, U+05BF, U+05C1..05C2,
  2522. U+05C4..05C5 ...)
  2523. \p{Greek} \p{Script_Extensions=Greek} (Short:
  2524. \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
  2525. X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
  2526. \p{InGreek}) (144)
  2527. X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
  2528. Greek_Extended}) (256)
  2529. X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
  2530. \p{InGreekExt}) (256)
  2531. \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
  2532. (NOT \p{Greek_And_Coptic}) (522)
  2533. \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
  2534. \p{Gujr}; NOT \p{Block=Gujarati}) (105)
  2535. \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
  2536. Gujarati}) (NOT \p{Block=Gujarati}) (105)
  2537. \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
  2538. (Short: \p{Gong}; NOT \p{Block=
  2539. Gunjala_Gondi}) (65)
  2540. \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
  2541. \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
  2542. \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
  2543. Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
  2544. X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
  2545. \p{Block=Halfwidth_And_Fullwidth_Forms})
  2546. (240)
  2547. X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
  2548. Combining_Half_Marks}) (16)
  2549. X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
  2550. Halfwidth_And_Fullwidth_Forms} (Short:
  2551. \p{InHalfAndFullForms}) (240)
  2552. \p{Han} \p{Script_Extensions=Han} (89_513)
  2553. \p{Hang} \p{Hangul} (= \p{Script_Extensions=
  2554. Hangul}) (NOT \p{Hangul_Syllables})
  2555. (11_775)
  2556. \p{Hangul} \p{Script_Extensions=Hangul} (Short:
  2557. \p{Hang}; NOT \p{Hangul_Syllables})
  2558. (11_775)
  2559. X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
  2560. (Short: \p{InCompatJamo}) (96)
  2561. X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
  2562. (256)
  2563. X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
  2564. (Short: \p{InJamoExtA}) (32)
  2565. X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
  2566. (Short: \p{InJamoExtB}) (80)
  2567. \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
  2568. (125)
  2569. \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
  2570. U+1100..115F, U+A960..A97C)
  2571. \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
  2572. (399)
  2573. \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
  2574. U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
  2575. U+AC8C ...)
  2576. \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
  2577. LVT_Syllable} (10_773)
  2578. \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
  2579. (10_773: U+AC01..AC1B, U+AC1D..AC37,
  2580. U+AC39..AC53, U+AC55..AC6F,
  2581. U+AC71..AC8B, U+AC8D..ACA7 ...)
  2582. \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
  2583. Not_Applicable} (1_102_583 plus all
  2584. above-Unicode code points)
  2585. \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
  2586. (1_102_583 plus all above-Unicode code
  2587. points: U+0000..10FF, U+1200..A95F,
  2588. U+A97D..ABFF, U+D7A4..D7AF,
  2589. U+D7C7..D7CA, U+D7FC..infinity)
  2590. \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
  2591. (137)
  2592. \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
  2593. U+11A8..11FF, U+D7CB..D7FB)
  2594. \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
  2595. (95)
  2596. \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
  2597. U+1160..11A7, U+D7B0..D7C6)
  2598. X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
  2599. \p{InHangul}) (11_184)
  2600. \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
  2601. (89_513)
  2602. \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
  2603. (Short: \p{Rohg}; NOT \p{Block=
  2604. Hanifi_Rohingya}) (55)
  2605. \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
  2606. Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
  2607. \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
  2608. \p{Hano}; NOT \p{Block=Hanunoo}) (23)
  2609. \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
  2610. Hatran}) (NOT \p{Block=Hatran}) (26)
  2611. \p{Hatran} \p{Script_Extensions=Hatran} (Short:
  2612. \p{Hatr}; NOT \p{Block=Hatran}) (26)
  2613. \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
  2614. Hebrew}) (NOT \p{Block=Hebrew}) (134)
  2615. \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
  2616. \p{Hebr}; NOT \p{Block=Hebrew}) (134)
  2617. \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  2618. \p{Hex: *} \p{Hex_Digit: *}
  2619. \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  2620. \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
  2621. plus all above-Unicode code points:
  2622. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
  2623. >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
  2624. U+0100..FF0F, U+FF1A..FF20,
  2625. U+FF27..FF40, U+FF47..infinity)
  2626. \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
  2627. f], U+FF10..FF19, U+FF21..FF26,
  2628. U+FF41..FF46)
  2629. X \p{High_Private_Use_Surrogates} \p{Block=
  2630. High_Private_Use_Surrogates} (Short:
  2631. \p{InHighPUSurrogates}) (128)
  2632. X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
  2633. \p{Block=High_Private_Use_Surrogates})
  2634. (128)
  2635. X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  2636. \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
  2637. Hiragana}) (NOT \p{Block=Hiragana}) (431)
  2638. \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
  2639. \p{Hira}; NOT \p{Block=Hiragana}) (431)
  2640. \p{Hluw} \p{Anatolian_Hieroglyphs} (=
  2641. \p{Script_Extensions=
  2642. Anatolian_Hieroglyphs}) (NOT \p{Block=
  2643. Anatolian_Hieroglyphs}) (583)
  2644. \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
  2645. Pahawh_Hmong}) (NOT \p{Block=
  2646. Pahawh_Hmong}) (127)
  2647. \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
  2648. \p{Script_Extensions=
  2649. Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
  2650. Nyiakeng_Puachue_Hmong}) (71)
  2651. \p{HorizSpace} \p{XPosixBlank} (18)
  2652. \p{Hst: *} \p{Hangul_Syllable_Type: *}
  2653. \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
  2654. Old_Hungarian}) (NOT \p{Block=
  2655. Old_Hungarian}) (108)
  2656. D \p{Hyphen} \p{Hyphen=Y} (11)
  2657. D \p{Hyphen: N*} Supplanted by Line_Break property values;
  2658. see www.unicode.org/reports/tr14
  2659. (Single: \P{Hyphen}) (1_114_101 plus all
  2660. above-Unicode code points: [\x00-
  2661. \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
  2662. Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
  2663. \xff], U+0100..0589, U+058B..1805,
  2664. U+1807..200F, U+2012..2E16, U+2E18..30FA
  2665. ...)
  2666. D \p{Hyphen: Y*} Supplanted by Line_Break property values;
  2667. see www.unicode.org/reports/tr14
  2668. (Single: \p{Hyphen}) (11: [\-\xad],
  2669. U+058A, U+1806, U+2010..2011, U+2E17,
  2670. U+30FB ...)
  2671. \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
  2672. \p{Ideographic_Description_Characters})
  2673. (128_789)
  2674. \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (985_323 plus
  2675. all above-Unicode code points: [\x00-
  2676. \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
  2677. >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
  2678. \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
  2679. U+02C2..02C5, U+02D2..02DF,
  2680. U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  2681. \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (128_789:
  2682. [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
  2683. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
  2684. U+02C6..02D1, U+02E0..02E4, U+02EC,
  2685. U+02EE ...)
  2686. \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (125_884)
  2687. \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (988_228 plus
  2688. all above-Unicode code points: [\x00-
  2689. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  2690. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  2691. \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
  2692. U+02C2..02C5, U+02D2..02DF,
  2693. U+02E5..02EB, U+02ED, U+02EF..036F ...)
  2694. \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (125_884: [A-
  2695. Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  2696. \xff], U+0100..02C1, U+02C6..02D1,
  2697. U+02E0..02E4, U+02EC, U+02EE ...)
  2698. \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
  2699. \p{Ideographic_Description_Characters})
  2700. (128_789)
  2701. \p{IDC: *} \p{ID_Continue: *}
  2702. \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
  2703. (96_190)
  2704. \p{Ideo: *} \p{Ideographic: *}
  2705. \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
  2706. (96_190)
  2707. \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_017_922
  2708. plus all above-Unicode code points:
  2709. U+0000..3005, U+3008..3020,
  2710. U+302A..3037, U+303B..33FF,
  2711. U+4DB6..4DFF, U+9FF0..F8FF ...)
  2712. \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (96_190:
  2713. U+3006..3007, U+3021..3029,
  2714. U+3038..303A, U+3400..4DB5,
  2715. U+4E00..9FEF, U+F900..FA6D ...)
  2716. X \p{Ideographic_Description_Characters} \p{Block=
  2717. Ideographic_Description_Characters}
  2718. (Short: \p{InIDC}) (16)
  2719. X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
  2720. \p{Block=
  2721. Ideographic_Symbols_And_Punctuation})
  2722. (32)
  2723. X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
  2724. Ideographic_Symbols_And_Punctuation}
  2725. (Short: \p{InIdeographicSymbols}) (32)
  2726. \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (125_884)
  2727. \p{IDS: *} \p{ID_Start: *}
  2728. \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
  2729. \p{IDSB}) (10)
  2730. \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
  2731. (1_114_102 plus all above-Unicode code
  2732. points: U+0000..2FEF, U+2FF2..2FF3,
  2733. U+2FFC..infinity)
  2734. \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
  2735. U+2FF0..2FF1, U+2FF4..2FFB)
  2736. \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
  2737. \p{IDST}) (2)
  2738. \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
  2739. (1_114_110 plus all above-Unicode code
  2740. points: U+0000..2FF1, U+2FF4..infinity)
  2741. \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
  2742. U+2FF2..2FF3)
  2743. \p{IDSB} \p{IDS_Binary_Operator} (=
  2744. \p{IDS_Binary_Operator=Y}) (10)
  2745. \p{IDSB: *} \p{IDS_Binary_Operator: *}
  2746. \p{IDST} \p{IDS_Trinary_Operator} (=
  2747. \p{IDS_Trinary_Operator=Y}) (2)
  2748. \p{IDST: *} \p{IDS_Trinary_Operator: *}
  2749. \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
  2750. (Short: \p{Armi}; NOT \p{Block=
  2751. Imperial_Aramaic}) (31)
  2752. \p{In: *} \p{Present_In: *} (Perl extension)
  2753. X \p{In_*} \p{Block: *}
  2754. X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
  2755. Common_Indic_Number_Forms}) (16)
  2756. \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
  2757. (349: U+093C, U+0941..0944, U+094D,
  2758. U+0952, U+0956..0957, U+0962..0963 ...)
  2759. \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
  2760. BottomAndLeft}) (1: U+A9BF)
  2761. \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
  2762. BottomAndRight}) (2: U+1B3B, U+A9C0)
  2763. \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (61:
  2764. U+093F, U+094E, U+09BF, U+09C7..09C8,
  2765. U+0A3F, U+0ABF ...)
  2766. \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
  2767. LeftAndRight}) (21: U+09CB..09CC,
  2768. U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
  2769. U+0DDC, U+0DDE ...)
  2770. \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_936
  2771. plus all above-Unicode code points:
  2772. U+0000..08FF, U+0904..0939, U+093D,
  2773. U+0950, U+0958..0961, U+0964..0980 ...)
  2774. \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
  2775. Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
  2776. U+10A01, U+10A06)
  2777. \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (281:
  2778. U+0903, U+093B, U+093E, U+0940,
  2779. U+0949..094C, U+094F ...)
  2780. \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (398:
  2781. U+0900..0902, U+093A, U+0945..0948,
  2782. U+0951, U+0953..0955, U+0981 ...)
  2783. \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
  2784. TopAndBottom}) (10: U+0C48, U+0F73,
  2785. U+0F76..0F79, U+0F81, U+1B3C,
  2786. U+1112E..1112F)
  2787. \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
  2788. \p{InPC=TopAndBottomAndRight}) (1:
  2789. U+1B3D)
  2790. \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
  2791. TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
  2792. U+1C29, U+114BB, U+115B9)
  2793. \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
  2794. \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
  2795. U+0DDD, U+17BF, U+115BB)
  2796. \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
  2797. TopAndRight}) (13: U+0AC9, U+0B57,
  2798. U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
  2799. U+1925..1926 ...)
  2800. \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
  2801. VisualOrderLeft}) (19: U+0E40..0E44,
  2802. U+0EC0..0EC4, U+19B5..19B7, U+19BA,
  2803. U+AAB5..AAB6, U+AAB9 ...)
  2804. X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
  2805. \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
  2806. (17: U+093D, U+09BD, U+0ABD, U+0B3D,
  2807. U+0C3D, U+0CBD ...)
  2808. \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (86:
  2809. U+0900..0902, U+0981..0982, U+09FC,
  2810. U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
  2811. \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
  2812. \p{InSC=BrahmiJoiningNumber}) (20:
  2813. U+11052..11065)
  2814. \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
  2815. CantillationMark}) (59: U+0951..0952,
  2816. U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
  2817. U+1CD4..1CE1, U+1CF4 ...)
  2818. \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
  2819. (2160: U+0915..0939, U+0958..095F,
  2820. U+0978..097F, U+0995..09A8,
  2821. U+09AA..09B0, U+09B2 ...)
  2822. \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
  2823. ConsonantDead}) (12: U+09CE,
  2824. U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3)
  2825. \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
  2826. ConsonantFinal}) (67: U+1930..1931,
  2827. U+1933..1939, U+19C1..19C7,
  2828. U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
  2829. ...)
  2830. \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
  2831. \p{InSC=ConsonantHeadLetter}) (5:
  2832. U+0F88..0F8C)
  2833. \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
  2834. \p{InSC=ConsonantInitialPostfixed}) (1:
  2835. U+1A5A)
  2836. \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
  2837. ConsonantKiller}) (2: U+0E4C, U+17CD)
  2838. \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
  2839. ConsonantMedial}) (29: U+0A75,
  2840. U+0EBC..0EBD, U+103B..103E,
  2841. U+105E..1060, U+1082, U+1A55..1A56 ...)
  2842. \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
  2843. \p{InSC=ConsonantPlaceholder}) (22: [\-
  2844. \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
  2845. U+104E, U+1900 ...)
  2846. \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
  2847. \p{InSC=ConsonantPrecedingRepha}) (2:
  2848. U+0D4E, U+11D46)
  2849. \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
  2850. ConsonantPrefixed}) (9: U+111C2..111C3,
  2851. U+11A3A, U+11A84..11A89)
  2852. \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
  2853. ConsonantSubjoined}) (94: U+0F8D..0F97,
  2854. U+0F99..0FBC, U+1929..192B, U+1A57,
  2855. U+1A5B..1A5E, U+1BA1..1BA3 ...)
  2856. \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
  2857. \p{InSC=ConsonantSucceedingRepha}) (4:
  2858. U+17CC, U+1B03, U+1B81, U+A982)
  2859. \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
  2860. \p{InSC=ConsonantWithStacker}) (6:
  2861. U+0CF1..0CF2, U+1CF5..1CF6,
  2862. U+11003..11004)
  2863. \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
  2864. GeminationMark}) (3: U+0A71, U+11237,
  2865. U+11A98)
  2866. \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
  2867. InvisibleStacker}) (11: U+1039, U+17D2,
  2868. U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
  2869. \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
  2870. U+200D)
  2871. \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
  2872. ModifyingLetter}) (1: U+0B83)
  2873. \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
  2874. NonJoiner}) (1: U+200C)
  2875. \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (30:
  2876. U+093C, U+09BC, U+0A3C, U+0ABC,
  2877. U+0AFD..0AFF, U+0B3C ...)
  2878. \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (481:
  2879. [0-9], U+0966..096F, U+09E6..09EF,
  2880. U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
  2881. ...)
  2882. \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
  2883. NumberJoiner}) (1: U+1107F)
  2884. \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
  2885. (1_109_650 plus all above-Unicode code
  2886. points: [\x00-
  2887. \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
  2888. Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
  2889. \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
  2890. U+0950, U+0953..0954, U+0964..0965,
  2891. U+0970..0971 ...)
  2892. \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
  2893. PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
  2894. U+0E4E, U+0EBA, U+0F84, U+103A ...)
  2895. \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
  2896. RegisterShifter}) (2: U+17C9..17CA)
  2897. \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
  2898. SyllableModifier}) (25: [\xb2-\xb3],
  2899. U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
  2900. ...)
  2901. \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
  2902. ToneLetter}) (7: U+1970..1974, U+AAC0,
  2903. U+AAC2)
  2904. \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
  2905. (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
  2906. U+1063..1064, U+1069..106D, U+1087..108D
  2907. ...)
  2908. \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
  2909. U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
  2910. U+0BCD ...)
  2911. \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
  2912. (35: U+0903, U+0983, U+0A03, U+0A83,
  2913. U+0B03, U+0C03 ...)
  2914. \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
  2915. U+1963..196D, U+A85E..A861, U+A866,
  2916. U+A922..A92A, U+11150..11154)
  2917. \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
  2918. VowelDependent}) (673: U+093A..093B,
  2919. U+093E..094C, U+094E..094F,
  2920. U+0955..0957, U+0962..0963, U+09BE..09C4
  2921. ...)
  2922. \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
  2923. VowelIndependent}) (476: U+0904..0914,
  2924. U+0960..0961, U+0972..0977,
  2925. U+0985..098C, U+098F..0990, U+0993..0994
  2926. ...)
  2927. \p{Inherited} \p{Script_Extensions=Inherited} (Short:
  2928. \p{Zinh}) (502)
  2929. \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
  2930. (Short: \p{Pi}) (12)
  2931. \p{InPC: *} \p{Indic_Positional_Category: *}
  2932. \p{InSC: *} \p{Indic_Syllabic_Category: *}
  2933. \p{Inscriptional_Pahlavi} \p{Script_Extensions=
  2934. Inscriptional_Pahlavi} (Short: \p{Phli};
  2935. NOT \p{Block=Inscriptional_Pahlavi}) (27)
  2936. \p{Inscriptional_Parthian} \p{Script_Extensions=
  2937. Inscriptional_Parthian} (Short:
  2938. \p{Prti}; NOT \p{Block=
  2939. Inscriptional_Parthian}) (30)
  2940. X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
  2941. IPA_Extensions}) (96)
  2942. X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
  2943. \p{InIPAExt}) (96)
  2944. \p{Is_*} \p{*} (Any exceptions are individually
  2945. noted beginning with the word NOT.) If
  2946. an entry has flag(s) at its beginning,
  2947. like "D", the "Is_" form has the same
  2948. flag(s)
  2949. \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
  2950. Old_Italic}) (NOT \p{Block=Old_Italic})
  2951. (39)
  2952. X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
  2953. (256)
  2954. X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
  2955. Hangul_Jamo_Extended_A}) (32)
  2956. X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
  2957. Hangul_Jamo_Extended_B}) (80)
  2958. \p{Java} \p{Javanese} (= \p{Script_Extensions=
  2959. Javanese}) (NOT \p{Block=Javanese}) (91)
  2960. \p{Javanese} \p{Script_Extensions=Javanese} (Short:
  2961. \p{Java}; NOT \p{Block=Javanese}) (91)
  2962. \p{Jg: *} \p{Joining_Group: *}
  2963. \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  2964. \p{Join_C: *} \p{Join_Control: *}
  2965. \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  2966. \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
  2967. plus all above-Unicode code points:
  2968. U+0000..200B, U+200E..infinity)
  2969. \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
  2970. U+200C..200D)
  2971. \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
  2972. U+08BB)
  2973. \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
  2974. U+08BD)
  2975. \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
  2976. U+08BC)
  2977. \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8: U+0639..063A,
  2978. U+06A0, U+06FC, U+075D..075F, U+08B3)
  2979. \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
  2980. \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
  2981. U+0625, U+0627, U+0671..0673, U+0675,
  2982. U+0773..0774)
  2983. \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24: U+0628,
  2984. U+062A..062B, U+066E, U+0679..0680,
  2985. U+0750..0756, U+08A0..08A1 ...)
  2986. \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
  2987. \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
  2988. BurushaskiYehBarree}) (2: U+077A..077B)
  2989. \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
  2990. U+0688..0690, U+06EE, U+0759..075A,
  2991. U+08AE)
  2992. \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
  2993. U+0715..0716, U+072A, U+072F)
  2994. \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
  2995. \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
  2996. U+063D..063F, U+06CC, U+06CE,
  2997. U+0775..0776)
  2998. \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
  2999. \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
  3000. U+06A1..06A6, U+0760..0761, U+08A4)
  3001. \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
  3002. U+0724)
  3003. \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14: U+063B..063C,
  3004. U+06A9, U+06AB, U+06AF..06B4,
  3005. U+0762..0764, U+08B0)
  3006. \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
  3007. U+072E)
  3008. \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18: U+062C..062E,
  3009. U+0681..0687, U+06BF, U+0757..0758,
  3010. U+076E..076F, U+0772 ...)
  3011. \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
  3012. HamzaOnHehGoal}) (1: U+06C3)
  3013. \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
  3014. HanifiRohingyaKinnaYa}) (4: U+10D19,
  3015. U+10D1E, U+10D20, U+10D23)
  3016. \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
  3017. HanifiRohingyaPa}) (3: U+10D02, U+10D09,
  3018. U+10D1C)
  3019. \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
  3020. \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
  3021. \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
  3022. U+06C1..06C2)
  3023. \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
  3024. \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
  3025. U+06AC..06AE, U+077F, U+08B4)
  3026. \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
  3027. \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
  3028. \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
  3029. U+06BE, U+06FF)
  3030. \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7: U+0644,
  3031. U+06B5..06B8, U+076A, U+08A6)
  3032. \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
  3033. \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
  3034. U+0866)
  3035. \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
  3036. U+0861)
  3037. \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
  3038. U+0868)
  3039. \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
  3040. (1: U+0869)
  3041. \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
  3042. U+0860)
  3043. \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
  3044. U+0864)
  3045. \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
  3046. (1: U+0865)
  3047. \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
  3048. U+0862)
  3049. \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
  3050. U+0867)
  3051. \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
  3052. U+086A)
  3053. \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
  3054. U+0863)
  3055. \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
  3056. ManichaeanAleph}) (1: U+10AC0)
  3057. \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
  3058. (2: U+10AD9..10ADA)
  3059. \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
  3060. (2: U+10AC1..10AC2)
  3061. \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
  3062. ManichaeanDaleth}) (1: U+10AC5)
  3063. \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
  3064. ManichaeanDhamedh}) (1: U+10AD4)
  3065. \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
  3066. (1: U+10AEC)
  3067. \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
  3068. ManichaeanGimel}) (2: U+10AC3..10AC4)
  3069. \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
  3070. (1: U+10ACD)
  3071. \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
  3072. ManichaeanHundred}) (1: U+10AEF)
  3073. \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
  3074. (3: U+10AD0..10AD2)
  3075. \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
  3076. ManichaeanLamedh}) (1: U+10AD3)
  3077. \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
  3078. (1: U+10AD6)
  3079. \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
  3080. (1: U+10AD7)
  3081. \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
  3082. (1: U+10AEB)
  3083. \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
  3084. U+10ADB..10ADC)
  3085. \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
  3086. (3: U+10ADE..10AE0)
  3087. \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
  3088. (1: U+10AE1)
  3089. \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
  3090. ManichaeanSadhe}) (1: U+10ADD)
  3091. \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
  3092. ManichaeanSamekh}) (1: U+10AD8)
  3093. \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
  3094. (1: U+10AE4)
  3095. \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
  3096. (1: U+10AED)
  3097. \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
  3098. (1: U+10ACE)
  3099. \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
  3100. ManichaeanThamedh}) (1: U+10AD5)
  3101. \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
  3102. ManichaeanTwenty}) (1: U+10AEE)
  3103. \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
  3104. (1: U+10AC7)
  3105. \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
  3106. (1: U+10ACF)
  3107. \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
  3108. ManichaeanZayin}) (2: U+10AC9..10ACA)
  3109. \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
  3110. U+0765..0766, U+08A7)
  3111. \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
  3112. \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
  3113. (1_113_800 plus all above-Unicode code
  3114. points: U+0000..061F, U+0621, U+0640,
  3115. U+064B..066D, U+0670, U+0674 ...)
  3116. \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
  3117. U+06B9..06BC, U+0767..0769)
  3118. \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
  3119. \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
  3120. \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
  3121. \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
  3122. U+06A7..06A8, U+08A5)
  3123. \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
  3124. \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
  3125. U+0691..0699, U+06EF, U+075B,
  3126. U+076B..076C, U+0771 ...)
  3127. \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
  3128. U+0727)
  3129. \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
  3130. U+08AC)
  3131. \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
  3132. U+069D..069E, U+06FB, U+08AF)
  3133. \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
  3134. \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
  3135. U+069A..069C, U+06FA, U+075C, U+076D,
  3136. U+0770 ...)
  3137. \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
  3138. \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
  3139. \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
  3140. U+08B1)
  3141. \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
  3142. \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
  3143. \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4: U+0637..0638,
  3144. U+069F, U+08A3)
  3145. \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
  3146. \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
  3147. U+0629, U+06C0, U+06D5)
  3148. \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
  3149. Hamza_On_Heh_Goal} (1)
  3150. \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
  3151. \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
  3152. U+0676..0677, U+06C4..06CB, U+06CF,
  3153. U+0778..0779 ...)
  3154. \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
  3155. U+0649..064A, U+0678, U+06D0..06D1,
  3156. U+0777 ...)
  3157. \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
  3158. U+06D2..06D3)
  3159. \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
  3160. U+06CD)
  3161. \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
  3162. \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
  3163. \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
  3164. \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
  3165. \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
  3166. \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (565)
  3167. \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (565: U+0620,
  3168. U+0626, U+0628, U+062A..062E,
  3169. U+0633..063F, U+0641..0647 ...)
  3170. \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
  3171. U+07FA, U+180A, U+200D)
  3172. \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (4)
  3173. \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (4: U+A872,
  3174. U+10ACD, U+10AD7, U+10D00)
  3175. \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_437 plus
  3176. all above-Unicode code points: [\x00-
  3177. \xac\xae-\xff], U+0100..02FF,
  3178. U+0370..0482, U+048A..0590, U+05BE,
  3179. U+05C0 ...)
  3180. \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (118)
  3181. \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (118:
  3182. U+0622..0625, U+0627, U+0629,
  3183. U+062F..0632, U+0648, U+0671..0673 ...)
  3184. \p{Joining_Type: T} \p{Joining_Type=Transparent} (1984)
  3185. \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1984: [\xad],
  3186. U+0300..036F, U+0483..0489,
  3187. U+0591..05BD, U+05BF, U+05C1..05C2 ...)
  3188. \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_437
  3189. plus all above-Unicode code points)
  3190. \p{Jt: *} \p{Joining_Type: *}
  3191. \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
  3192. \p{Kthi}; NOT \p{Block=Kaithi}) (87)
  3193. \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
  3194. Kayah_Li}) (48)
  3195. \p{Kana} \p{Katakana} (= \p{Script_Extensions=
  3196. Katakana}) (NOT \p{Block=Katakana}) (356)
  3197. X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
  3198. Kana_Extended_A}) (48)
  3199. X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
  3200. \p{InKanaExtA}) (48)
  3201. X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
  3202. Kana_Supplement}) (256)
  3203. X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
  3204. \p{InKanaSup}) (256)
  3205. X \p{Kanbun} \p{Block=Kanbun} (16)
  3206. X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
  3207. Kangxi_Radicals}) (224)
  3208. X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
  3209. \p{InKangxi}) (224)
  3210. \p{Kannada} \p{Script_Extensions=Kannada} (Short:
  3211. \p{Knda}; NOT \p{Block=Kannada}) (104)
  3212. \p{Katakana} \p{Script_Extensions=Katakana} (Short:
  3213. \p{Kana}; NOT \p{Block=Katakana}) (356)
  3214. X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
  3215. \p{Block=Katakana_Phonetic_Extensions})
  3216. (16)
  3217. X \p{Katakana_Phonetic_Extensions} \p{Block=
  3218. Katakana_Phonetic_Extensions} (Short:
  3219. \p{InKatakanaExt}) (16)
  3220. \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
  3221. \p{Kali}) (48)
  3222. \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
  3223. Kharoshthi}) (NOT \p{Block=Kharoshthi})
  3224. (68)
  3225. \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
  3226. \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
  3227. \p{Khmer} \p{Script_Extensions=Khmer} (Short:
  3228. \p{Khmr}; NOT \p{Block=Khmer}) (146)
  3229. X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  3230. \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
  3231. (NOT \p{Block=Khmer}) (146)
  3232. \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
  3233. Khojki}) (NOT \p{Block=Khojki}) (82)
  3234. \p{Khojki} \p{Script_Extensions=Khojki} (Short:
  3235. \p{Khoj}; NOT \p{Block=Khojki}) (82)
  3236. \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
  3237. \p{Sind}; NOT \p{Block=Khudawadi}) (81)
  3238. \p{Knda} \p{Kannada} (= \p{Script_Extensions=
  3239. Kannada}) (NOT \p{Block=Kannada}) (104)
  3240. \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
  3241. Kaithi}) (NOT \p{Block=Kaithi}) (87)
  3242. \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
  3243. (125_643)
  3244. X \p{L&} \p{Cased_Letter} (= \p{General_Category=
  3245. Cased_Letter}) (3970)
  3246. X \p{L_} \p{Cased_Letter} (= \p{General_Category=
  3247. Cased_Letter}) Note the trailing '_'
  3248. matters in spite of loose matching
  3249. rules. (3970)
  3250. \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
  3251. Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
  3252. \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
  3253. Lao}) (82)
  3254. \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
  3255. \p{Block=Lao}) (82)
  3256. \p{Latin} \p{Script_Extensions=Latin} (Short:
  3257. \p{Latn}) (1387)
  3258. X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
  3259. Latin_1_Supplement}) (128)
  3260. X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
  3261. Latin_1_Supplement}) (128)
  3262. X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
  3263. \p{InLatin1}) (128)
  3264. X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
  3265. Latin_Extended_A}) (128)
  3266. X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
  3267. \p{Block=Latin_Extended_Additional})
  3268. (256)
  3269. X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
  3270. Latin_Extended_B}) (208)
  3271. X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
  3272. Latin_Extended_C}) (32)
  3273. X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
  3274. Latin_Extended_D}) (224)
  3275. X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
  3276. Latin_Extended_E}) (64)
  3277. X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
  3278. \p{InLatinExtA}) (128)
  3279. X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
  3280. (Short: \p{InLatinExtAdditional}) (256)
  3281. X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
  3282. \p{InLatinExtB}) (208)
  3283. X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
  3284. \p{InLatinExtC}) (32)
  3285. X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
  3286. \p{InLatinExtD}) (224)
  3287. X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
  3288. \p{InLatinExtE}) (64)
  3289. \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
  3290. (1387)
  3291. \p{Lb: *} \p{Line_Break: *}
  3292. \p{LC} \p{Cased_Letter} (= \p{General_Category=
  3293. Cased_Letter}) (3970)
  3294. \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
  3295. Lepcha}) (NOT \p{Block=Lepcha}) (74)
  3296. \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
  3297. \p{Lepc}; NOT \p{Block=Lepcha}) (74)
  3298. \p{Letter} \p{General_Category=Letter} (Short: \p{L})
  3299. (125_643)
  3300. \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
  3301. \p{Nl}) (236)
  3302. X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  3303. \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
  3304. (NOT \p{Block=Limbu}) (69)
  3305. \p{Limbu} \p{Script_Extensions=Limbu} (Short:
  3306. \p{Limb}; NOT \p{Block=Limbu}) (69)
  3307. \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
  3308. Linear_A}) (NOT \p{Block=Linear_A}) (386)
  3309. \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
  3310. Linear_B}) (268)
  3311. \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
  3312. \p{Line_Break: AL} \p{Line_Break=Alphabetic} (20_582)
  3313. \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (20_582: [#&*<=>\@A-
  3314. Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
  3315. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
  3316. U+02CE..02CF, U+02D1..02D7, U+02DC,
  3317. U+02DE ...)
  3318. \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
  3319. \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
  3320. \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
  3321. U+02CD, U+02D0, U+02D8..02DB ...)
  3322. \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
  3323. \p{Line_Break: BA} \p{Line_Break=Break_After} (239)
  3324. \p{Line_Break: BB} \p{Line_Break=Break_Before} (45)
  3325. \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
  3326. \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (239: [\t\|\xad],
  3327. U+058A, U+05BE, U+0964..0965,
  3328. U+0E5A..0E5B, U+0F0B ...)
  3329. \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
  3330. U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
  3331. ...)
  3332. \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
  3333. U+2E3A..2E3B)
  3334. \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
  3335. \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
  3336. \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
  3337. \p{Line_Break: CJ} \p{Line_Break=
  3338. Conditional_Japanese_Starter} (58)
  3339. \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (91)
  3340. \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
  3341. \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}],
  3342. U+0F3B, U+0F3D, U+169C, U+2046, U+207E
  3343. ...)
  3344. \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2260)
  3345. \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2260:
  3346. [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
  3347. U+0300..034E, U+0350..035B,
  3348. U+0363..036F, U+0483..0489, U+0591..05BD
  3349. ...)
  3350. \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750:
  3351. U+0E01..0E3A, U+0E40..0E4E,
  3352. U+0E81..0E82, U+0E84, U+0E86..0E8A,
  3353. U+0E8C..0EA3 ...)
  3354. \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
  3355. (58: U+3041, U+3043, U+3045, U+3047,
  3356. U+3049, U+3063 ...)
  3357. \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
  3358. \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
  3359. \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
  3360. \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (120: U+261D, U+26F9,
  3361. U+270A..270D, U+1F385, U+1F3C2..1F3C4,
  3362. U+1F3C7 ...)
  3363. \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
  3364. \p{Line_Break: EB} \p{Line_Break=E_Base} (120)
  3365. \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
  3366. \p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
  3367. \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
  3368. U+061B, U+061E..061F, U+06D4, U+07F9 ...)
  3369. \p{Line_Break: GL} \p{Line_Break=Glue} (25)
  3370. \p{Line_Break: Glue} (Short: \p{Lb=GL}) (25: [\xa0], U+034F,
  3371. U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
  3372. \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
  3373. U+AC38, U+AC54, U+AC70, U+AC8C ...)
  3374. \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
  3375. U+AC1D..AC37, U+AC39..AC53,
  3376. U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
  3377. ...)
  3378. \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
  3379. U+05D0..05EA, U+05EF..05F2, U+FB1D,
  3380. U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
  3381. ...)
  3382. \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75)
  3383. \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
  3384. \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
  3385. \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_693)
  3386. \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_693:
  3387. U+231A..231B, U+23F0..23F3,
  3388. U+2600..2603, U+2614..2615, U+2618,
  3389. U+261A..261C ...)
  3390. \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
  3391. \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
  3392. U+037E, U+0589, U+060C..060D, U+07F8,
  3393. U+2044 ...)
  3394. \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
  3395. U+22EF, U+FE19, U+10AF6)
  3396. \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
  3397. \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
  3398. \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
  3399. U+A960..A97C)
  3400. \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
  3401. U+D7CB..D7FB)
  3402. \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
  3403. U+D7B0..D7C6)
  3404. \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
  3405. \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
  3406. \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
  3407. U+2028..2029)
  3408. \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
  3409. \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
  3410. \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
  3411. U+203C..203D, U+2047..2049, U+3005,
  3412. U+301C, U+303B..303C ...)
  3413. \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33)
  3414. \p{Line_Break: NU} \p{Line_Break=Numeric} (622)
  3415. \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (622: [0-9],
  3416. U+0660..0669, U+066B..066C,
  3417. U+06F0..06F9, U+07C0..07C9, U+0966..096F
  3418. ...)
  3419. \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (88)
  3420. \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88:
  3421. [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
  3422. U+169B, U+201A, U+201E ...)
  3423. \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (36)
  3424. \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36:
  3425. [\%\xa2\xb0], U+0609..060B, U+066A,
  3426. U+09F2..09F3, U+09F9, U+0D79 ...)
  3427. \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (68)
  3428. \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3-
  3429. \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
  3430. U+0AF1, U+0BF9 ...)
  3431. \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
  3432. \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
  3433. U+2018..2019, U+201B..201D, U+201F,
  3434. U+2039..203A, U+275B..2760 ...)
  3435. \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
  3436. U+1F1E6..1F1FF)
  3437. \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
  3438. \p{Line_Break: SA} \p{Line_Break=Complex_Context} (750)
  3439. D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
  3440. \p{Line_Break: SP} \p{Line_Break=Space} (1)
  3441. \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
  3442. D \p{Line_Break: Surrogate} Surrogates should never appear in well-
  3443. formed text, and therefore shouldn't be
  3444. the basis for line breaking (Short:
  3445. \p{Lb=SG}) (2048: U+D800..DFFF)
  3446. \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
  3447. \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_897 plus all
  3448. above-Unicode code points: U+0378..0379,
  3449. U+0380..0383, U+038B, U+038D, U+03A2,
  3450. U+0530 ...)
  3451. \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
  3452. \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
  3453. \p{Line_Break: XX} \p{Line_Break=Unknown} (901_897 plus all
  3454. above-Unicode code points)
  3455. \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
  3456. \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
  3457. \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
  3458. \p{Line_Separator} \p{General_Category=Line_Separator}
  3459. (Short: \p{Zl}) (1)
  3460. \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
  3461. \p{Lina}; NOT \p{Block=Linear_A}) (386)
  3462. \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
  3463. \p{Linb}) (268)
  3464. X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
  3465. X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
  3466. \p{Lisu} \p{Script_Extensions=Lisu} (48)
  3467. \p{Ll} \p{Lowercase_Letter} (=
  3468. \p{General_Category=Lowercase_Letter})
  3469. (/i= General_Category=Cased_Letter)
  3470. (2151)
  3471. \p{Lm} \p{Modifier_Letter} (=
  3472. \p{General_Category=Modifier_Letter})
  3473. (259)
  3474. \p{Lo} \p{Other_Letter} (= \p{General_Category=
  3475. Other_Letter}) (121_414)
  3476. \p{LOE} \p{Logical_Order_Exception} (=
  3477. \p{Logical_Order_Exception=Y}) (19)
  3478. \p{LOE: *} \p{Logical_Order_Exception: *}
  3479. \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
  3480. \p{LOE}) (19)
  3481. \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
  3482. (1_114_093 plus all above-Unicode code
  3483. points: U+0000..0E3F, U+0E45..0EBF,
  3484. U+0EC5..19B4, U+19B8..19B9,
  3485. U+19BB..AAB4, U+AAB7..AAB8 ...)
  3486. \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
  3487. U+0E40..0E44, U+0EC0..0EC4,
  3488. U+19B5..19B7, U+19BA, U+AAB5..AAB6,
  3489. U+AAB9 ...)
  3490. X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
  3491. \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
  3492. Cased=Yes) (2340)
  3493. \p{Lower: *} \p{Lowercase: *}
  3494. \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
  3495. Cased=Yes) (2340)
  3496. \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
  3497. No) (1_111_772 plus all above-Unicode
  3498. code points: [\x00-
  3499. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  3500. Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  3501. \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
  3502. U+0102, U+0104, U+0106, U+0108 ...)
  3503. \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
  3504. Yes) (2340: [a-z\xaa\xb5\xba\xdf-
  3505. \xf6\xf8-\xff], U+0101, U+0103, U+0105,
  3506. U+0107, U+0109 ...)
  3507. \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
  3508. (Short: \p{Ll}; /i= General_Category=
  3509. Cased_Letter) (2151)
  3510. \p{Lt} \p{Titlecase_Letter} (=
  3511. \p{General_Category=Titlecase_Letter})
  3512. (/i= General_Category=Cased_Letter) (31)
  3513. \p{Lu} \p{Uppercase_Letter} (=
  3514. \p{General_Category=Uppercase_Letter})
  3515. (/i= General_Category=Cased_Letter)
  3516. (1788)
  3517. \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
  3518. Lycian}) (NOT \p{Block=Lycian}) (29)
  3519. \p{Lycian} \p{Script_Extensions=Lycian} (Short:
  3520. \p{Lyci}; NOT \p{Block=Lycian}) (29)
  3521. \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
  3522. Lydian}) (NOT \p{Block=Lydian}) (27)
  3523. \p{Lydian} \p{Script_Extensions=Lydian} (Short:
  3524. \p{Lydi}; NOT \p{Block=Lydian}) (27)
  3525. \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
  3526. (2268)
  3527. \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
  3528. \p{Mahj}; NOT \p{Block=Mahajani}) (61)
  3529. \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
  3530. Mahajani}) (NOT \p{Block=Mahajani}) (61)
  3531. X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
  3532. Mahjong_Tiles}) (48)
  3533. X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
  3534. \p{InMahjong}) (48)
  3535. \p{Maka} \p{Makasar} (= \p{Script_Extensions=
  3536. Makasar}) (NOT \p{Block=Makasar}) (25)
  3537. \p{Makasar} \p{Script_Extensions=Makasar} (Short:
  3538. \p{Maka}; NOT \p{Block=Makasar}) (25)
  3539. \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
  3540. \p{Mlym}; NOT \p{Block=Malayalam}) (125)
  3541. \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
  3542. Mandaic}) (NOT \p{Block=Mandaic}) (30)
  3543. \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
  3544. \p{Mand}; NOT \p{Block=Mandaic}) (30)
  3545. \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
  3546. Manichaean}) (NOT \p{Block=Manichaean})
  3547. (52)
  3548. \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
  3549. \p{Mani}; NOT \p{Block=Manichaean}) (52)
  3550. \p{Marc} \p{Marchen} (= \p{Script_Extensions=
  3551. Marchen}) (NOT \p{Block=Marchen}) (68)
  3552. \p{Marchen} \p{Script_Extensions=Marchen} (Short:
  3553. \p{Marc}; NOT \p{Block=Marchen}) (68)
  3554. \p{Mark} \p{General_Category=Mark} (Short: \p{M})
  3555. (2268)
  3556. \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
  3557. (Short: \p{Gonm}; NOT \p{Block=
  3558. Masaram_Gondi}) (77)
  3559. \p{Math} \p{Math=Y} (2310)
  3560. \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
  3561. above-Unicode code points: [\x00-
  3562. \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
  3563. Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
  3564. \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
  3565. U+03D3..03D4, U+03D6..03EF,
  3566. U+03F2..03F3, U+03F7..0605 ...)
  3567. \p{Math: Y*} (Single: \p{Math}) (2310: [+<=
  3568. >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
  3569. U+03D5, U+03F0..03F1, U+03F4..03F6,
  3570. U+0606..0608 ...)
  3571. X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
  3572. \p{Block=
  3573. Mathematical_Alphanumeric_Symbols})
  3574. (1024)
  3575. X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
  3576. Mathematical_Operators}) (256)
  3577. \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
  3578. \p{Sm}) (948)
  3579. X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
  3580. Mathematical_Alphanumeric_Symbols}
  3581. (Short: \p{InMathAlphanum}) (1024)
  3582. X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
  3583. (Short: \p{InMathOperators}) (256)
  3584. X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32)
  3585. \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
  3586. Spacing_Mark}) (429)
  3587. \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
  3588. Enclosing_Mark}) (13)
  3589. \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
  3590. \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
  3591. \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions=
  3592. Medefaidrin}) (NOT \p{Block=
  3593. Medefaidrin}) (91)
  3594. \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
  3595. \p{Mtei}; NOT \p{Block=Meetei_Mayek})
  3596. (79)
  3597. X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
  3598. Meetei_Mayek_Extensions}) (32)
  3599. X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
  3600. (Short: \p{InMeeteiMayekExt}) (32)
  3601. \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
  3602. Mende_Kikakui}) (NOT \p{Block=
  3603. Mende_Kikakui}) (213)
  3604. \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
  3605. (Short: \p{Mend}; NOT \p{Block=
  3606. Mende_Kikakui}) (213)
  3607. \p{Merc} \p{Meroitic_Cursive} (=
  3608. \p{Script_Extensions=Meroitic_Cursive})
  3609. (NOT \p{Block=Meroitic_Cursive}) (90)
  3610. \p{Mero} \p{Meroitic_Hieroglyphs} (=
  3611. \p{Script_Extensions=
  3612. Meroitic_Hieroglyphs}) (32)
  3613. \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
  3614. (Short: \p{Merc}; NOT \p{Block=
  3615. Meroitic_Cursive}) (90)
  3616. \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
  3617. Meroitic_Hieroglyphs} (Short: \p{Mero})
  3618. (32)
  3619. \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
  3620. Miao}) (149)
  3621. X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
  3622. \p{Block=
  3623. Miscellaneous_Symbols_And_Arrows}) (256)
  3624. X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
  3625. (= \p{Block=
  3626. Miscellaneous_Mathematical_Symbols_A})
  3627. (48)
  3628. X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
  3629. (= \p{Block=
  3630. Miscellaneous_Mathematical_Symbols_B})
  3631. (128)
  3632. X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
  3633. (= \p{Block=
  3634. Miscellaneous_Symbols_And_Pictographs})
  3635. (768)
  3636. X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
  3637. Miscellaneous_Symbols}) (256)
  3638. X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
  3639. Miscellaneous_Technical}) (256)
  3640. X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
  3641. Miscellaneous_Mathematical_Symbols_A}
  3642. (Short: \p{InMiscMathSymbolsA}) (48)
  3643. X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
  3644. Miscellaneous_Mathematical_Symbols_B}
  3645. (Short: \p{InMiscMathSymbolsB}) (128)
  3646. X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
  3647. \p{InMiscSymbols}) (256)
  3648. X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
  3649. Miscellaneous_Symbols_And_Arrows}
  3650. (Short: \p{InMiscArrows}) (256)
  3651. X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
  3652. Miscellaneous_Symbols_And_Pictographs}
  3653. (Short: \p{InMiscPictographs}) (768)
  3654. X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
  3655. (Short: \p{InMiscTechnical}) (256)
  3656. \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
  3657. Malayalam}) (NOT \p{Block=Malayalam})
  3658. (125)
  3659. \p{Mn} \p{Nonspacing_Mark} (=
  3660. \p{General_Category=Nonspacing_Mark})
  3661. (1826)
  3662. \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
  3663. Modi}) (89)
  3664. \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
  3665. (Short: \p{Lm}) (259)
  3666. X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
  3667. Spacing_Modifier_Letters}) (80)
  3668. \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
  3669. (Short: \p{Sk}) (121)
  3670. X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  3671. \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
  3672. Mongolian}) (NOT \p{Block=Mongolian})
  3673. (171)
  3674. \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
  3675. \p{Mong}; NOT \p{Block=Mongolian}) (171)
  3676. X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
  3677. Mongolian_Supplement}) (32)
  3678. X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
  3679. \p{InMongolianSup}) (32)
  3680. \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
  3681. Mro}) (43)
  3682. \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
  3683. \p{Block=Mro}) (43)
  3684. \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
  3685. Meetei_Mayek}) (NOT \p{Block=
  3686. Meetei_Mayek}) (79)
  3687. \p{Mult} \p{Multani} (= \p{Script_Extensions=
  3688. Multani}) (NOT \p{Block=Multani}) (48)
  3689. \p{Multani} \p{Script_Extensions=Multani} (Short:
  3690. \p{Mult}; NOT \p{Block=Multani}) (48)
  3691. X \p{Music} \p{Musical_Symbols} (= \p{Block=
  3692. Musical_Symbols}) (256)
  3693. X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
  3694. \p{InMusic}) (256)
  3695. \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
  3696. \p{Mymr}; NOT \p{Block=Myanmar}) (224)
  3697. X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
  3698. Myanmar_Extended_A}) (32)
  3699. X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
  3700. Myanmar_Extended_B}) (32)
  3701. X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
  3702. \p{InMyanmarExtA}) (32)
  3703. X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
  3704. \p{InMyanmarExtB}) (32)
  3705. \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
  3706. Myanmar}) (NOT \p{Block=Myanmar}) (224)
  3707. \p{N} \pN \p{Number} (= \p{General_Category=Number})
  3708. (1754)
  3709. \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
  3710. \p{Nbat}; NOT \p{Block=Nabataean}) (40)
  3711. \p{Nand} \p{Nandinagari} (= \p{Script_Extensions=
  3712. Nandinagari}) (NOT \p{Block=
  3713. Nandinagari}) (86)
  3714. \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short:
  3715. \p{Nand}; NOT \p{Block=Nandinagari}) (86)
  3716. \p{Narb} \p{Old_North_Arabian} (=
  3717. \p{Script_Extensions=Old_North_Arabian})
  3718. (32)
  3719. X \p{NB} \p{No_Block} (= \p{Block=No_Block})
  3720. (832_720 plus all above-Unicode code
  3721. points)
  3722. \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
  3723. Nabataean}) (NOT \p{Block=Nabataean})
  3724. (40)
  3725. \p{NChar} \p{Noncharacter_Code_Point} (=
  3726. \p{Noncharacter_Code_Point=Y}) (66)
  3727. \p{NChar: *} \p{Noncharacter_Code_Point: *}
  3728. \p{Nd} \p{XPosixDigit} (= \p{General_Category=
  3729. Decimal_Number}) (630)
  3730. \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
  3731. \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
  3732. \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
  3733. Newa}) (94)
  3734. \p{NFC_QC: *} \p{NFC_Quick_Check: *}
  3735. \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
  3736. \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
  3737. U+0300..0304, U+0306..030C, U+030F,
  3738. U+0311, U+0313..0314, U+031B ...)
  3739. \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
  3740. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  3741. (1120)
  3742. \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
  3743. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  3744. (1120: U+0340..0341, U+0343..0344,
  3745. U+0374, U+037E, U+0387, U+0958..095F ...)
  3746. \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
  3747. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  3748. (1_112_882 plus all above-Unicode code
  3749. points)
  3750. \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
  3751. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  3752. (1_112_882 plus all above-Unicode code
  3753. points: U+0000..02FF, U+0305,
  3754. U+030D..030E, U+0310, U+0312,
  3755. U+0315..031A ...)
  3756. \p{NFD_QC: *} \p{NFD_Quick_Check: *}
  3757. \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
  3758. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  3759. (13_232)
  3760. \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
  3761. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  3762. (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
  3763. \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
  3764. \xf6\xf9-\xfd\xff], U+0100..010F,
  3765. U+0112..0125, U+0128..0130,
  3766. U+0134..0137, U+0139..013E ...)
  3767. \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
  3768. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  3769. (1_100_880 plus all above-Unicode code
  3770. points)
  3771. \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
  3772. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  3773. (1_100_880 plus all above-Unicode code
  3774. points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
  3775. \xdf\xe6\xf0\xf7-\xf8\xfe],
  3776. U+0110..0111, U+0126..0127,
  3777. U+0131..0133, U+0138, U+013F..0142 ...)
  3778. \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
  3779. \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
  3780. \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
  3781. U+0300..0304, U+0306..030C, U+030F,
  3782. U+0311, U+0313..0314, U+031B ...)
  3783. \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
  3784. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  3785. (4796)
  3786. \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
  3787. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  3788. (4796: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
  3789. \xba\xbc-\xbe], U+0132..0133,
  3790. U+013F..0140, U+0149, U+017F,
  3791. U+01C4..01CC ...)
  3792. \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
  3793. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  3794. (1_109_206 plus all above-Unicode code
  3795. points)
  3796. \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
  3797. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  3798. (1_109_206 plus all above-Unicode code
  3799. points: [\x00-\x9f\xa1-\xa7\xa9\xab-
  3800. \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
  3801. U+0100..0131, U+0134..013E,
  3802. U+0141..0148, U+014A..017E, U+0180..01C3
  3803. ...)
  3804. \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
  3805. \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
  3806. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  3807. (16_896)
  3808. \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
  3809. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  3810. (16_896: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
  3811. \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
  3812. \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
  3813. \xf6\xf9-\xfd\xff], U+0100..010F,
  3814. U+0112..0125, U+0128..0130,
  3815. U+0132..0137, U+0139..0140 ...)
  3816. \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
  3817. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  3818. (1_097_216 plus all above-Unicode code
  3819. points)
  3820. \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
  3821. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  3822. (1_097_216 plus all above-Unicode code
  3823. points: [\x00-\x9f\xa1-\xa7\xa9\xab-
  3824. \xae\xb0-\xb1\xb6-
  3825. \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
  3826. \xdf\xe6\xf0\xf7-\xf8\xfe],
  3827. U+0110..0111, U+0126..0127, U+0131,
  3828. U+0138, U+0141..0142 ...)
  3829. \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
  3830. NKo}) (62)
  3831. \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
  3832. \p{Block=NKo}) (62)
  3833. \p{Nl} \p{Letter_Number} (= \p{General_Category=
  3834. Letter_Number}) (236)
  3835. \p{No} \p{Other_Number} (= \p{General_Category=
  3836. Other_Number}) (888)
  3837. X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
  3838. (832_720 plus all above-Unicode code
  3839. points)
  3840. \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
  3841. \p{NChar}) (66)
  3842. \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
  3843. (1_114_046 plus all above-Unicode code
  3844. points: U+0000..FDCF, U+FDF0..FFFD,
  3845. U+10000..1FFFD, U+20000..2FFFD,
  3846. U+30000..3FFFD, U+40000..4FFFD ...)
  3847. \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
  3848. (66: U+FDD0..FDEF, U+FFFE..FFFF,
  3849. U+1FFFE..1FFFF, U+2FFFE..2FFFF,
  3850. U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
  3851. \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
  3852. (Short: \p{Mn}) (1826)
  3853. \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
  3854. (NOT \p{Block=Nushu}) (397)
  3855. \p{Nt: *} \p{Numeric_Type: *}
  3856. \p{Number} \p{General_Category=Number} (Short: \p{N})
  3857. (1754)
  3858. X \p{Number_Forms} \p{Block=Number_Forms} (64)
  3859. \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (630)
  3860. \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (630: [0-9],
  3861. U+0660..0669, U+06F0..06F9,
  3862. U+07C0..07C9, U+0966..096F, U+09E6..09EF
  3863. ...)
  3864. \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
  3865. \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
  3866. U+1369..1371, U+19DA, U+2070,
  3867. U+2074..2079, U+2080..2089 ...)
  3868. \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_277 plus all
  3869. above-Unicode code points: [\x00-
  3870. \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
  3871. Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
  3872. \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
  3873. U+066A..06EF, U+06FA..07BF,
  3874. U+07CA..0965, U+0970..09E5 ...)
  3875. \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1077)
  3876. \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1077: [\xbc-\xbe],
  3877. U+09F4..09F9, U+0B72..0B77,
  3878. U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
  3879. ...)
  3880. T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
  3881. T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (81: [0], U+0660,
  3882. U+06F0, U+07C0, U+0966, U+09E6 ...)
  3883. T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
  3884. U+11FD4)
  3885. T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
  3886. T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
  3887. T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
  3888. T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
  3889. T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
  3890. T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
  3891. T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
  3892. T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
  3893. T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
  3894. U+0D76, U+A833, U+11FC9..11FCA)
  3895. T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
  3896. T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
  3897. U+11FCB)
  3898. T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
  3899. T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
  3900. U+0D77, U+215B, U+A834, U+11FCC ...)
  3901. T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
  3902. T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
  3903. T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
  3904. U+12461, U+1ED3D)
  3905. T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
  3906. U+0D78, U+A835, U+11FCE)
  3907. T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
  3908. U+11FCF)
  3909. T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
  3910. U+0B72, U+0D73, U+A830, U+10140 ...)
  3911. T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
  3912. U+10E7D, U+1245A, U+1245D, U+12465)
  3913. T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
  3914. T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
  3915. T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
  3916. T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
  3917. U+0D74, U+0F2A, U+2CFD, U+A831 ...)
  3918. T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
  3919. T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
  3920. T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
  3921. T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
  3922. U+109FD, U+10E7E, U+1245B, U+1245E ...)
  3923. T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
  3924. U+0B74, U+0D75, U+A832, U+10178 ...)
  3925. T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
  3926. T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
  3927. U+1245C)
  3928. T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
  3929. T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
  3930. T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (137: [1\xb9], U+0661,
  3931. U+06F1, U+07C1, U+0967, U+09E7 ...)
  3932. T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
  3933. T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (136: [2\xb2], U+0662,
  3934. U+06F2, U+07C2, U+0968, U+09E8 ...)
  3935. T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
  3936. T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (137: [3\xb3], U+0663,
  3937. U+06F3, U+07C3, U+0969, U+09E9 ...)
  3938. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
  3939. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (128: [4], U+0664,
  3940. U+06F4, U+07C4, U+096A, U+09EA ...)
  3941. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
  3942. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (127: [5], U+0665,
  3943. U+06F5, U+07C5, U+096B, U+09EB ...)
  3944. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
  3945. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (111: [6], U+0666,
  3946. U+06F6, U+07C6, U+096C, U+09EC ...)
  3947. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
  3948. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (110: [7], U+0667,
  3949. U+06F7, U+07C7, U+096D, U+09ED ...)
  3950. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
  3951. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (106: [8], U+0668,
  3952. U+06F8, U+07C8, U+096E, U+09EE ...)
  3953. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
  3954. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (110: [9], U+0669,
  3955. U+06F9, U+07C9, U+096F, U+09EF ...)
  3956. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (61: U+0BF0, U+0D70,
  3957. U+1372, U+2169, U+2179, U+2469 ...)
  3958. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (8: U+216A, U+217A,
  3959. U+246A, U+247E, U+2492, U+24EB ...)
  3960. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (8: U+216B, U+217B,
  3961. U+246B, U+247F, U+2493, U+24EC ...)
  3962. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (6: U+246C, U+2480,
  3963. U+2494, U+24ED, U+16E8D, U+1D2ED)
  3964. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (6: U+246D, U+2481,
  3965. U+2495, U+24EE, U+16E8E, U+1D2EE)
  3966. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (6: U+246E, U+2482,
  3967. U+2496, U+24EF, U+16E8F, U+1D2EF)
  3968. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (7: U+09F9, U+246F,
  3969. U+2483, U+2497, U+24F0, U+16E90 ...)
  3970. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (7: U+16EE, U+2470,
  3971. U+2484, U+2498, U+24F1, U+16E91 ...)
  3972. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (7: U+16EF, U+2471,
  3973. U+2485, U+2499, U+24F2, U+16E92 ...)
  3974. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (7: U+16F0, U+2472,
  3975. U+2486, U+249A, U+24F3, U+16E93 ...)
  3976. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (35: U+1373, U+2473,
  3977. U+2487, U+249B, U+24F4, U+3039 ...)
  3978. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
  3979. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
  3980. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
  3981. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
  3982. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
  3983. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
  3984. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
  3985. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
  3986. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
  3987. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A,
  3988. U+324A, U+325A, U+5345, U+10112 ...)
  3989. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
  3990. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
  3991. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
  3992. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
  3993. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
  3994. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
  3995. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
  3996. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
  3997. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
  3998. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B,
  3999. U+32B5, U+534C, U+10113, U+102ED ...)
  4000. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
  4001. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
  4002. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
  4003. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
  4004. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
  4005. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
  4006. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
  4007. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
  4008. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
  4009. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C,
  4010. U+217C, U+2186, U+324C, U+32BF ...)
  4011. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D,
  4012. U+10115, U+102EF, U+109CE, U+10E6E ...)
  4013. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E,
  4014. U+10116, U+102F0, U+109CF, U+10E6F ...)
  4015. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F,
  4016. U+10117, U+102F1, U+10E70, U+11062 ...)
  4017. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118,
  4018. U+102F2, U+10341, U+10E71, U+11063 ...)
  4019. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (34: U+0BF1, U+0D71,
  4020. U+137B, U+216D, U+217D, U+4F70 ...)
  4021. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
  4022. U+109D3, U+10E73, U+1EC84, U+1ED14)
  4023. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
  4024. U+102F5, U+109D4, U+10E74, U+1EC85 ...)
  4025. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
  4026. U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
  4027. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E,
  4028. U+1011D, U+10145, U+1014C, U+10153 ...)
  4029. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
  4030. U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
  4031. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
  4032. U+109D8, U+10E78, U+1EC89, U+1ED19)
  4033. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA,
  4034. U+109D9, U+10E79, U+1EC8A, U+1ED1A)
  4035. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB,
  4036. U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
  4037. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
  4038. U+216F, U+217F..2180, U+4EDF, U+5343 ...)
  4039. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
  4040. U+1EC8D, U+1ED1D, U+1ED3A)
  4041. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
  4042. U+1EC8E, U+1ED1E)
  4043. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
  4044. U+1EC8F, U+1ED1F)
  4045. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
  4046. U+10146, U+1014E, U+10172, U+109DF ...)
  4047. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
  4048. U+1EC91, U+1ED21)
  4049. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
  4050. U+1EC92, U+1ED22)
  4051. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
  4052. U+1EC93, U+1ED23)
  4053. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
  4054. U+1EC94, U+1ED24)
  4055. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
  4056. U+137C, U+2182, U+4E07, U+842C, U+1012B,
  4057. U+10155 ...)
  4058. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
  4059. U+1012C, U+109E5, U+1EC96, U+1ED26)
  4060. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
  4061. U+1012D, U+109E6, U+1EC97, U+1ED27)
  4062. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
  4063. U+1012E, U+109E7, U+1EC98, U+1ED28)
  4064. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
  4065. U+2187, U+1012F, U+10147, U+10156,
  4066. U+109E8, U+1EC99 ...)
  4067. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
  4068. U+10130, U+109E9, U+1EC9A, U+1ED2A)
  4069. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
  4070. U+10131, U+109EA, U+1EC9B, U+1ED2B)
  4071. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
  4072. U+10132, U+109EB, U+1EC9C, U+1ED2C)
  4073. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
  4074. U+10133, U+109EC, U+1EC9D, U+1ED2D)
  4075. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
  4076. U+2188, U+109ED, U+1EC9E, U+1ECA0,
  4077. U+1ECB4)
  4078. T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
  4079. U+109EE, U+1EC9F)
  4080. T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
  4081. U+12432)
  4082. T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
  4083. U+109EF)
  4084. T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
  4085. U+109F0)
  4086. T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
  4087. U+12433)
  4088. T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
  4089. U+109F1)
  4090. T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
  4091. U+109F2)
  4092. T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
  4093. U+109F3)
  4094. T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
  4095. U+109F4)
  4096. T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
  4097. U+109F5)
  4098. T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
  4099. U+16B5E)
  4100. T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
  4101. (1: U+1ECA1)
  4102. T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
  4103. (1: U+1ECA2)
  4104. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  4105. (3: U+4EBF, U+5104, U+16B5F)
  4106. T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
  4107. 10000000000}) (1: U+16B60)
  4108. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
  4109. 1000000000000}) (2: U+5146, U+16B61)
  4110. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_277 plus all
  4111. above-Unicode code points: [\x00-
  4112. \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
  4113. Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
  4114. \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
  4115. U+066A..06EF, U+06FA..07BF,
  4116. U+07CA..0965, U+0970..09E5 ...)
  4117. \p{Nushu} \p{Script_Extensions=Nushu} (Short:
  4118. \p{Nshu}; NOT \p{Block=Nushu}) (397)
  4119. \p{Nv: *} \p{Numeric_Value: *}
  4120. \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
  4121. Nyiakeng_Puachue_Hmong} (Short:
  4122. \p{Hmnp}; NOT \p{Block=
  4123. Nyiakeng_Puachue_Hmong}) (71)
  4124. X \p{OCR} \p{Optical_Character_Recognition} (=
  4125. \p{Block=Optical_Character_Recognition})
  4126. (32)
  4127. \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
  4128. (NOT \p{Block=Ogham}) (29)
  4129. \p{Ogham} \p{Script_Extensions=Ogham} (Short:
  4130. \p{Ogam}; NOT \p{Block=Ogham}) (29)
  4131. \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
  4132. \p{Olck}) (48)
  4133. \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
  4134. Ol_Chiki}) (48)
  4135. \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
  4136. (Short: \p{Hung}; NOT \p{Block=
  4137. Old_Hungarian}) (108)
  4138. \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
  4139. \p{Ital}; NOT \p{Block=Old_Italic}) (39)
  4140. \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
  4141. (Short: \p{Narb}) (32)
  4142. \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
  4143. \p{Perm}; NOT \p{Block=Old_Permic}) (44)
  4144. \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
  4145. \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
  4146. \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
  4147. \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
  4148. \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
  4149. (Short: \p{Sarb}) (32)
  4150. \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
  4151. \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
  4152. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  4153. (Short: \p{Ps}) (75)
  4154. X \p{Optical_Character_Recognition} \p{Block=
  4155. Optical_Character_Recognition} (Short:
  4156. \p{InOCR}) (32)
  4157. \p{Oriya} \p{Script_Extensions=Oriya} (Short:
  4158. \p{Orya}; NOT \p{Block=Oriya}) (96)
  4159. \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
  4160. Old_Turkic}) (NOT \p{Block=Old_Turkic})
  4161. (73)
  4162. X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
  4163. \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
  4164. (NOT \p{Block=Oriya}) (96)
  4165. \p{Osage} \p{Script_Extensions=Osage} (Short:
  4166. \p{Osge}; NOT \p{Block=Osage}) (72)
  4167. \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
  4168. (NOT \p{Block=Osage}) (72)
  4169. \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
  4170. Osmanya}) (NOT \p{Block=Osmanya}) (40)
  4171. \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
  4172. \p{Osma}; NOT \p{Block=Osmanya}) (40)
  4173. \p{Other} \p{General_Category=Other} (Short: \p{C})
  4174. (976_344 plus all above-Unicode code
  4175. points)
  4176. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  4177. \p{Lo}) (121_414)
  4178. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  4179. \p{No}) (888)
  4180. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  4181. (Short: \p{Po}) (588)
  4182. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  4183. \p{So}) (6161)
  4184. X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
  4185. \p{P} \pP \p{Punct} (= \p{General_Category=
  4186. Punctuation}) (NOT
  4187. \p{General_Punctuation}) (792)
  4188. \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
  4189. \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
  4190. (127)
  4191. \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
  4192. Palmyrene}) (32)
  4193. \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
  4194. \p{Palm}) (32)
  4195. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  4196. (Short: \p{Zp}) (1)
  4197. \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
  4198. Y}) (2760)
  4199. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  4200. \p{Pat_WS} \p{Pattern_White_Space} (=
  4201. \p{Pattern_White_Space=Y}) (11)
  4202. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  4203. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  4204. (2760)
  4205. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  4206. (1_111_352 plus all above-Unicode code
  4207. points: [\x00-\x200-9A-Z_a-z\x7f-
  4208. \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
  4209. \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
  4210. \xff], U+0100..200F, U+2028..202F,
  4211. U+203F..2040, U+2054, U+205F..218F ...)
  4212. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
  4213. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  4214. >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
  4215. \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
  4216. U+2010..2027, U+2030..203E,
  4217. U+2041..2053, U+2055..205E, U+2190..245F
  4218. ...)
  4219. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  4220. \p{PatWS}) (11)
  4221. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  4222. (1_114_101 plus all above-Unicode code
  4223. points: [^\t\n\cK\f\r\x20\x85],
  4224. U+0100..200D, U+2010..2027,
  4225. U+202A..infinity)
  4226. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
  4227. [\t\n\cK\f\r\x20\x85], U+200E..200F,
  4228. U+2028..2029)
  4229. \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
  4230. \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
  4231. \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
  4232. Pau_Cin_Hau}) (NOT \p{Block=
  4233. Pau_Cin_Hau}) (57)
  4234. \p{Pc} \p{Connector_Punctuation} (=
  4235. \p{General_Category=
  4236. Connector_Punctuation}) (10)
  4237. \p{PCM} \p{Prepended_Concatenation_Mark} (=
  4238. \p{Prepended_Concatenation_Mark=Y}) (11)
  4239. \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
  4240. \p{Pd} \p{Dash_Punctuation} (=
  4241. \p{General_Category=Dash_Punctuation})
  4242. (24)
  4243. \p{Pe} \p{Close_Punctuation} (=
  4244. \p{General_Category=Close_Punctuation})
  4245. (73)
  4246. \p{PerlSpace} \p{PosixSpace} (6)
  4247. \p{PerlWord} \p{PosixWord} (63)
  4248. \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
  4249. Old_Permic}) (NOT \p{Block=Old_Permic})
  4250. (44)
  4251. \p{Pf} \p{Final_Punctuation} (=
  4252. \p{General_Category=Final_Punctuation})
  4253. (10)
  4254. \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
  4255. Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
  4256. \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
  4257. \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
  4258. X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
  4259. Phaistos_Disc}) (48)
  4260. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
  4261. \p{InPhaistos}) (48)
  4262. \p{Phli} \p{Inscriptional_Pahlavi} (=
  4263. \p{Script_Extensions=
  4264. Inscriptional_Pahlavi}) (NOT \p{Block=
  4265. Inscriptional_Pahlavi}) (27)
  4266. \p{Phlp} \p{Psalter_Pahlavi} (=
  4267. \p{Script_Extensions=Psalter_Pahlavi})
  4268. (NOT \p{Block=Psalter_Pahlavi}) (30)
  4269. \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
  4270. Phoenician}) (NOT \p{Block=Phoenician})
  4271. (29)
  4272. \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
  4273. \p{Phnx}; NOT \p{Block=Phoenician}) (29)
  4274. X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
  4275. Phonetic_Extensions}) (128)
  4276. X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
  4277. \p{Block=
  4278. Phonetic_Extensions_Supplement}) (64)
  4279. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
  4280. \p{InPhoneticExt}) (128)
  4281. X \p{Phonetic_Extensions_Supplement} \p{Block=
  4282. Phonetic_Extensions_Supplement} (Short:
  4283. \p{InPhoneticExtSup}) (64)
  4284. \p{Pi} \p{Initial_Punctuation} (=
  4285. \p{General_Category=
  4286. Initial_Punctuation}) (12)
  4287. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  4288. \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
  4289. (NOT \p{Block=Miao}) (149)
  4290. \p{Po} \p{Other_Punctuation} (=
  4291. \p{General_Category=Other_Punctuation})
  4292. (588)
  4293. \p{PosixAlnum} (62: [0-9A-Za-z])
  4294. \p{PosixAlpha} (52: [A-Za-z])
  4295. \p{PosixBlank} (2: [\t\x20])
  4296. \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
  4297. BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
  4298. DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
  4299. FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
  4300. SOH, STX, SUB, SYN, US, VT)
  4301. \p{PosixDigit} (10: [0-9])
  4302. \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  4303. Z\[\\\]\^_`a-z\{\|\}~])
  4304. \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
  4305. \p{PosixPrint} (95: [\x20-\x7e])
  4306. \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  4307. >?\@\[\\\]\^_`\{\|\}~])
  4308. \p{PosixSpace} (Short: \p{PerlSpace}) (6:
  4309. [\t\n\cK\f\r\x20])
  4310. \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
  4311. \p{PosixWord} \w, restricted to ASCII (Short:
  4312. \p{PerlWord}) (63: [0-9A-Z_a-z])
  4313. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
  4314. (22)
  4315. \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
  4316. Y} (Short: \p{PCM}) (11)
  4317. \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
  4318. (1_114_101 plus all above-Unicode code
  4319. points: U+0000..05FF, U+0606..06DC,
  4320. U+06DE..070E, U+0710..08E1,
  4321. U+08E3..110BC, U+110BE..110CC ...)
  4322. \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
  4323. (11: U+0600..0605, U+06DD, U+070F,
  4324. U+08E2, U+110BD, U+110CD)
  4325. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  4326. extension) (33_979)
  4327. T \p{Present_In: 2.0} Code point's usage introduced in version
  4328. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  4329. extension) (178_500: U+0000..01F5,
  4330. U+01FA..0217, U+0250..02A8,
  4331. U+02B0..02DE, U+02E0..02E9, U+0300..0345
  4332. ...)
  4333. \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
  4334. (178_500)
  4335. T \p{Present_In: 2.1} Code point's usage introduced in version
  4336. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  4337. extension) (178_502: U+0000..01F5,
  4338. U+01FA..0217, U+0250..02A8,
  4339. U+02B0..02DE, U+02E0..02E9, U+0300..0345
  4340. ...)
  4341. \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
  4342. (178_502)
  4343. T \p{Present_In: 3.0} Code point's usage introduced in version
  4344. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  4345. extension) (188_809: U+0000..021F,
  4346. U+0222..0233, U+0250..02AD,
  4347. U+02B0..02EE, U+0300..034E, U+0360..0362
  4348. ...)
  4349. \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
  4350. (188_809)
  4351. T \p{Present_In: 3.1} Code point's usage introduced in version
  4352. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  4353. extension) (233_787: U+0000..021F,
  4354. U+0222..0233, U+0250..02AD,
  4355. U+02B0..02EE, U+0300..034E, U+0360..0362
  4356. ...)
  4357. \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
  4358. (233_787)
  4359. T \p{Present_In: 3.2} Code point's usage introduced in version
  4360. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  4361. extension) (234_803: U+0000..0220,
  4362. U+0222..0233, U+0250..02AD,
  4363. U+02B0..02EE, U+0300..034F, U+0360..036F
  4364. ...)
  4365. \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
  4366. (234_803)
  4367. T \p{Present_In: 4.0} Code point's usage introduced in version
  4368. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  4369. extension) (236_029: U+0000..0236,
  4370. U+0250..0357, U+035D..036F,
  4371. U+0374..0375, U+037A, U+037E ...)
  4372. \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
  4373. (236_029)
  4374. T \p{Present_In: 4.1} Code point's usage introduced in version
  4375. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  4376. extension) (237_302: U+0000..0241,
  4377. U+0250..036F, U+0374..0375, U+037A,
  4378. U+037E, U+0384..038A ...)
  4379. \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
  4380. (237_302)
  4381. T \p{Present_In: 5.0} Code point's usage introduced in version
  4382. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  4383. extension) (238_671: U+0000..036F,
  4384. U+0374..0375, U+037A..037E,
  4385. U+0384..038A, U+038C, U+038E..03A1 ...)
  4386. \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
  4387. (238_671)
  4388. T \p{Present_In: 5.1} Code point's usage introduced in version
  4389. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  4390. extension) (240_295: U+0000..0377,
  4391. U+037A..037E, U+0384..038A, U+038C,
  4392. U+038E..03A1, U+03A3..0523 ...)
  4393. \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
  4394. (240_295)
  4395. T \p{Present_In: 5.2} Code point's usage introduced in version
  4396. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  4397. extension) (246_943: U+0000..0377,
  4398. U+037A..037E, U+0384..038A, U+038C,
  4399. U+038E..03A1, U+03A3..0525 ...)
  4400. \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
  4401. (246_943)
  4402. T \p{Present_In: 6.0} Code point's usage introduced in version
  4403. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  4404. extension) (249_031: U+0000..0377,
  4405. U+037A..037E, U+0384..038A, U+038C,
  4406. U+038E..03A1, U+03A3..0527 ...)
  4407. \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
  4408. (249_031)
  4409. T \p{Present_In: 6.1} Code point's usage introduced in version
  4410. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  4411. extension) (249_763: U+0000..0377,
  4412. U+037A..037E, U+0384..038A, U+038C,
  4413. U+038E..03A1, U+03A3..0527 ...)
  4414. \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
  4415. (249_763)
  4416. T \p{Present_In: 6.2} Code point's usage introduced in version
  4417. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  4418. extension) (249_764: U+0000..0377,
  4419. U+037A..037E, U+0384..038A, U+038C,
  4420. U+038E..03A1, U+03A3..0527 ...)
  4421. \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
  4422. (249_764)
  4423. T \p{Present_In: 6.3} Code point's usage introduced in version
  4424. 6.3 or earlier (Short: \p{In=6.3}) (Perl
  4425. extension) (249_769: U+0000..0377,
  4426. U+037A..037E, U+0384..038A, U+038C,
  4427. U+038E..03A1, U+03A3..0527 ...)
  4428. \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
  4429. (249_769)
  4430. T \p{Present_In: 7.0} Code point's usage introduced in version
  4431. 7.0 or earlier (Short: \p{In=7.0}) (Perl
  4432. extension) (252_603: U+0000..0377,
  4433. U+037A..037F, U+0384..038A, U+038C,
  4434. U+038E..03A1, U+03A3..052F ...)
  4435. \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
  4436. (252_603)
  4437. T \p{Present_In: 8.0} Code point's usage introduced in version
  4438. 8.0 or earlier (Short: \p{In=8.0}) (Perl
  4439. extension) (260_319: U+0000..0377,
  4440. U+037A..037F, U+0384..038A, U+038C,
  4441. U+038E..03A1, U+03A3..052F ...)
  4442. \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
  4443. (260_319)
  4444. T \p{Present_In: 9.0} Code point's usage introduced in version
  4445. 9.0 or earlier (Short: \p{In=9.0}) (Perl
  4446. extension) (267_819: U+0000..0377,
  4447. U+037A..037F, U+0384..038A, U+038C,
  4448. U+038E..03A1, U+03A3..052F ...)
  4449. \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
  4450. (267_819)
  4451. T \p{Present_In: 10.0} Code point's usage introduced in version
  4452. 10.0 or earlier (Short: \p{In=10.0})
  4453. (Perl extension) (276_337: U+0000..0377,
  4454. U+037A..037F, U+0384..038A, U+038C,
  4455. U+038E..03A1, U+03A3..052F ...)
  4456. \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
  4457. (276_337)
  4458. T \p{Present_In: 11.0} Code point's usage introduced in version
  4459. 11.0 or earlier (Short: \p{In=11.0})
  4460. (Perl extension) (277_021: U+0000..0377,
  4461. U+037A..037F, U+0384..038A, U+038C,
  4462. U+038E..03A1, U+03A3..052F ...)
  4463. \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension)
  4464. (277_021)
  4465. T \p{Present_In: 12.0} Code point's usage introduced in version
  4466. 12.0 or earlier (Short: \p{In=12.0})
  4467. (Perl extension) (277_575: U+0000..0377,
  4468. U+037A..037F, U+0384..038A, U+038C,
  4469. U+038E..03A1, U+03A3..052F ...)
  4470. \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension)
  4471. (277_575)
  4472. T \p{Present_In: 12.1} Code point's usage introduced in version
  4473. 12.1 or earlier (Short: \p{In=12.1})
  4474. (Perl extension) (277_576: U+0000..0377,
  4475. U+037A..037F, U+0384..038A, U+038C,
  4476. U+038E..03A1, U+03A3..052F ...)
  4477. \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension)
  4478. (277_576)
  4479. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  4480. Unassigned}) (Perl extension) (836_536
  4481. plus all above-Unicode code points)
  4482. \p{Print} \p{XPosixPrint} (275_395)
  4483. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  4484. \p{Co}; NOT \p{Private_Use_Area})
  4485. (137_468)
  4486. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  4487. \p{InPUA}) (6400)
  4488. \p{Prti} \p{Inscriptional_Parthian} (=
  4489. \p{Script_Extensions=
  4490. Inscriptional_Parthian}) (NOT \p{Block=
  4491. Inscriptional_Parthian}) (30)
  4492. \p{Ps} \p{Open_Punctuation} (=
  4493. \p{General_Category=Open_Punctuation})
  4494. (75)
  4495. \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
  4496. (Short: \p{Phlp}; NOT \p{Block=
  4497. Psalter_Pahlavi}) (30)
  4498. X \p{PUA} \p{Private_Use_Area} (= \p{Block=
  4499. Private_Use_Area}) (6400)
  4500. \p{Punct} \p{General_Category=Punctuation} (Short:
  4501. \p{P}; NOT \p{General_Punctuation}) (792)
  4502. \p{Punctuation} \p{Punct} (= \p{General_Category=
  4503. Punctuation}) (NOT
  4504. \p{General_Punctuation}) (792)
  4505. \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
  4506. Coptic}) (NOT \p{Block=Coptic}) (165)
  4507. \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
  4508. Inherited}) (502)
  4509. \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
  4510. Y}) (30)
  4511. \p{QMark: *} \p{Quotation_Mark: *}
  4512. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  4513. (30)
  4514. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
  4515. plus all above-Unicode code points:
  4516. [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
  4517. >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
  4518. \xaa\xac-\xba\xbc-\xff], U+0100..2017,
  4519. U+2020..2038, U+203B..2E41,
  4520. U+2E43..300B, U+3010..301C ...)
  4521. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30:
  4522. [\"\'\xab\xbb], U+2018..201F,
  4523. U+2039..203A, U+2E42, U+300C..300F,
  4524. U+301D..301F ...)
  4525. \p{Radical} \p{Radical=Y} (329)
  4526. \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
  4527. above-Unicode code points: U+0000..2E7F,
  4528. U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
  4529. \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
  4530. U+2E9B..2EF3, U+2F00..2FD5)
  4531. \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
  4532. (26)
  4533. \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
  4534. plus all above-Unicode code points:
  4535. U+0000..1F1E5, U+1F200..infinity)
  4536. \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
  4537. U+1F1E6..1F1FF)
  4538. \p{Rejang} \p{Script_Extensions=Rejang} (Short:
  4539. \p{Rjng}; NOT \p{Block=Rejang}) (37)
  4540. \p{RI} \p{Regional_Indicator} (=
  4541. \p{Regional_Indicator=Y}) (26)
  4542. \p{RI: *} \p{Regional_Indicator: *}
  4543. \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
  4544. Rejang}) (NOT \p{Block=Rejang}) (37)
  4545. \p{Rohg} \p{Hanifi_Rohingya} (=
  4546. \p{Script_Extensions=Hanifi_Rohingya})
  4547. (NOT \p{Block=Hanifi_Rohingya}) (55)
  4548. X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
  4549. Rumi_Numeral_Symbols}) (32)
  4550. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
  4551. \p{InRumi}) (32)
  4552. \p{Runic} \p{Script_Extensions=Runic} (Short:
  4553. \p{Runr}; NOT \p{Block=Runic}) (86)
  4554. \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
  4555. (NOT \p{Block=Runic}) (86)
  4556. \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
  4557. (7292)
  4558. \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
  4559. \p{Samr}; NOT \p{Block=Samaritan}) (61)
  4560. \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
  4561. Samaritan}) (NOT \p{Block=Samaritan})
  4562. (61)
  4563. \p{Sarb} \p{Old_South_Arabian} (=
  4564. \p{Script_Extensions=Old_South_Arabian})
  4565. (32)
  4566. \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
  4567. Saurashtra}) (NOT \p{Block=Saurashtra})
  4568. (82)
  4569. \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
  4570. \p{Saur}; NOT \p{Block=Saurashtra}) (82)
  4571. \p{SB: *} \p{Sentence_Break: *}
  4572. \p{Sc} \p{Currency_Symbol} (=
  4573. \p{General_Category=Currency_Symbol})
  4574. (62)
  4575. \p{Sc: *} \p{Script: *}
  4576. \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
  4577. U+1E950..1E959, U+1E95E..1E95F)
  4578. \p{Script: Adlm} \p{Script=Adlam} (88)
  4579. \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
  4580. \p{Script_Extensions=
  4581. Caucasian_Albanian}) (53)
  4582. \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
  4583. Ahom}, \p{Ahom}) (58)
  4584. \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
  4585. Anatolian_Hieroglyphs} (Short: \p{Sc=
  4586. Hluw}, \p{Hluw}) (583)
  4587. \p{Script: Arab} \p{Script=Arabic} (1281)
  4588. \p{Script: Arabic} (Short: \p{Sc=Arab}) (1281: U+0600..0604,
  4589. U+0606..060B, U+060D..061A, U+061C,
  4590. U+061E, U+0620..063F ...)
  4591. \p{Script: Armenian} (Short: \p{Sc=Armn}) (95: U+0531..0556,
  4592. U+0559..0588, U+058A, U+058D..058F,
  4593. U+FB13..FB17)
  4594. \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
  4595. \p{Script_Extensions=Imperial_Aramaic})
  4596. (31)
  4597. \p{Script: Armn} \p{Script=Armenian} (95)
  4598. \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
  4599. \p{Sc=Avst}, \p{Avst}) (61)
  4600. \p{Script: Avst} \p{Script=Avestan} (=
  4601. \p{Script_Extensions=Avestan}) (61)
  4602. \p{Script: Bali} \p{Script=Balinese} (=
  4603. \p{Script_Extensions=Balinese}) (121)
  4604. \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
  4605. \p{Sc=Bali}, \p{Bali}) (121)
  4606. \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
  4607. Bamum}) (657)
  4608. \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
  4609. Bamu}, \p{Bamu}) (657)
  4610. \p{Script: Bass} \p{Script=Bassa_Vah} (=
  4611. \p{Script_Extensions=Bassa_Vah}) (36)
  4612. \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
  4613. \p{Sc=Bass}, \p{Bass}) (36)
  4614. \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
  4615. Batk}, \p{Batk}) (56)
  4616. \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
  4617. Batak}) (56)
  4618. \p{Script: Beng} \p{Script=Bengali} (96)
  4619. \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983,
  4620. U+0985..098C, U+098F..0990,
  4621. U+0993..09A8, U+09AA..09B0, U+09B2 ...)
  4622. \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
  4623. \p{Sc=Bhks}, \p{Bhks}) (97)
  4624. \p{Script: Bhks} \p{Script=Bhaiksuki} (=
  4625. \p{Script_Extensions=Bhaiksuki}) (97)
  4626. \p{Script: Bopo} \p{Script=Bopomofo} (72)
  4627. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (72: U+02EA..02EB,
  4628. U+3105..312F, U+31A0..31BA)
  4629. \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
  4630. Brahmi}) (109)
  4631. \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
  4632. \p{Sc=Brah}, \p{Brah}) (109)
  4633. \p{Script: Brai} \p{Script=Braille} (=
  4634. \p{Script_Extensions=Braille}) (256)
  4635. \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
  4636. \p{Sc=Brai}, \p{Brai}) (256)
  4637. \p{Script: Bugi} \p{Script=Buginese} (30)
  4638. \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
  4639. U+1A1E..1A1F)
  4640. \p{Script: Buhd} \p{Script=Buhid} (20)
  4641. \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
  4642. \p{Script: Cakm} \p{Script=Chakma} (70)
  4643. \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
  4644. Canadian_Aboriginal} (Short: \p{Sc=
  4645. Cans}, \p{Cans}) (710)
  4646. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
  4647. \p{Script_Extensions=
  4648. Canadian_Aboriginal}) (710)
  4649. \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
  4650. Carian}) (49)
  4651. \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
  4652. \p{Sc=Cari}, \p{Cari}) (49)
  4653. \p{Script: Caucasian_Albanian} \p{Script_Extensions=
  4654. Caucasian_Albanian} (Short: \p{Sc=Aghb},
  4655. \p{Aghb}) (53)
  4656. \p{Script: Chakma} (Short: \p{Sc=Cakm}) (70: U+11100..11134,
  4657. U+11136..11146)
  4658. \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
  4659. Cham}, \p{Cham}) (83)
  4660. \p{Script: Cher} \p{Script=Cherokee} (=
  4661. \p{Script_Extensions=Cherokee}) (172)
  4662. \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
  4663. \p{Sc=Cher}, \p{Cher}) (172)
  4664. \p{Script: Common} (Short: \p{Sc=Zyyy}) (7805: [\x00-
  4665. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  4666. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  4667. \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
  4668. U+02E5..02E9, U+02EC..02FF, U+0374,
  4669. U+037E ...)
  4670. \p{Script: Copt} \p{Script=Coptic} (137)
  4671. \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
  4672. U+2C80..2CF3, U+2CF9..2CFF)
  4673. \p{Script: Cprt} \p{Script=Cypriot} (55)
  4674. \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
  4675. \p{Sc=Xsux}, \p{Xsux}) (1234)
  4676. \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
  4677. U+10808, U+1080A..10835, U+10837..10838,
  4678. U+1083C, U+1083F)
  4679. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
  4680. U+0487..052F, U+1C80..1C88, U+1D2B,
  4681. U+1D78, U+2DE0..2DFF ...)
  4682. \p{Script: Cyrl} \p{Script=Cyrillic} (443)
  4683. \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
  4684. \p{Sc=Dsrt}, \p{Dsrt}) (80)
  4685. \p{Script: Deva} \p{Script=Devanagari} (154)
  4686. \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950,
  4687. U+0955..0963, U+0966..097F, U+A8E0..A8FF)
  4688. \p{Script: Dogr} \p{Script=Dogra} (60)
  4689. \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
  4690. \p{Script: Dsrt} \p{Script=Deseret} (=
  4691. \p{Script_Extensions=Deseret}) (80)
  4692. \p{Script: Dupl} \p{Script=Duployan} (143)
  4693. \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
  4694. U+1BC70..1BC7C, U+1BC80..1BC88,
  4695. U+1BC90..1BC99, U+1BC9C..1BC9F)
  4696. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
  4697. \p{Script_Extensions=
  4698. Egyptian_Hieroglyphs}) (1080)
  4699. \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
  4700. Egyptian_Hieroglyphs} (Short: \p{Sc=
  4701. Egyp}, \p{Egyp}) (1080)
  4702. \p{Script: Elba} \p{Script=Elbasan} (=
  4703. \p{Script_Extensions=Elbasan}) (40)
  4704. \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
  4705. \p{Sc=Elba}, \p{Elba}) (40)
  4706. \p{Script: Elym} \p{Script=Elymaic} (=
  4707. \p{Script_Extensions=Elymaic}) (23)
  4708. \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short:
  4709. \p{Sc=Elym}, \p{Elym}) (23)
  4710. \p{Script: Ethi} \p{Script=Ethiopic} (=
  4711. \p{Script_Extensions=Ethiopic}) (495)
  4712. \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
  4713. \p{Sc=Ethi}, \p{Ethi}) (495)
  4714. \p{Script: Geor} \p{Script=Georgian} (173)
  4715. \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
  4716. U+10C7, U+10CD, U+10D0..10FA,
  4717. U+10FC..10FF, U+1C90..1CBA ...)
  4718. \p{Script: Glag} \p{Script=Glagolitic} (132)
  4719. \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
  4720. U+2C30..2C5E, U+1E000..1E006,
  4721. U+1E008..1E018, U+1E01B..1E021,
  4722. U+1E023..1E024 ...)
  4723. \p{Script: Gong} \p{Script=Gunjala_Gondi} (63)
  4724. \p{Script: Gonm} \p{Script=Masaram_Gondi} (75)
  4725. \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
  4726. Gothic}) (27)
  4727. \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
  4728. \p{Sc=Goth}, \p{Goth}) (27)
  4729. \p{Script: Gran} \p{Script=Grantha} (85)
  4730. \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
  4731. U+11305..1130C, U+1130F..11310,
  4732. U+11313..11328, U+1132A..11330,
  4733. U+11332..11333 ...)
  4734. \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
  4735. U+0375..0377, U+037A..037D, U+037F,
  4736. U+0384, U+0386 ...)
  4737. \p{Script: Grek} \p{Script=Greek} (518)
  4738. \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
  4739. U+0A85..0A8D, U+0A8F..0A91,
  4740. U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
  4741. ...)
  4742. \p{Script: Gujr} \p{Script=Gujarati} (91)
  4743. \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
  4744. U+11D60..11D65, U+11D67..11D68,
  4745. U+11D6A..11D8E, U+11D90..11D91,
  4746. U+11D93..11D98, U+11DA0..11DA9)
  4747. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
  4748. U+0A05..0A0A, U+0A0F..0A10,
  4749. U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
  4750. ...)
  4751. \p{Script: Guru} \p{Script=Gurmukhi} (80)
  4752. \p{Script: Han} (Short: \p{Sc=Han}) (89_233: U+2E80..2E99,
  4753. U+2E9B..2EF3, U+2F00..2FD5, U+3005,
  4754. U+3007, U+3021..3029 ...)
  4755. \p{Script: Hang} \p{Script=Hangul} (11_739)
  4756. \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
  4757. U+1100..11FF, U+302E..302F,
  4758. U+3131..318E, U+3200..321E,
  4759. U+3260..327E, U+A960..A97C ...)
  4760. \p{Script: Hani} \p{Script=Han} (89_233)
  4761. \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
  4762. U+10D00..10D27, U+10D30..10D39)
  4763. \p{Script: Hano} \p{Script=Hanunoo} (21)
  4764. \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
  4765. \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
  4766. Hatran}) (26)
  4767. \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
  4768. \p{Sc=Hatr}, \p{Hatr}) (26)
  4769. \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
  4770. Hebrew}) (134)
  4771. \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
  4772. \p{Sc=Hebr}, \p{Hebr}) (134)
  4773. \p{Script: Hira} \p{Script=Hiragana} (379)
  4774. \p{Script: Hiragana} (Short: \p{Sc=Hira}) (379: U+3041..3096,
  4775. U+309D..309F, U+1B001..1B11E,
  4776. U+1B150..1B152, U+1F200)
  4777. \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
  4778. \p{Script_Extensions=
  4779. Anatolian_Hieroglyphs}) (583)
  4780. \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
  4781. \p{Script_Extensions=Pahawh_Hmong}) (127)
  4782. \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (=
  4783. \p{Script_Extensions=
  4784. Nyiakeng_Puachue_Hmong}) (71)
  4785. \p{Script: Hung} \p{Script=Old_Hungarian} (=
  4786. \p{Script_Extensions=Old_Hungarian})
  4787. (108)
  4788. \p{Script: Imperial_Aramaic} \p{Script_Extensions=
  4789. Imperial_Aramaic} (Short: \p{Sc=Armi},
  4790. \p{Armi}) (31)
  4791. \p{Script: Inherited} (Short: \p{Sc=Zinh}) (571: U+0300..036F,
  4792. U+0485..0486, U+064B..0655, U+0670,
  4793. U+0951..0954, U+1AB0..1ABE ...)
  4794. \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
  4795. Inscriptional_Pahlavi} (Short: \p{Sc=
  4796. Phli}, \p{Phli}) (27)
  4797. \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
  4798. Inscriptional_Parthian} (Short: \p{Sc=
  4799. Prti}, \p{Prti}) (30)
  4800. \p{Script: Ital} \p{Script=Old_Italic} (=
  4801. \p{Script_Extensions=Old_Italic}) (39)
  4802. \p{Script: Java} \p{Script=Javanese} (90)
  4803. \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
  4804. U+A9D0..A9D9, U+A9DE..A9DF)
  4805. \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (67: U+11080..110C1,
  4806. U+110CD)
  4807. \p{Script: Kali} \p{Script=Kayah_Li} (47)
  4808. \p{Script: Kana} \p{Script=Katakana} (304)
  4809. \p{Script: Kannada} (Short: \p{Sc=Knda}) (89: U+0C80..0C8C,
  4810. U+0C8E..0C90, U+0C92..0CA8,
  4811. U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
  4812. ...)
  4813. \p{Script: Katakana} (Short: \p{Sc=Kana}) (304: U+30A1..30FA,
  4814. U+30FD..30FF, U+31F0..31FF,
  4815. U+32D0..32FE, U+3300..3357, U+FF66..FF6F
  4816. ...)
  4817. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
  4818. U+A92F)
  4819. \p{Script: Khar} \p{Script=Kharoshthi} (=
  4820. \p{Script_Extensions=Kharoshthi}) (68)
  4821. \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
  4822. \p{Sc=Khar}, \p{Khar}) (68)
  4823. \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
  4824. Khmr}, \p{Khmr}) (146)
  4825. \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
  4826. Khmer}) (146)
  4827. \p{Script: Khoj} \p{Script=Khojki} (62)
  4828. \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211,
  4829. U+11213..1123E)
  4830. \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
  4831. U+112F0..112F9)
  4832. \p{Script: Knda} \p{Script=Kannada} (89)
  4833. \p{Script: Kthi} \p{Script=Kaithi} (67)
  4834. \p{Script: Lana} \p{Script=Tai_Tham} (=
  4835. \p{Script_Extensions=Tai_Tham}) (127)
  4836. \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
  4837. Lao}, \p{Lao}) (82)
  4838. \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
  4839. Lao}) (82)
  4840. \p{Script: Latin} (Short: \p{Sc=Latn}) (1366: [A-Za-
  4841. z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
  4842. U+0100..02B8, U+02E0..02E4,
  4843. U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
  4844. ...)
  4845. \p{Script: Latn} \p{Script=Latin} (1366)
  4846. \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
  4847. Lepcha}) (74)
  4848. \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
  4849. \p{Sc=Lepc}, \p{Lepc}) (74)
  4850. \p{Script: Limb} \p{Script=Limbu} (68)
  4851. \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
  4852. U+1920..192B, U+1930..193B, U+1940,
  4853. U+1944..194F)
  4854. \p{Script: Lina} \p{Script=Linear_A} (341)
  4855. \p{Script: Linb} \p{Script=Linear_B} (211)
  4856. \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
  4857. U+10740..10755, U+10760..10767)
  4858. \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
  4859. U+1000D..10026, U+10028..1003A,
  4860. U+1003C..1003D, U+1003F..1004D,
  4861. U+10050..1005D ...)
  4862. \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
  4863. Lisu}, \p{Lisu}) (48)
  4864. \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
  4865. Lycian}) (29)
  4866. \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
  4867. \p{Sc=Lyci}, \p{Lyci}) (29)
  4868. \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
  4869. Lydian}) (27)
  4870. \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
  4871. \p{Sc=Lydi}, \p{Lydi}) (27)
  4872. \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
  4873. \p{Script: Mahj} \p{Script=Mahajani} (39)
  4874. \p{Script: Maka} \p{Script=Makasar} (=
  4875. \p{Script_Extensions=Makasar}) (25)
  4876. \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short:
  4877. \p{Sc=Maka}, \p{Maka}) (25)
  4878. \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
  4879. U+0D05..0D0C, U+0D0E..0D10,
  4880. U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
  4881. ...)
  4882. \p{Script: Mand} \p{Script=Mandaic} (29)
  4883. \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
  4884. U+085E)
  4885. \p{Script: Mani} \p{Script=Manichaean} (51)
  4886. \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
  4887. U+10AEB..10AF6)
  4888. \p{Script: Marc} \p{Script=Marchen} (=
  4889. \p{Script_Extensions=Marchen}) (68)
  4890. \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
  4891. \p{Sc=Marc}, \p{Marc}) (68)
  4892. \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
  4893. U+11D00..11D06, U+11D08..11D09,
  4894. U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
  4895. U+11D3F..11D47 ...)
  4896. \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
  4897. \p{Sc=Medf}, \p{Medf}) (91)
  4898. \p{Script: Medf} \p{Script=Medefaidrin} (=
  4899. \p{Script_Extensions=Medefaidrin}) (91)
  4900. \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
  4901. (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
  4902. \p{Script: Mend} \p{Script=Mende_Kikakui} (=
  4903. \p{Script_Extensions=Mende_Kikakui})
  4904. (213)
  4905. \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
  4906. (Short: \p{Sc=Mend}, \p{Mend}) (213)
  4907. \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
  4908. \p{Script_Extensions=Meroitic_Cursive})
  4909. (90)
  4910. \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
  4911. \p{Script_Extensions=
  4912. Meroitic_Hieroglyphs}) (32)
  4913. \p{Script: Meroitic_Cursive} \p{Script_Extensions=
  4914. Meroitic_Cursive} (Short: \p{Sc=Merc},
  4915. \p{Merc}) (90)
  4916. \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
  4917. Meroitic_Hieroglyphs} (Short: \p{Sc=
  4918. Mero}, \p{Mero}) (32)
  4919. \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
  4920. Miao}, \p{Miao}) (149)
  4921. \p{Script: Mlym} \p{Script=Malayalam} (117)
  4922. \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
  4923. U+11650..11659)
  4924. \p{Script: Mong} \p{Script=Mongolian} (167)
  4925. \p{Script: Mongolian} (Short: \p{Sc=Mong}) (167: U+1800..1801,
  4926. U+1804, U+1806..180E, U+1810..1819,
  4927. U+1820..1878, U+1880..18AA ...)
  4928. \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
  4929. Mro}, \p{Mro}) (43)
  4930. \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
  4931. Mro}) (43)
  4932. \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
  4933. \p{Script_Extensions=Meetei_Mayek}) (79)
  4934. \p{Script: Mult} \p{Script=Multani} (38)
  4935. \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
  4936. U+11288, U+1128A..1128D, U+1128F..1129D,
  4937. U+1129F..112A9)
  4938. \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
  4939. U+A9E0..A9FE, U+AA60..AA7F)
  4940. \p{Script: Mymr} \p{Script=Myanmar} (223)
  4941. \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
  4942. \p{Sc=Nbat}, \p{Nbat}) (40)
  4943. \p{Script: Nand} \p{Script=Nandinagari} (65)
  4944. \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
  4945. U+119AA..119D7, U+119DA..119E4)
  4946. \p{Script: Narb} \p{Script=Old_North_Arabian} (=
  4947. \p{Script_Extensions=Old_North_Arabian})
  4948. (32)
  4949. \p{Script: Nbat} \p{Script=Nabataean} (=
  4950. \p{Script_Extensions=Nabataean}) (40)
  4951. \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
  4952. \p{Sc=Talu}, \p{Talu}) (83)
  4953. \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
  4954. Newa}, \p{Newa}) (94)
  4955. \p{Script: Nko} \p{Script_Extensions=Nko} (Short: \p{Sc=
  4956. Nko}, \p{Nko}) (62)
  4957. \p{Script: Nkoo} \p{Script=Nko} (= \p{Script_Extensions=
  4958. Nko}) (62)
  4959. \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
  4960. Nushu}) (397)
  4961. \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
  4962. Nshu}, \p{Nshu}) (397)
  4963. \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
  4964. Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
  4965. Hmnp}, \p{Hmnp}) (71)
  4966. \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
  4967. Ogham}) (29)
  4968. \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
  4969. Ogam}, \p{Ogam}) (29)
  4970. \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
  4971. \p{Sc=Olck}, \p{Olck}) (48)
  4972. \p{Script: Olck} \p{Script=Ol_Chiki} (=
  4973. \p{Script_Extensions=Ol_Chiki}) (48)
  4974. \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
  4975. (Short: \p{Sc=Hung}, \p{Hung}) (108)
  4976. \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
  4977. \p{Sc=Ital}, \p{Ital}) (39)
  4978. \p{Script: Old_North_Arabian} \p{Script_Extensions=
  4979. Old_North_Arabian} (Short: \p{Sc=Narb},
  4980. \p{Narb}) (32)
  4981. \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
  4982. \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
  4983. \p{Sc=Xpeo}, \p{Xpeo}) (50)
  4984. \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
  4985. \p{Sc=Sogo}, \p{Sogo}) (40)
  4986. \p{Script: Old_South_Arabian} \p{Script_Extensions=
  4987. Old_South_Arabian} (Short: \p{Sc=Sarb},
  4988. \p{Sarb}) (32)
  4989. \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
  4990. \p{Sc=Orkh}, \p{Orkh}) (73)
  4991. \p{Script: Oriya} (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
  4992. U+0B05..0B0C, U+0B0F..0B10,
  4993. U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
  4994. ...)
  4995. \p{Script: Orkh} \p{Script=Old_Turkic} (=
  4996. \p{Script_Extensions=Old_Turkic}) (73)
  4997. \p{Script: Orya} \p{Script=Oriya} (90)
  4998. \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
  4999. Osge}, \p{Osge}) (72)
  5000. \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
  5001. Osage}) (72)
  5002. \p{Script: Osma} \p{Script=Osmanya} (=
  5003. \p{Script_Extensions=Osmanya}) (40)
  5004. \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
  5005. \p{Sc=Osma}, \p{Osma}) (40)
  5006. \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
  5007. (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
  5008. \p{Script: Palm} \p{Script=Palmyrene} (=
  5009. \p{Script_Extensions=Palmyrene}) (32)
  5010. \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
  5011. \p{Sc=Palm}, \p{Palm}) (32)
  5012. \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
  5013. \p{Sc=Pauc}, \p{Pauc}) (57)
  5014. \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
  5015. \p{Script_Extensions=Pau_Cin_Hau}) (57)
  5016. \p{Script: Perm} \p{Script=Old_Permic} (43)
  5017. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  5018. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
  5019. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
  5020. \p{Script_Extensions=
  5021. Inscriptional_Pahlavi}) (27)
  5022. \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
  5023. \p{Script: Phnx} \p{Script=Phoenician} (=
  5024. \p{Script_Extensions=Phoenician}) (29)
  5025. \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
  5026. \p{Sc=Phnx}, \p{Phnx}) (29)
  5027. \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
  5028. Miao}) (149)
  5029. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
  5030. \p{Script_Extensions=
  5031. Inscriptional_Parthian}) (30)
  5032. \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
  5033. U+10B80..10B91, U+10B99..10B9C,
  5034. U+10BA9..10BAF)
  5035. \p{Script: Qaac} \p{Script=Coptic} (137)
  5036. \p{Script: Qaai} \p{Script=Inherited} (571)
  5037. \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
  5038. \p{Sc=Rjng}, \p{Rjng}) (37)
  5039. \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
  5040. Rejang}) (37)
  5041. \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50)
  5042. \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
  5043. Runr}, \p{Runr}) (86)
  5044. \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
  5045. Runic}) (86)
  5046. \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
  5047. \p{Sc=Samr}, \p{Samr}) (61)
  5048. \p{Script: Samr} \p{Script=Samaritan} (=
  5049. \p{Script_Extensions=Samaritan}) (61)
  5050. \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
  5051. \p{Script_Extensions=Old_South_Arabian})
  5052. (32)
  5053. \p{Script: Saur} \p{Script=Saurashtra} (=
  5054. \p{Script_Extensions=Saurashtra}) (82)
  5055. \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
  5056. \p{Sc=Saur}, \p{Saur}) (82)
  5057. \p{Script: Sgnw} \p{Script=SignWriting} (=
  5058. \p{Script_Extensions=SignWriting}) (672)
  5059. \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
  5060. U+111D0..111DF)
  5061. \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
  5062. \p{Sc=Shaw}, \p{Shaw}) (48)
  5063. \p{Script: Shaw} \p{Script=Shavian} (=
  5064. \p{Script_Extensions=Shavian}) (48)
  5065. \p{Script: Shrd} \p{Script=Sharada} (94)
  5066. \p{Script: Sidd} \p{Script=Siddham} (=
  5067. \p{Script_Extensions=Siddham}) (92)
  5068. \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
  5069. \p{Sc=Sidd}, \p{Sidd}) (92)
  5070. \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
  5071. \p{Sc=Sgnw}, \p{Sgnw}) (672)
  5072. \p{Script: Sind} \p{Script=Khudawadi} (69)
  5073. \p{Script: Sinh} \p{Script=Sinhala} (110)
  5074. \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
  5075. U+0D85..0D96, U+0D9A..0DB1,
  5076. U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
  5077. \p{Script: Sogd} \p{Script=Sogdian} (42)
  5078. \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
  5079. \p{Script: Sogo} \p{Script=Old_Sogdian} (=
  5080. \p{Script_Extensions=Old_Sogdian}) (40)
  5081. \p{Script: Sora} \p{Script=Sora_Sompeng} (=
  5082. \p{Script_Extensions=Sora_Sompeng}) (35)
  5083. \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
  5084. (Short: \p{Sc=Sora}, \p{Sora}) (35)
  5085. \p{Script: Soyo} \p{Script=Soyombo} (=
  5086. \p{Script_Extensions=Soyombo}) (83)
  5087. \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
  5088. \p{Sc=Soyo}, \p{Soyo}) (83)
  5089. \p{Script: Sund} \p{Script=Sundanese} (=
  5090. \p{Script_Extensions=Sundanese}) (72)
  5091. \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
  5092. \p{Sc=Sund}, \p{Sund}) (72)
  5093. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  5094. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
  5095. \p{Script: Syrc} \p{Script=Syriac} (88)
  5096. \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
  5097. U+070F..074A, U+074D..074F, U+0860..086A)
  5098. \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20: U+1700..170C,
  5099. U+170E..1714)
  5100. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  5101. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
  5102. U+176E..1770, U+1772..1773)
  5103. \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
  5104. U+1970..1974)
  5105. \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
  5106. \p{Sc=Lana}, \p{Lana}) (127)
  5107. \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
  5108. \p{Sc=Tavt}, \p{Tavt}) (72)
  5109. \p{Script: Takr} \p{Script=Takri} (67)
  5110. \p{Script: Takri} (Short: \p{Sc=Takr}) (67: U+11680..116B8,
  5111. U+116C0..116C9)
  5112. \p{Script: Tale} \p{Script=Tai_Le} (35)
  5113. \p{Script: Talu} \p{Script=New_Tai_Lue} (=
  5114. \p{Script_Extensions=New_Tai_Lue}) (83)
  5115. \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
  5116. U+0B85..0B8A, U+0B8E..0B90,
  5117. U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
  5118. \p{Script: Taml} \p{Script=Tamil} (123)
  5119. \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
  5120. Tangut}) (6892)
  5121. \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
  5122. \p{Sc=Tang}, \p{Tang}) (6892)
  5123. \p{Script: Tavt} \p{Script=Tai_Viet} (=
  5124. \p{Script_Extensions=Tai_Viet}) (72)
  5125. \p{Script: Telu} \p{Script=Telugu} (98)
  5126. \p{Script: Telugu} (Short: \p{Sc=Telu}) (98: U+0C00..0C0C,
  5127. U+0C0E..0C10, U+0C12..0C28,
  5128. U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48
  5129. ...)
  5130. \p{Script: Tfng} \p{Script=Tifinagh} (=
  5131. \p{Script_Extensions=Tifinagh}) (59)
  5132. \p{Script: Tglg} \p{Script=Tagalog} (20)
  5133. \p{Script: Thaa} \p{Script=Thaana} (50)
  5134. \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
  5135. \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
  5136. Thai}, \p{Thai}) (86)
  5137. \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
  5138. \p{Sc=Tibt}, \p{Tibt}) (207)
  5139. \p{Script: Tibt} \p{Script=Tibetan} (=
  5140. \p{Script_Extensions=Tibetan}) (207)
  5141. \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
  5142. \p{Sc=Tfng}, \p{Tfng}) (59)
  5143. \p{Script: Tirh} \p{Script=Tirhuta} (82)
  5144. \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
  5145. U+114D0..114D9)
  5146. \p{Script: Ugar} \p{Script=Ugaritic} (=
  5147. \p{Script_Extensions=Ugaritic}) (31)
  5148. \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
  5149. \p{Sc=Ugar}, \p{Ugar}) (31)
  5150. \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
  5151. \p{Sc=Zzzz}, \p{Zzzz}) (976_118 plus all
  5152. above-Unicode code points)
  5153. \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
  5154. Vai}, \p{Vai}) (300)
  5155. \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
  5156. Vai}) (300)
  5157. \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short:
  5158. \p{Sc=Wcho}, \p{Wcho}) (59)
  5159. \p{Script: Wara} \p{Script=Warang_Citi} (=
  5160. \p{Script_Extensions=Warang_Citi}) (84)
  5161. \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
  5162. \p{Sc=Wara}, \p{Wara}) (84)
  5163. \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions=
  5164. Wancho}) (59)
  5165. \p{Script: Xpeo} \p{Script=Old_Persian} (=
  5166. \p{Script_Extensions=Old_Persian}) (50)
  5167. \p{Script: Xsux} \p{Script=Cuneiform} (=
  5168. \p{Script_Extensions=Cuneiform}) (1234)
  5169. \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
  5170. U+A490..A4C6)
  5171. \p{Script: Yiii} \p{Script=Yi} (1220)
  5172. \p{Script: Zanabazar_Square} \p{Script_Extensions=
  5173. Zanabazar_Square} (Short: \p{Sc=Zanb},
  5174. \p{Zanb}) (72)
  5175. \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
  5176. \p{Script_Extensions=Zanabazar_Square})
  5177. (72)
  5178. \p{Script: Zinh} \p{Script=Inherited} (571)
  5179. \p{Script: Zyyy} \p{Script=Common} (7805)
  5180. \p{Script: Zzzz} \p{Script=Unknown} (=
  5181. \p{Script_Extensions=Unknown}) (976_118
  5182. plus all above-Unicode code points)
  5183. \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89:
  5184. U+0640, U+1E900..1E94B, U+1E950..1E959,
  5185. U+1E95E..1E95F)
  5186. \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89)
  5187. \p{Script_Extensions: Aghb} \p{Script_Extensions=
  5188. Caucasian_Albanian} (53)
  5189. \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58:
  5190. U+11700..1171A, U+1171D..1172B,
  5191. U+11730..1173F)
  5192. \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
  5193. \p{Hluw}) (583: U+14400..14646)
  5194. \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1325)
  5195. \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
  5196. (1325: U+0600..0604, U+0606..061C,
  5197. U+061E..06DC, U+06DE..06FF,
  5198. U+0750..077F, U+08A0..08B4 ...)
  5199. \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
  5200. (96: U+0531..0556, U+0559..058A,
  5201. U+058D..058F, U+FB13..FB17)
  5202. \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
  5203. (31)
  5204. \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
  5205. \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
  5206. (61: U+10B00..10B35, U+10B39..10B3F)
  5207. \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  5208. \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  5209. \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
  5210. (121: U+1B00..1B4B, U+1B50..1B7C)
  5211. \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  5212. \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
  5213. U+A6A0..A6F7, U+16800..16A38)
  5214. \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
  5215. \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
  5216. (36: U+16AD0..16AED, U+16AF0..16AF5)
  5217. \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
  5218. U+1BC0..1BF3, U+1BFC..1BFF)
  5219. \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  5220. \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
  5221. \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
  5222. (113: U+0951..0952, U+0964..0965,
  5223. U+0980..0983, U+0985..098C,
  5224. U+098F..0990, U+0993..09A8 ...)
  5225. \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
  5226. (97: U+11C00..11C08, U+11C0A..11C36,
  5227. U+11C38..11C45, U+11C50..11C6C)
  5228. \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
  5229. \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (112)
  5230. \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
  5231. (112: U+02EA..02EB, U+3001..3003,
  5232. U+3008..3011, U+3013..301F,
  5233. U+302A..302D, U+3030 ...)
  5234. \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
  5235. \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
  5236. (109: U+11000..1104D, U+11052..1106F,
  5237. U+1107F)
  5238. \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  5239. \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
  5240. (256: U+2800..28FF)
  5241. \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  5242. \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
  5243. (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
  5244. \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  5245. \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
  5246. U+1735..1736, U+1740..1753)
  5247. \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (90)
  5248. \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
  5249. \p{Cans}) (710: U+1400..167F,
  5250. U+18B0..18F5)
  5251. \p{Script_Extensions: Cans} \p{Script_Extensions=
  5252. Canadian_Aboriginal} (710)
  5253. \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  5254. \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
  5255. U+102A0..102D0)
  5256. \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
  5257. \p{Aghb}) (53: U+10530..10563, U+1056F)
  5258. \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (90:
  5259. U+09E6..09EF, U+1040..1049,
  5260. U+11100..11134, U+11136..11146)
  5261. \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
  5262. U+AA00..AA36, U+AA40..AA4D,
  5263. U+AA50..AA59, U+AA5C..AA5F)
  5264. \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
  5265. \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
  5266. (172: U+13A0..13F5, U+13F8..13FD,
  5267. U+AB70..ABBF)
  5268. \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
  5269. (7386: [\x00-
  5270. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  5271. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  5272. \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
  5273. U+02E5..02E9, U+02EC..02FF, U+0374,
  5274. U+037E ...)
  5275. \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
  5276. \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
  5277. (165: U+03E2..03EF, U+2C80..2CF3,
  5278. U+2CF9..2CFF, U+102E0..102FB)
  5279. \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  5280. \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
  5281. (1234: U+12000..12399, U+12400..1246E,
  5282. U+12470..12474, U+12480..12543)
  5283. \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
  5284. (112: U+10100..10102, U+10107..10133,
  5285. U+10137..1013F, U+10800..10805, U+10808,
  5286. U+1080A..10835 ...)
  5287. \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
  5288. (446: U+0400..052F, U+1C80..1C88,
  5289. U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
  5290. \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
  5291. \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
  5292. (80: U+10400..1044F)
  5293. \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
  5294. \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
  5295. (210: U+0900..0952, U+0955..097F,
  5296. U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
  5297. U+A830..A839 ...)
  5298. \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
  5299. \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
  5300. U+0964..096F, U+A830..A839,
  5301. U+11800..1183B)
  5302. \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  5303. \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
  5304. \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
  5305. (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
  5306. U+1BC80..1BC88, U+1BC90..1BC99,
  5307. U+1BC9C..1BCA3)
  5308. \p{Script_Extensions: Egyp} \p{Script_Extensions=
  5309. Egyptian_Hieroglyphs} (1080)
  5310. \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
  5311. \p{Egyp}) (1080: U+13000..1342E,
  5312. U+13430..13438)
  5313. \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
  5314. \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
  5315. (40: U+10500..10527)
  5316. \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
  5317. \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
  5318. (23: U+10FE0..10FF6)
  5319. \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  5320. \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
  5321. (495: U+1200..1248, U+124A..124D,
  5322. U+1250..1256, U+1258, U+125A..125D,
  5323. U+1260..1288 ...)
  5324. \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (175)
  5325. \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
  5326. (175: U+0589, U+10A0..10C5, U+10C7,
  5327. U+10CD, U+10D0..10FF, U+1C90..1CBA ...)
  5328. \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
  5329. \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
  5330. (136: U+0484, U+0487, U+2C00..2C2E,
  5331. U+2C30..2C5E, U+2E43, U+A66F ...)
  5332. \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
  5333. (65)
  5334. \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
  5335. (77)
  5336. \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  5337. \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
  5338. U+10330..1034A)
  5339. \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
  5340. \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
  5341. (116: U+0951..0952, U+0964..0965,
  5342. U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
  5343. U+1CF2..1CF4 ...)
  5344. \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
  5345. U+0342, U+0345, U+0370..0373,
  5346. U+0375..0377, U+037A..037D, U+037F ...)
  5347. \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
  5348. \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
  5349. (105: U+0951..0952, U+0964..0965,
  5350. U+0A81..0A83, U+0A85..0A8D,
  5351. U+0A8F..0A91, U+0A93..0AA8 ...)
  5352. \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
  5353. \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
  5354. \p{Gong}) (65: U+0964..0965,
  5355. U+11D60..11D65, U+11D67..11D68,
  5356. U+11D6A..11D8E, U+11D90..11D91,
  5357. U+11D93..11D98 ...)
  5358. \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
  5359. (94: U+0951..0952, U+0964..0965,
  5360. U+0A01..0A03, U+0A05..0A0A,
  5361. U+0A0F..0A10, U+0A13..0A28 ...)
  5362. \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
  5363. \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_513:
  5364. U+2E80..2E99, U+2E9B..2EF3,
  5365. U+2F00..2FD5, U+3001..3003,
  5366. U+3005..3011, U+3013..301F ...)
  5367. \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
  5368. \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
  5369. (11_775: U+1100..11FF, U+3001..3003,
  5370. U+3008..3011, U+3013..301F,
  5371. U+302E..3030, U+3037 ...)
  5372. \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_513)
  5373. \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
  5374. \p{Rohg}) (55: U+060C, U+061B, U+061F,
  5375. U+0640, U+06D4, U+10D00..10D27 ...)
  5376. \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  5377. \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
  5378. (23: U+1720..1736)
  5379. \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
  5380. \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
  5381. U+108E0..108F2, U+108F4..108F5,
  5382. U+108FB..108FF)
  5383. \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
  5384. \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
  5385. (134: U+0591..05C7, U+05D0..05EA,
  5386. U+05EF..05F4, U+FB1D..FB36,
  5387. U+FB38..FB3C, U+FB3E ...)
  5388. \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431)
  5389. \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
  5390. (431: U+3001..3003, U+3008..3011,
  5391. U+3013..301F, U+3030..3035, U+3037,
  5392. U+303C..303D ...)
  5393. \p{Script_Extensions: Hluw} \p{Script_Extensions=
  5394. Anatolian_Hieroglyphs} (583)
  5395. \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
  5396. (127)
  5397. \p{Script_Extensions: Hmnp} \p{Script_Extensions=
  5398. Nyiakeng_Puachue_Hmong} (71)
  5399. \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
  5400. (108)
  5401. \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
  5402. \p{Armi}) (31: U+10840..10855,
  5403. U+10857..1085F)
  5404. \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
  5405. (502: U+0300..0341, U+0343..0344,
  5406. U+0346..0362, U+0953..0954,
  5407. U+1AB0..1ABE, U+1DC2..1DF9 ...)
  5408. \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
  5409. \p{Phli}) (27: U+10B60..10B72,
  5410. U+10B78..10B7F)
  5411. \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
  5412. Prti}, \p{Prti}) (30: U+10B40..10B55,
  5413. U+10B58..10B5F)
  5414. \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
  5415. \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  5416. \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
  5417. (91: U+A980..A9CD, U+A9CF..A9D9,
  5418. U+A9DE..A9DF)
  5419. \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87:
  5420. U+0966..096F, U+A830..A839,
  5421. U+11080..110C1, U+110CD)
  5422. \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  5423. \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356)
  5424. \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
  5425. (104: U+0951..0952, U+0964..0965,
  5426. U+0C80..0C8C, U+0C8E..0C90,
  5427. U+0C92..0CA8, U+0CAA..0CB3 ...)
  5428. \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
  5429. (356: U+3001..3003, U+3008..3011,
  5430. U+3013..301F, U+3030..3035, U+3037,
  5431. U+303C..303D ...)
  5432. \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
  5433. (48: U+A900..A92F)
  5434. \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
  5435. \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
  5436. (68: U+10A00..10A03, U+10A05..10A06,
  5437. U+10A0C..10A13, U+10A15..10A17,
  5438. U+10A19..10A35, U+10A38..10A3A ...)
  5439. \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
  5440. U+1780..17DD, U+17E0..17E9,
  5441. U+17F0..17F9, U+19E0..19FF)
  5442. \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  5443. \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
  5444. \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
  5445. U+0AE6..0AEF, U+A830..A839,
  5446. U+11200..11211, U+11213..1123E)
  5447. \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
  5448. (81: U+0964..0965, U+A830..A839,
  5449. U+112B0..112EA, U+112F0..112F9)
  5450. \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104)
  5451. \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87)
  5452. \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  5453. \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
  5454. U+0E81..0E82, U+0E84, U+0E86..0E8A,
  5455. U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
  5456. \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
  5457. \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
  5458. (1387: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
  5459. \xf6\xf8-\xff], U+0100..02B8,
  5460. U+02E0..02E4, U+0363..036F,
  5461. U+0485..0486, U+0951..0952 ...)
  5462. \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1387)
  5463. \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  5464. \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
  5465. U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
  5466. \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
  5467. \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
  5468. U+0965, U+1900..191E, U+1920..192B,
  5469. U+1930..193B, U+1940, U+1944..194F)
  5470. \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
  5471. \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  5472. \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
  5473. (386: U+10107..10133, U+10600..10736,
  5474. U+10740..10755, U+10760..10767)
  5475. \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
  5476. (268: U+10000..1000B, U+1000D..10026,
  5477. U+10028..1003A, U+1003C..1003D,
  5478. U+1003F..1004D, U+10050..1005D ...)
  5479. \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
  5480. U+A4D0..A4FF)
  5481. \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  5482. \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
  5483. U+10280..1029C)
  5484. \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  5485. \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
  5486. U+10920..10939, U+1093F)
  5487. \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
  5488. (61: U+0964..096F, U+A830..A839,
  5489. U+11150..11176)
  5490. \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
  5491. \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
  5492. \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
  5493. (25: U+11EE0..11EF8)
  5494. \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
  5495. (125: U+0951..0952, U+0964..0965,
  5496. U+0D00..0D03, U+0D05..0D0C,
  5497. U+0D0E..0D10, U+0D12..0D44 ...)
  5498. \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  5499. \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
  5500. (30: U+0640, U+0840..085B, U+085E)
  5501. \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
  5502. \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
  5503. (52: U+0640, U+10AC0..10AE6,
  5504. U+10AEB..10AF6)
  5505. \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
  5506. \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
  5507. (68: U+11C70..11C8F, U+11C92..11CA7,
  5508. U+11CA9..11CB6)
  5509. \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
  5510. \p{Gonm}) (77: U+0964..0965,
  5511. U+11D00..11D06, U+11D08..11D09,
  5512. U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
  5513. ...)
  5514. \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
  5515. (91: U+16E40..16E9A)
  5516. \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
  5517. \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
  5518. \p{Mtei}) (79: U+AAE0..AAF6,
  5519. U+ABC0..ABED, U+ABF0..ABF9)
  5520. \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
  5521. (213)
  5522. \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
  5523. \p{Mend}) (213: U+1E800..1E8C4,
  5524. U+1E8C7..1E8D6)
  5525. \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
  5526. (90)
  5527. \p{Script_Extensions: Mero} \p{Script_Extensions=
  5528. Meroitic_Hieroglyphs} (32)
  5529. \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
  5530. \p{Merc}) (90: U+109A0..109B7,
  5531. U+109BC..109CF, U+109D2..109FF)
  5532. \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
  5533. \p{Mero}) (32: U+10980..1099F)
  5534. \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
  5535. U+16F00..16F4A, U+16F4F..16F87,
  5536. U+16F8F..16F9F)
  5537. \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (125)
  5538. \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
  5539. U+A830..A839, U+11600..11644,
  5540. U+11650..11659)
  5541. \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171)
  5542. \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
  5543. (171: U+1800..180E, U+1810..1819,
  5544. U+1820..1878, U+1880..18AA, U+202F,
  5545. U+11660..1166C)
  5546. \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
  5547. U+16A40..16A5E, U+16A60..16A69,
  5548. U+16A6E..16A6F)
  5549. \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
  5550. \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  5551. \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
  5552. \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
  5553. (48: U+0A66..0A6F, U+11280..11286,
  5554. U+11288, U+1128A..1128D, U+1128F..1129D,
  5555. U+1129F..112A9)
  5556. \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
  5557. (224: U+1000..109F, U+A92E,
  5558. U+A9E0..A9FE, U+AA60..AA7F)
  5559. \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
  5560. \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
  5561. (40: U+10880..1089E, U+108A7..108AF)
  5562. \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
  5563. \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
  5564. (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
  5565. U+1CF2, U+1CFA, U+A830..A835 ...)
  5566. \p{Script_Extensions: Narb} \p{Script_Extensions=
  5567. Old_North_Arabian} (32)
  5568. \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
  5569. \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
  5570. (83: U+1980..19AB, U+19B0..19C9,
  5571. U+19D0..19DA, U+19DE..19DF)
  5572. \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (94:
  5573. U+11400..11459, U+1145B, U+1145D..1145F)
  5574. \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62:
  5575. U+07C0..07FA, U+07FD..07FF)
  5576. \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62)
  5577. \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
  5578. \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
  5579. U+16FE1, U+1B170..1B2FB)
  5580. \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
  5581. Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
  5582. U+1E130..1E13D, U+1E140..1E149,
  5583. U+1E14E..1E14F)
  5584. \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  5585. \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
  5586. U+1680..169C)
  5587. \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
  5588. (48: U+1C50..1C7F)
  5589. \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  5590. \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
  5591. \p{Hung}) (108: U+10C80..10CB2,
  5592. U+10CC0..10CF2, U+10CFA..10CFF)
  5593. \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
  5594. (39: U+10300..10323, U+1032D..1032F)
  5595. \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
  5596. \p{Narb}) (32: U+10A80..10A9F)
  5597. \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
  5598. (44: U+0483, U+10350..1037A)
  5599. \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
  5600. (50: U+103A0..103C3, U+103C8..103D5)
  5601. \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
  5602. (40: U+10F00..10F27)
  5603. \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
  5604. \p{Sarb}) (32: U+10A60..10A7F)
  5605. \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
  5606. (73: U+10C00..10C48)
  5607. \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (96:
  5608. U+0951..0952, U+0964..0965,
  5609. U+0B01..0B03, U+0B05..0B0C,
  5610. U+0B0F..0B10, U+0B13..0B28 ...)
  5611. \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  5612. \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (96)
  5613. \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
  5614. U+104B0..104D3, U+104D8..104FB)
  5615. \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
  5616. \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  5617. \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
  5618. (40: U+10480..1049D, U+104A0..104A9)
  5619. \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
  5620. \p{Hmng}) (127: U+16B00..16B45,
  5621. U+16B50..16B59, U+16B5B..16B61,
  5622. U+16B63..16B77, U+16B7D..16B8F)
  5623. \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
  5624. \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
  5625. (32: U+10860..1087F)
  5626. \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
  5627. (57: U+11AC0..11AF8)
  5628. \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
  5629. \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
  5630. \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  5631. \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
  5632. (59: U+1802..1803, U+1805, U+A840..A877)
  5633. \p{Script_Extensions: Phli} \p{Script_Extensions=
  5634. Inscriptional_Pahlavi} (27)
  5635. \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
  5636. (30)
  5637. \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  5638. \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
  5639. (29: U+10900..1091B, U+1091F)
  5640. \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
  5641. \p{Script_Extensions: Prti} \p{Script_Extensions=
  5642. Inscriptional_Parthian} (30)
  5643. \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
  5644. \p{Phlp}) (30: U+0640, U+10B80..10B91,
  5645. U+10B99..10B9C, U+10BA9..10BAF)
  5646. \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
  5647. \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (502)
  5648. \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
  5649. U+A930..A953, U+A95F)
  5650. \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  5651. \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
  5652. (55)
  5653. \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
  5654. U+16A0..16EA, U+16EE..16F8)
  5655. \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
  5656. \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
  5657. (61: U+0800..082D, U+0830..083E)
  5658. \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  5659. \p{Script_Extensions: Sarb} \p{Script_Extensions=
  5660. Old_South_Arabian} (32)
  5661. \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
  5662. \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
  5663. (82: U+A880..A8C5, U+A8CE..A8D9)
  5664. \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
  5665. \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
  5666. (100: U+0951, U+1CD7, U+1CD9,
  5667. U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
  5668. \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
  5669. (48: U+10450..1047F)
  5670. \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  5671. \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
  5672. \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
  5673. \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
  5674. (92: U+11580..115B5, U+115B8..115DD)
  5675. \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
  5676. (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
  5677. U+1DAA1..1DAAF)
  5678. \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
  5679. \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
  5680. \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
  5681. (112: U+0964..0965, U+0D82..0D83,
  5682. U+0D85..0D96, U+0D9A..0DB1,
  5683. U+0DB3..0DBB, U+0DBD ...)
  5684. \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
  5685. \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
  5686. (43: U+0640, U+10F30..10F59)
  5687. \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
  5688. \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  5689. \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
  5690. \p{Sora}) (35: U+110D0..110E8,
  5691. U+110F0..110F9)
  5692. \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
  5693. \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
  5694. (83: U+11A50..11AA2)
  5695. \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  5696. \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
  5697. (72: U+1B80..1BBF, U+1CC0..1CC7)
  5698. \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
  5699. \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
  5700. \p{Sylo}) (56: U+0964..0965,
  5701. U+09E6..09EF, U+A800..A82B)
  5702. \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
  5703. \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
  5704. (105: U+060C, U+061B..061C, U+061F,
  5705. U+0640, U+064B..0655, U+0670 ...)
  5706. \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
  5707. (22: U+1700..170C, U+170E..1714,
  5708. U+1735..1736)
  5709. \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  5710. \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
  5711. (20: U+1735..1736, U+1760..176C,
  5712. U+176E..1770, U+1772..1773)
  5713. \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
  5714. U+1040..1049, U+1950..196D, U+1970..1974)
  5715. \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
  5716. (127: U+1A20..1A5E, U+1A60..1A7C,
  5717. U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
  5718. \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
  5719. (72: U+AA80..AAC2, U+AADB..AADF)
  5720. \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79)
  5721. \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79:
  5722. U+0964..0965, U+A830..A839,
  5723. U+11680..116B8, U+116C0..116C9)
  5724. \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  5725. \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  5726. \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
  5727. U+0951..0952, U+0964..0965,
  5728. U+0B82..0B83, U+0B85..0B8A,
  5729. U+0B8E..0B90, U+0B92..0B95 ...)
  5730. \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
  5731. \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6892)
  5732. \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
  5733. (6892: U+16FE0, U+17000..187F7,
  5734. U+18800..18AF2)
  5735. \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  5736. \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104)
  5737. \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
  5738. (104: U+0951..0952, U+0964..0965,
  5739. U+0C00..0C0C, U+0C0E..0C10,
  5740. U+0C12..0C28, U+0C2A..0C39 ...)
  5741. \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  5742. \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  5743. \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
  5744. \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
  5745. U+060C, U+061B..061C, U+061F,
  5746. U+0660..0669, U+0780..07B1, U+FDF2 ...)
  5747. \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
  5748. U+0E01..0E3A, U+0E40..0E5B)
  5749. \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
  5750. (207: U+0F00..0F47, U+0F49..0F6C,
  5751. U+0F71..0F97, U+0F99..0FBC,
  5752. U+0FBE..0FCC, U+0FCE..0FD4 ...)
  5753. \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  5754. \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
  5755. (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
  5756. \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
  5757. \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
  5758. (97: U+0951..0952, U+0964..0965, U+1CF2,
  5759. U+A830..A839, U+11480..114C7,
  5760. U+114D0..114D9)
  5761. \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  5762. \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
  5763. (31: U+10380..1039D, U+1039F)
  5764. \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
  5765. (976_118 plus all above-Unicode code
  5766. points: U+0378..0379, U+0380..0383,
  5767. U+038B, U+038D, U+03A2, U+0530 ...)
  5768. \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
  5769. U+A500..A62B)
  5770. \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  5771. \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
  5772. U+1E2C0..1E2F9, U+1E2FF)
  5773. \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
  5774. \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
  5775. (84: U+118A0..118F2, U+118FF)
  5776. \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
  5777. \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  5778. \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
  5779. \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
  5780. U+3001..3002, U+3008..3011,
  5781. U+3014..301B, U+30FB, U+A000..A48C,
  5782. U+A490..A4C6 ...)
  5783. \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  5784. \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
  5785. \p{Zanb}) (72: U+11A00..11A47)
  5786. \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
  5787. (72)
  5788. \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (502)
  5789. \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7386)
  5790. \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (976_118
  5791. plus all above-Unicode code points)
  5792. \p{Scx: *} \p{Script_Extensions: *}
  5793. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  5794. \p{SD: *} \p{Soft_Dotted: *}
  5795. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  5796. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
  5797. U+FE52, U+FF0E)
  5798. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
  5799. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
  5800. [\"\'\(\)\[\]\{\}\xab\xbb],
  5801. U+0F3A..0F3D, U+169B..169C,
  5802. U+2018..201F, U+2039..203A, U+2045..2046
  5803. ...)
  5804. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
  5805. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2368)
  5806. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2368: U+0300..036F,
  5807. U+0483..0489, U+0591..05BD, U+05BF,
  5808. U+05C1..05C2, U+05C4..05C5 ...)
  5809. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (63)
  5810. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad],
  5811. U+0600..0605, U+061C, U+06DD, U+070F,
  5812. U+08E2 ...)
  5813. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (121_822)
  5814. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
  5815. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2293)
  5816. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2293: [a-
  5817. z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
  5818. U+0101, U+0103, U+0105, U+0107, U+0109
  5819. ...)
  5820. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (632)
  5821. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (632: [0-9],
  5822. U+0660..0669, U+066B..066C,
  5823. U+06F0..06F9, U+07C0..07C9, U+0966..096F
  5824. ...)
  5825. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_822: U+01BB,
  5826. U+01C0..01C3, U+0294, U+02B9..02BF,
  5827. U+02C6..02D1, U+02EC ...)
  5828. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (984_661 plus all
  5829. above-Unicode code points:
  5830. [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
  5831. Z\[\]a-z\{\}\x85\xa0\xaa-
  5832. \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
  5833. \xf6\xf8-\xff], U+02C2..02C5,
  5834. U+02D2..02DF, U+02E5..02EB, U+02ED,
  5835. U+02EF..02FF ...)
  5836. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  5837. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
  5838. U+055D, U+060C..060D, U+07F8, U+1802,
  5839. U+1808 ...)
  5840. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  5841. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
  5842. U+2028..2029)
  5843. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
  5844. U+1680, U+2000..200A, U+202F, U+205F,
  5845. U+3000)
  5846. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (138)
  5847. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (138: [!?], U+0589,
  5848. U+061E..061F, U+06D4, U+0700..0702,
  5849. U+07F9 ...)
  5850. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1893)
  5851. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1893: [A-Z\xc0-
  5852. \xd6\xd8-\xde], U+0100, U+0102, U+0104,
  5853. U+0106, U+0108 ...)
  5854. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (984_661 plus all
  5855. above-Unicode code points)
  5856. \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
  5857. (141)
  5858. \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
  5859. (1_113_971 plus all above-Unicode code
  5860. points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
  5861. \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
  5862. \xff], U+0100..0588, U+058A..061D,
  5863. U+0620..06D3, U+06D5..06FF, U+0703..07F8
  5864. ...)
  5865. \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (141:
  5866. [!.?], U+0589, U+061E..061F, U+06D4,
  5867. U+0700..0702, U+07F9 ...)
  5868. \p{Separator} \p{General_Category=Separator} (Short:
  5869. \p{Z}) (19)
  5870. \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
  5871. SignWriting}) (672)
  5872. \p{Sharada} \p{Script_Extensions=Sharada} (Short:
  5873. \p{Shrd}; NOT \p{Block=Sharada}) (100)
  5874. \p{Shavian} \p{Script_Extensions=Shavian} (Short:
  5875. \p{Shaw}) (48)
  5876. \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
  5877. Shavian}) (48)
  5878. X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
  5879. (16)
  5880. \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
  5881. Sharada}) (NOT \p{Block=Sharada}) (100)
  5882. \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
  5883. Siddham}) (NOT \p{Block=Siddham}) (92)
  5884. \p{Siddham} \p{Script_Extensions=Siddham} (Short:
  5885. \p{Sidd}; NOT \p{Block=Siddham}) (92)
  5886. \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
  5887. \p{Sgnw}) (672)
  5888. \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
  5889. Khudawadi}) (NOT \p{Block=Khudawadi})
  5890. (81)
  5891. \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
  5892. Sinhala}) (NOT \p{Block=Sinhala}) (112)
  5893. \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
  5894. \p{Sinh}; NOT \p{Block=Sinhala}) (112)
  5895. X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
  5896. \p{Sk} \p{Modifier_Symbol} (=
  5897. \p{General_Category=Modifier_Symbol})
  5898. (121)
  5899. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  5900. Math_Symbol}) (948)
  5901. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  5902. \p{InSmallForms}) (32)
  5903. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  5904. Small_Form_Variants}) (32)
  5905. X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block=
  5906. Small_Kana_Extension}) (64)
  5907. X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
  5908. \p{InSmallKanaExt}) (64)
  5909. \p{So} \p{Other_Symbol} (= \p{General_Category=
  5910. Other_Symbol}) (6161)
  5911. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  5912. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
  5913. all above-Unicode code points: [\x00-
  5914. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  5915. Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
  5916. U+0100..012E, U+0130..0248,
  5917. U+024A..0267, U+0269..029C, U+029E..02B1
  5918. ...)
  5919. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
  5920. U+012F, U+0249, U+0268, U+029D, U+02B2
  5921. ...)
  5922. \p{Sogd} \p{Sogdian} (= \p{Script_Extensions=
  5923. Sogdian}) (NOT \p{Block=Sogdian}) (43)
  5924. \p{Sogdian} \p{Script_Extensions=Sogdian} (Short:
  5925. \p{Sogd}; NOT \p{Block=Sogdian}) (43)
  5926. \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions=
  5927. Old_Sogdian}) (NOT \p{Block=
  5928. Old_Sogdian}) (40)
  5929. \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
  5930. Sora_Sompeng}) (NOT \p{Block=
  5931. Sora_Sompeng}) (35)
  5932. \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
  5933. \p{Sora}; NOT \p{Block=Sora_Sompeng})
  5934. (35)
  5935. \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
  5936. Soyombo}) (NOT \p{Block=Soyombo}) (83)
  5937. \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
  5938. \p{Soyo}; NOT \p{Block=Soyombo}) (83)
  5939. \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
  5940. \p{Space: *} \p{White_Space: *}
  5941. \p{Space_Separator} \p{General_Category=Space_Separator}
  5942. (Short: \p{Zs}) (17)
  5943. \p{SpacePerl} \p{XPosixSpace} (25)
  5944. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  5945. \p{Mc}) (429)
  5946. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  5947. (Short: \p{InModifierLetters}) (80)
  5948. X \p{Specials} \p{Block=Specials} (16)
  5949. \p{STerm} \p{Sentence_Terminal} (=
  5950. \p{Sentence_Terminal=Y}) (141)
  5951. \p{STerm: *} \p{Sentence_Terminal: *}
  5952. \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
  5953. Sundanese}) (NOT \p{Block=Sundanese})
  5954. (72)
  5955. \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
  5956. \p{Sund}; NOT \p{Block=Sundanese}) (72)
  5957. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  5958. Sundanese_Supplement}) (16)
  5959. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  5960. \p{InSundaneseSup}) (16)
  5961. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  5962. Supplemental_Arrows_A}) (16)
  5963. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  5964. Supplemental_Arrows_B}) (128)
  5965. X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
  5966. Supplemental_Arrows_C}) (256)
  5967. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  5968. \p{Block=
  5969. Supplemental_Mathematical_Operators})
  5970. (256)
  5971. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  5972. \p{Block=
  5973. Supplementary_Private_Use_Area_A})
  5974. (65_536)
  5975. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  5976. \p{Block=
  5977. Supplementary_Private_Use_Area_B})
  5978. (65_536)
  5979. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  5980. Supplemental_Punctuation}) (128)
  5981. X \p{Sup_Symbols_And_Pictographs}
  5982. \p{Supplemental_Symbols_And_Pictographs}
  5983. (= \p{Block=
  5984. Supplemental_Symbols_And_Pictographs})
  5985. (256)
  5986. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  5987. \p{Block=Superscripts_And_Subscripts})
  5988. (48)
  5989. X \p{Superscripts_And_Subscripts} \p{Block=
  5990. Superscripts_And_Subscripts} (Short:
  5991. \p{InSuperAndSub}) (48)
  5992. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  5993. \p{InSupArrowsA}) (16)
  5994. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  5995. \p{InSupArrowsB}) (128)
  5996. X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
  5997. \p{InSupArrowsC}) (256)
  5998. X \p{Supplemental_Mathematical_Operators} \p{Block=
  5999. Supplemental_Mathematical_Operators}
  6000. (Short: \p{InSupMathOperators}) (256)
  6001. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  6002. (Short: \p{InSupPunctuation}) (128)
  6003. X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
  6004. Supplemental_Symbols_And_Pictographs}
  6005. (Short: \p{InSupSymbolsAndPictographs})
  6006. (256)
  6007. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  6008. Supplementary_Private_Use_Area_A}
  6009. (Short: \p{InSupPUAA}) (65_536)
  6010. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  6011. Supplementary_Private_Use_Area_B}
  6012. (Short: \p{InSupPUAB}) (65_536)
  6013. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  6014. \p{Cs}) (2048)
  6015. X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
  6016. \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
  6017. Syloti_Nagri}) (NOT \p{Block=
  6018. Syloti_Nagri}) (56)
  6019. \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
  6020. \p{Sylo}; NOT \p{Block=Syloti_Nagri})
  6021. (56)
  6022. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  6023. (7292)
  6024. X \p{Symbols_And_Pictographs_Ext_A}
  6025. \p{Symbols_And_Pictographs_Extended_A}
  6026. (= \p{Block=
  6027. Symbols_And_Pictographs_Extended_A})
  6028. (144)
  6029. X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
  6030. Symbols_And_Pictographs_Extended_A} (144)
  6031. \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
  6032. Syriac}) (NOT \p{Block=Syriac}) (105)
  6033. \p{Syriac} \p{Script_Extensions=Syriac} (Short:
  6034. \p{Syrc}; NOT \p{Block=Syriac}) (105)
  6035. X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
  6036. Syriac_Supplement}) (16)
  6037. X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
  6038. \p{InSyriacSup}) (16)
  6039. \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
  6040. \p{Tglg}; NOT \p{Block=Tagalog}) (22)
  6041. \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
  6042. Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
  6043. \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
  6044. \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
  6045. X \p{Tags} \p{Block=Tags} (128)
  6046. \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
  6047. \p{Tale}; NOT \p{Block=Tai_Le}) (45)
  6048. \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
  6049. \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
  6050. \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
  6051. \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
  6052. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  6053. Tai_Xuan_Jing_Symbols}) (96)
  6054. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  6055. \p{InTaiXuanJing}) (96)
  6056. \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
  6057. (NOT \p{Block=Takri}) (79)
  6058. \p{Takri} \p{Script_Extensions=Takri} (Short:
  6059. \p{Takr}; NOT \p{Block=Takri}) (79)
  6060. \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
  6061. Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
  6062. \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
  6063. New_Tai_Lue}) (NOT \p{Block=
  6064. New_Tai_Lue}) (83)
  6065. \p{Tamil} \p{Script_Extensions=Tamil} (Short:
  6066. \p{Taml}; NOT \p{Block=Tamil}) (133)
  6067. X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block=
  6068. Tamil_Supplement}) (64)
  6069. X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short:
  6070. \p{InTamilSup}) (64)
  6071. \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
  6072. (NOT \p{Block=Tamil}) (133)
  6073. \p{Tang} \p{Tangut} (= \p{Script_Extensions=
  6074. Tangut}) (NOT \p{Block=Tangut}) (6892)
  6075. \p{Tangut} \p{Script_Extensions=Tangut} (Short:
  6076. \p{Tang}; NOT \p{Block=Tangut}) (6892)
  6077. X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
  6078. \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
  6079. Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
  6080. \p{Telu} \p{Telugu} (= \p{Script_Extensions=
  6081. Telugu}) (NOT \p{Block=Telugu}) (104)
  6082. \p{Telugu} \p{Script_Extensions=Telugu} (Short:
  6083. \p{Telu}; NOT \p{Block=Telugu}) (104)
  6084. \p{Term} \p{Terminal_Punctuation} (=
  6085. \p{Terminal_Punctuation=Y}) (264)
  6086. \p{Term: *} \p{Terminal_Punctuation: *}
  6087. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  6088. \p{Term}) (264)
  6089. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  6090. (1_113_848 plus all above-Unicode code
  6091. points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
  6092. 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
  6093. U+0100..037D, U+037F..0386,
  6094. U+0388..0588, U+058A..05C2, U+05C4..060B
  6095. ...)
  6096. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (264:
  6097. [!,.:;?], U+037E, U+0387, U+0589,
  6098. U+05C3, U+060C ...)
  6099. \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
  6100. Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
  6101. \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
  6102. Tagalog}) (NOT \p{Block=Tagalog}) (22)
  6103. \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
  6104. Thaana}) (NOT \p{Block=Thaana}) (66)
  6105. \p{Thaana} \p{Script_Extensions=Thaana} (Short:
  6106. \p{Thaa}; NOT \p{Block=Thaana}) (66)
  6107. \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
  6108. Thai}) (86)
  6109. \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
  6110. \p{Tibt}; NOT \p{Block=Tibetan}) (207)
  6111. \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
  6112. Tibetan}) (NOT \p{Block=Tibetan}) (207)
  6113. \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
  6114. \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
  6115. \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
  6116. Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
  6117. \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
  6118. \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
  6119. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  6120. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  6121. Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
  6122. U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
  6123. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  6124. (Short: \p{Lt}; /i= General_Category=
  6125. Cased_Letter) (31)
  6126. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  6127. Transport_And_Map_Symbols}) (128)
  6128. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  6129. (Short: \p{InTransportAndMap}) (128)
  6130. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  6131. (= \p{Block=
  6132. Unified_Canadian_Aboriginal_Syllabics})
  6133. (640)
  6134. X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
  6135. Extended} (= \p{Block=
  6136. Unified_Canadian_Aboriginal_Syllabics_-
  6137. Extended}) (80)
  6138. \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
  6139. Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
  6140. \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
  6141. \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
  6142. \p{UIdeo} \p{Unified_Ideograph} (=
  6143. \p{Unified_Ideograph=Y}) (87_887)
  6144. \p{UIdeo: *} \p{Unified_Ideograph: *}
  6145. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  6146. \p{Cn}) (836_602 plus all above-Unicode
  6147. code points)
  6148. \p{Unicode} \p{Any} (1_114_112)
  6149. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  6150. Unified_Canadian_Aboriginal_Syllabics}
  6151. (Short: \p{InUCAS}) (640)
  6152. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  6153. Unified_Canadian_Aboriginal_Syllabics_-
  6154. Extended} (Short: \p{InUCASExt}) (80)
  6155. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  6156. (87_887)
  6157. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  6158. (1_026_225 plus all above-Unicode code
  6159. points: U+0000..33FF, U+4DB6..4DFF,
  6160. U+9FF0..FA0D, U+FA10, U+FA12,
  6161. U+FA15..FA1E ...)
  6162. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_887:
  6163. U+3400..4DB5, U+4E00..9FEF,
  6164. U+FA0E..FA0F, U+FA11, U+FA13..FA14,
  6165. U+FA1F ...)
  6166. \p{Unknown} \p{Script_Extensions=Unknown} (Short:
  6167. \p{Zzzz}) (976_118 plus all above-
  6168. Unicode code points)
  6169. \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  6170. Cased=Yes) (1908)
  6171. \p{Upper: *} \p{Uppercase: *}
  6172. \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  6173. Cased=Yes) (1908)
  6174. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  6175. No) (1_112_204 plus all above-Unicode
  6176. code points: [\x00-
  6177. \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  6178. >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
  6179. \xbf\xd7\xdf-\xff], U+0101, U+0103,
  6180. U+0105, U+0107, U+0109 ...)
  6181. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  6182. Yes) (1908: [A-Z\xc0-\xd6\xd8-\xde],
  6183. U+0100, U+0102, U+0104, U+0106, U+0108
  6184. ...)
  6185. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  6186. (Short: \p{Lu}; /i= General_Category=
  6187. Cased_Letter) (1788)
  6188. \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
  6189. Vai}) (300)
  6190. \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
  6191. \p{Block=Vai}) (300)
  6192. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  6193. NOT \p{Variation_Selectors}) (259)
  6194. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
  6195. plus all above-Unicode code points:
  6196. U+0000..180A, U+180E..FDFF,
  6197. U+FE10..E00FF, U+E01F0..infinity)
  6198. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
  6199. U+180B..180D, U+FE00..FE0F,
  6200. U+E0100..E01EF)
  6201. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  6202. \p{InVS}) (16)
  6203. X \p{Variation_Selectors_Supplement} \p{Block=
  6204. Variation_Selectors_Supplement} (Short:
  6205. \p{InVSSup}) (240)
  6206. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  6207. Vedic_Extensions}) (48)
  6208. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  6209. \p{InVedicExt}) (48)
  6210. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  6211. \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
  6212. (787_620 plus all above-Unicode code
  6213. points)
  6214. \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_620 plus
  6215. all above-Unicode code points: [\x00-
  6216. \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
  6217. \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
  6218. U+02EC..10FF, U+1200..1400,
  6219. U+1680..18AF, U+1900..2015 ...)
  6220. \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
  6221. Transformed_Rotated} (47)
  6222. \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
  6223. (47: U+2329..232A, U+3008..3011,
  6224. U+3014..301F, U+3030, U+30A0, U+30FC ...)
  6225. \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
  6226. (148: U+3001..3002, U+3041, U+3043,
  6227. U+3045, U+3047, U+3049 ...)
  6228. \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
  6229. Transformed_Upright} (148)
  6230. \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
  6231. (326_297)
  6232. \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_297:
  6233. [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
  6234. U+02EA..02EB, U+1100..11FF,
  6235. U+1401..167F, U+18B0..18FF, U+2016 ...)
  6236. \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
  6237. \p{Vo: *} \p{Vertical_Orientation: *}
  6238. \p{VS} \p{Variation_Selector} (=
  6239. \p{Variation_Selector=Y}) (NOT
  6240. \p{Variation_Selectors}) (259)
  6241. \p{VS: *} \p{Variation_Selector: *}
  6242. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  6243. \p{Block=
  6244. Variation_Selectors_Supplement}) (240)
  6245. \p{Wancho} \p{Script_Extensions=Wancho} (Short:
  6246. \p{Wcho}; NOT \p{Block=Wancho}) (59)
  6247. \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
  6248. Warang_Citi}) (NOT \p{Block=
  6249. Warang_Citi}) (84)
  6250. \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
  6251. \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
  6252. \p{WB: *} \p{Word_Break: *}
  6253. \p{Wcho} \p{Wancho} (= \p{Script_Extensions=
  6254. Wancho}) (NOT \p{Block=Wancho}) (59)
  6255. \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
  6256. \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
  6257. plus all above-Unicode code points:
  6258. [^\t\n\cK\f\r\x20\x85\xa0],
  6259. U+0100..167F, U+1681..1FFF,
  6260. U+200B..2027, U+202A..202E, U+2030..205E
  6261. ...)
  6262. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25:
  6263. [\t\n\cK\f\r\x20\x85\xa0], U+1680,
  6264. U+2000..200A, U+2028..2029, U+202F,
  6265. U+205F ...)
  6266. \p{Word} \p{XPosixWord} (128_919)
  6267. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_693: [A-Za-
  6268. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  6269. \xff], U+0100..02D7, U+02DE..02E4,
  6270. U+02EC..02FF, U+0370..0374, U+0376..0377
  6271. ...)
  6272. \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
  6273. \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
  6274. \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
  6275. \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0)
  6276. \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
  6277. \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
  6278. \p{Word_Break: EB} \p{Word_Break=E_Base} (0)
  6279. \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0)
  6280. \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0)
  6281. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
  6282. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2372:
  6283. U+0300..036F, U+0483..0489,
  6284. U+0591..05BD, U+05BF, U+05C1..05C2,
  6285. U+05C4..05C5 ...)
  6286. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
  6287. U+203F..2040, U+2054, U+FE33..FE34,
  6288. U+FE4D..FE4F ...)
  6289. \p{Word_Break: FO} \p{Word_Break=Format} (62)
  6290. \p{Word_Break: Format} (Short: \p{WB=FO}) (62: [\xad],
  6291. U+0600..0605, U+061C, U+06DD, U+070F,
  6292. U+08E2 ...)
  6293. \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0)
  6294. \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
  6295. \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
  6296. U+05D0..05EA, U+05EF..05F2, U+FB1D,
  6297. U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
  6298. ...)
  6299. \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75)
  6300. \p{Word_Break: KA} \p{Word_Break=Katakana} (314)
  6301. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035,
  6302. U+309B..309C, U+30A0..30FA,
  6303. U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
  6304. ...)
  6305. \p{Word_Break: LE} \p{Word_Break=ALetter} (28_693)
  6306. \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
  6307. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
  6308. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
  6309. U+05F4, U+2027, U+FE13, U+FE55 ...)
  6310. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
  6311. U+0589, U+060C..060D, U+066C, U+07F8 ...)
  6312. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
  6313. U+2018..2019, U+2024, U+FE52, U+FF07,
  6314. U+FF0E)
  6315. \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
  6316. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  6317. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
  6318. U+2028..2029)
  6319. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  6320. \p{Word_Break: NU} \p{Word_Break=Numeric} (631)
  6321. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (631: [0-9],
  6322. U+0660..0669, U+066B, U+06F0..06F9,
  6323. U+07C0..07C9, U+0966..096F ...)
  6324. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_081_874 plus all
  6325. above-Unicode code points:
  6326. [^\n\cK\f\r\x20\"\',.0-9:;A-Z_a-
  6327. z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
  6328. \xf6\xf8-\xff], U+02D8..02DD,
  6329. U+02E5..02EB, U+0375, U+0378..0379,
  6330. U+0380..0385 ...)
  6331. \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
  6332. U+1F1E6..1F1FF)
  6333. \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  6334. \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
  6335. \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
  6336. \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
  6337. U+1680, U+2000..2006, U+2008..200A,
  6338. U+205F, U+3000)
  6339. \p{Word_Break: XX} \p{Word_Break=Other} (1_081_874 plus all
  6340. above-Unicode code points)
  6341. \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
  6342. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
  6343. \p{WSpace: *} \p{White_Space: *}
  6344. \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  6345. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  6346. (128_770)
  6347. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (985_342
  6348. plus all above-Unicode code points:
  6349. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
  6350. >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
  6351. \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
  6352. U+02C2..02C5, U+02D2..02DF,
  6353. U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  6354. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (128_770:
  6355. [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
  6356. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
  6357. U+02C6..02D1, U+02E0..02E4, U+02EC,
  6358. U+02EE ...)
  6359. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (125_861)
  6360. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (988_251
  6361. plus all above-Unicode code points:
  6362. [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
  6363. >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
  6364. \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
  6365. U+02C2..02C5, U+02D2..02DF,
  6366. U+02E5..02EB, U+02ED, U+02EF..036F ...)
  6367. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (125_861:
  6368. [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
  6369. \xf6\xf8-\xff], U+0100..02C1,
  6370. U+02C6..02D1, U+02E0..02E4, U+02EC,
  6371. U+02EE ...)
  6372. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  6373. (128_770)
  6374. \p{XIDC: *} \p{XID_Continue: *}
  6375. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (125_861)
  6376. \p{XIDS: *} \p{XID_Start: *}
  6377. \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
  6378. Old_Persian}) (NOT \p{Block=
  6379. Old_Persian}) (50)
  6380. \p{XPerlSpace} \p{XPosixSpace} (25)
  6381. \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
  6382. \p{Alnum}) (127_886: [0-9A-Za-
  6383. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  6384. \xff], U+0100..02C1, U+02C6..02D1,
  6385. U+02E0..02E4, U+02EC, U+02EE ...)
  6386. \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
  6387. (127_256)
  6388. \p{XPosixBlank} \h, Horizontal white space (Short:
  6389. \p{Blank}) (18: [\t\x20\xa0], U+1680,
  6390. U+2000..200A, U+202F, U+205F, U+3000)
  6391. \p{XPosixCntrl} \p{General_Category=Control} Control
  6392. characters (Short: \p{Cc}) (65)
  6393. \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
  6394. + all other decimal digits (Short:
  6395. \p{Nd}) (630)
  6396. \p{XPosixGraph} Characters that are graphical (Short:
  6397. \p{Graph}) (275_378:
  6398. [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  6399. Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
  6400. U+0100..0377, U+037A..037F,
  6401. U+0384..038A, U+038C, U+038E..03A1 ...)
  6402. \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
  6403. Cased=Yes) (2340)
  6404. \p{XPosixPrint} Characters that are graphical plus space
  6405. characters (but no controls) (Short:
  6406. \p{Print}) (275_395: [\x20-\x7e\xa0-
  6407. \xff], U+0100..0377, U+037A..037F,
  6408. U+0384..038A, U+038C, U+038E..03A1 ...)
  6409. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (801:
  6410. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  6411. >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
  6412. \xb7\xbb\xbf], U+037E, U+0387,
  6413. U+055A..055F, U+0589..058A, U+05BE ...)
  6414. \p{XPosixSpace} \s including beyond ASCII and vertical tab
  6415. (Short: \p{SpacePerl}) (25:
  6416. [\t\n\cK\f\r\x20\x85\xa0], U+1680,
  6417. U+2000..200A, U+2028..2029, U+202F,
  6418. U+205F ...)
  6419. \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
  6420. Cased=Yes) (1908)
  6421. \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
  6422. \pM + \p{Pc} + \p{Join_Control} (Short:
  6423. \p{Word}) (128_919: [0-9A-Z_a-
  6424. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  6425. \xff], U+0100..02C1, U+02C6..02D1,
  6426. U+02E0..02E4, U+02EC, U+02EE ...)
  6427. \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  6428. \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
  6429. Cuneiform}) (NOT \p{Block=Cuneiform})
  6430. (1234)
  6431. \p{Yi} \p{Script_Extensions=Yi} (1246)
  6432. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  6433. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  6434. \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
  6435. X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
  6436. Yijing_Hexagram_Symbols}) (64)
  6437. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
  6438. (Short: \p{InYijing}) (64)
  6439. \p{Z} \pZ \p{Separator} (= \p{General_Category=
  6440. Separator}) (19)
  6441. \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
  6442. (Short: \p{Zanb}; NOT \p{Block=
  6443. Zanabazar_Square}) (72)
  6444. \p{Zanb} \p{Zanabazar_Square} (=
  6445. \p{Script_Extensions=Zanabazar_Square})
  6446. (NOT \p{Block=Zanabazar_Square}) (72)
  6447. \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
  6448. Inherited}) (502)
  6449. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  6450. Line_Separator}) (1)
  6451. \p{Zp} \p{Paragraph_Separator} (=
  6452. \p{General_Category=
  6453. Paragraph_Separator}) (1)
  6454. \p{Zs} \p{Space_Separator} (=
  6455. \p{General_Category=Space_Separator})
  6456. (17)
  6457. \p{Zyyy} \p{Common} (= \p{Script_Extensions=
  6458. Common}) (7386)
  6459. \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
  6460. Unknown}) (976_118 plus all above-
  6461. Unicode code points)

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This happens generally either because they are obsolete, or they exist for symmetry with other forms, but no language has yet been encoded that uses them. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
  • \p{Grapheme_Cluster_Break=E_Base}
  • \p{Grapheme_Cluster_Break=E_Base_GAZ}
  • \p{Grapheme_Cluster_Break=E_Modifier}
  • \p{Grapheme_Cluster_Break=Glue_After_Zwj}
  • \p{Word_Break=E_Base}
  • \p{Word_Break=E_Base_GAZ}
  • \p{Word_Break=E_Modifier}
  • \p{Word_Break=Glue_After_Zwj}

Properties accessible through Unicode::UCD

The value of any Unicode (not including Perl extensions) character property mentioned above for any single code point is available through charprop() in Unicode::UCD. charprops_all() in Unicode::UCD returns the values of all the Unicode properties for a given code point.

Besides these, all the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.

Due to their nature, not all Unicode character properties are suitable for regular expression matches, nor prop_invlist() . The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. BUT, there are some ambiguities between some Perl extensions and the Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single form equivalents of Unicode properties, such as \p{sc} being a single-form equivalent of \p{gc=sc} , which is treated by prop_invmap() as the Script property, whose short name is sc . The table indicates the current ambiguities in the INFO column, beginning with the word "NOT" .

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are in Other Properties in perlunicode;

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. The annotations for binary properties include a list of the first few ranges that the property matches. To avoid any ambiguity, the SPACE character is represented as \x20 .

If a property is obsolete, etc, the entry will be flagged with the same characters used in the table in the section above, like D or S.

  1. NAME INFO
  2. Age
  3. AHex ASCII_Hex_Digit
  4. All (Perl extension). All code points,
  5. including those above Unicode. Same as
  6. qr/./s. U+0000..infinity
  7. Alnum XPosixAlnum. (Perl extension)
  8. Alpha Alphabetic
  9. Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
  10. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
  11. U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
  12. ...
  13. Any (Perl extension). All Unicode code
  14. points. U+0000..10FFFF
  15. ASCII Block=Basic_Latin. (Perl extension).
  16. [\x00-\x7f]
  17. ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
  18. Assigned (Perl extension). All assigned code
  19. points. U+0000..0377, U+037A..037F,
  20. U+0384..038A, U+038C, U+038E..03A1,
  21. U+03A3..052F ...
  22. Bc Bidi_Class
  23. Bidi_C Bidi_Control
  24. Bidi_Class (Short: bc)
  25. Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
  26. U+202A..202E, U+2066..2069
  27. Bidi_M Bidi_Mirrored
  28. Bidi_Mirrored (Short: Bidi_M).
  29. [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
  30. U+169B..169C, U+2039..203A, U+2045..2046,
  31. U+207D..207E ...
  32. Bidi_Mirroring_Glyph (Short: bmg)
  33. Bidi_Paired_Bracket (Short: bpb)
  34. Bidi_Paired_Bracket_Type (Short: bpt)
  35. Blank XPosixBlank. (Perl extension)
  36. Blk Block
  37. Block (Short: blk)
  38. Bmg Bidi_Mirroring_Glyph
  39. Bpb Bidi_Paired_Bracket
  40. Bpt Bidi_Paired_Bracket_Type
  41. Canonical_Combining_Class (Short: ccc)
  42. Case_Folding (Short: cf)
  43. Case_Ignorable (Short: CI).
  44. [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
  45. U+02B0..036F, U+0374..0375, U+037A,
  46. U+0384..0385, U+0387 ...
  47. Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  48. \xff], U+0100..01BA, U+01BC..01BF,
  49. U+01C4..0293, U+0295..02B8, U+02C0..02C1
  50. ...
  51. Category General_Category
  52. Ccc Canonical_Combining_Class
  53. CE Composition_Exclusion
  54. Cf Case_Folding; NOT 'cf' meaning
  55. 'General_Category=Format'
  56. Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
  57. \xdf], U+0100, U+0102, U+0104, U+0106,
  58. U+0108 ...
  59. Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
  60. \xf6\xf8-\xff], U+0100..0137,
  61. U+0139..018C, U+018E..019A, U+019C..01A9,
  62. U+01AC..01B9 ...
  63. Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
  64. U+0100, U+0102, U+0104, U+0106, U+0108 ...
  65. Changes_When_NFKC_Casefolded (Short: CWKCF). [A-
  66. Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
  67. \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
  68. U+0102, U+0104, U+0106, U+0108 ...
  69. Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
  70. \xff], U+0101, U+0103, U+0105, U+0107,
  71. U+0109 ...
  72. Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
  73. \xff], U+0101, U+0103, U+0105, U+0107,
  74. U+0109 ...
  75. CI Case_Ignorable
  76. Cntrl XPosixCntrl (=General_Category=Control).
  77. (Perl extension)
  78. Comp_Ex Full_Composition_Exclusion
  79. Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
  80. U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
  81. CWCF Changes_When_Casefolded
  82. CWCM Changes_When_Casemapped
  83. CWKCF Changes_When_NFKC_Casefolded
  84. CWL Changes_When_Lowercased
  85. CWT Changes_When_Titlecased
  86. CWU Changes_When_Uppercased
  87. Dash [\-], U+058A, U+05BE, U+1400, U+1806,
  88. U+2010..2015 ...
  89. Decomposition_Mapping (Short: dm)
  90. Decomposition_Type (Short: dt)
  91. Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
  92. U+115F..1160, U+17B4..17B5, U+180B..180E
  93. ...
  94. Dep Deprecated
  95. Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
  96. U+0F79, U+17A3..17A4, U+206A..206F ...
  97. DI Default_Ignorable_Code_Point
  98. Dia Diacritic
  99. Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
  100. U+02B0..034E, U+0350..0357, U+035D..0362,
  101. U+0374..0375, U+037A ...
  102. Digit XPosixDigit (=General_Category=
  103. Decimal_Number). (Perl extension)
  104. Dm Decomposition_Mapping
  105. Dt Decomposition_Type
  106. Ea East_Asian_Width
  107. East_Asian_Width (Short: ea)
  108. EqUIdeo Equivalent_Unified_Ideograph
  109. Equivalent_Unified_Ideograph (Short: EqUIdeo)
  110. Ext Extender
  111. Extender (Short: Ext). [\xb7], U+02D0..02D1,
  112. U+0640, U+07FA, U+0E46, U+0EC6 ...
  113. Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
  114. U+0343..0344, U+0374, U+037E, U+0387,
  115. U+0958..095F ...
  116. Gc General_Category
  117. GCB Grapheme_Cluster_Break
  118. General_Category (Short: gc)
  119. Gr_Base Grapheme_Base
  120. Gr_Ext Grapheme_Extend
  121. Graph XPosixGraph. (Perl extension)
  122. Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-
  123. \xac\xae-\xff], U+0100..02FF,
  124. U+0370..0377, U+037A..037F, U+0384..038A,
  125. U+038C ...
  126. Grapheme_Cluster_Break (Short: GCB)
  127. Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
  128. U+0483..0489, U+0591..05BD, U+05BF,
  129. U+05C1..05C2, U+05C4..05C5 ...
  130. Hangul_Syllable_Type (Short: hst)
  131. Hex Hex_Digit
  132. Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
  133. U+FF21..FF26, U+FF41..FF46
  134. HorizSpace XPosixBlank. (Perl extension)
  135. Hst Hangul_Syllable_Type
  136. D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
  137. U+2E17, U+30FB ... Supplanted by
  138. Line_Break property values; see
  139. www.unicode.org/reports/tr14
  140. ID_Continue (Short: IDC). [0-9A-Z_a-
  141. z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
  142. \xff], U+0100..02C1, U+02C6..02D1,
  143. U+02E0..02E4, U+02EC, U+02EE ...
  144. ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
  145. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
  146. U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
  147. ...
  148. IDC ID_Continue
  149. Ideo Ideographic
  150. Ideographic (Short: Ideo). U+3006..3007,
  151. U+3021..3029, U+3038..303A, U+3400..4DB5,
  152. U+4E00..9FEF, U+F900..FA6D ...
  153. IDS ID_Start
  154. IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
  155. IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
  156. IDSB IDS_Binary_Operator
  157. IDST IDS_Trinary_Operator
  158. In Present_In. (Perl extension)
  159. Indic_Positional_Category (Short: InPC)
  160. Indic_Syllabic_Category (Short: InSC)
  161. InPC Indic_Positional_Category
  162. InSC Indic_Syllabic_Category
  163. Isc ISO_Comment; NOT 'isc' meaning
  164. 'General_Category=Other'
  165. ISO_Comment (Short: isc)
  166. Jg Joining_Group
  167. Join_C Join_Control
  168. Join_Control (Short: Join_C). U+200C..200D
  169. Joining_Group (Short: jg)
  170. Joining_Type (Short: jt)
  171. Jt Joining_Type
  172. Lb Line_Break
  173. Lc Lowercase_Mapping; NOT 'lc' meaning
  174. 'General_Category=Cased_Letter'
  175. Line_Break (Short: lb)
  176. LOE Logical_Order_Exception
  177. Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
  178. U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
  179. ...
  180. Lower Lowercase
  181. Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
  182. \xf6\xf8-\xff], U+0101, U+0103, U+0105,
  183. U+0107, U+0109 ...
  184. Lowercase_Mapping (Short: lc)
  185. Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
  186. U+03D5, U+03F0..03F1, U+03F4..03F6,
  187. U+0606..0608 ...
  188. Na Name
  189. Na1 Unicode_1_Name
  190. Name (Short: na)
  191. Name_Alias
  192. NChar Noncharacter_Code_Point
  193. NFC_QC NFC_Quick_Check
  194. NFC_Quick_Check (Short: NFC_QC)
  195. NFD_QC NFD_Quick_Check
  196. NFD_Quick_Check (Short: NFD_QC)
  197. NFKC_Casefold (Short: NFKC_CF)
  198. NFKC_CF NFKC_Casefold
  199. NFKC_QC NFKC_Quick_Check
  200. NFKC_Quick_Check (Short: NFKC_QC)
  201. NFKD_QC NFKD_Quick_Check
  202. NFKD_Quick_Check (Short: NFKD_QC)
  203. Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
  204. U+FFFE..FFFF, U+1FFFE..1FFFF,
  205. U+2FFFE..2FFFF, U+3FFFE..3FFFF,
  206. U+4FFFE..4FFFF ...
  207. Nt Numeric_Type
  208. Numeric_Type (Short: nt)
  209. Numeric_Value (Short: nv)
  210. Nv Numeric_Value
  211. Pat_Syn Pattern_Syntax
  212. Pat_WS Pattern_White_Space
  213. Pattern_Syntax (Short: Pat_Syn).
  214. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  215. >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
  216. \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
  217. U+2010..2027, U+2030..203E, U+2041..2053,
  218. U+2055..205E, U+2190..245F ...
  219. Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
  220. U+200E..200F, U+2028..2029
  221. PCM Prepended_Concatenation_Mark
  222. Perl_Decimal_Digit (Perl extension)
  223. PerlSpace PosixSpace. (Perl extension)
  224. PerlWord PosixWord. (Perl extension)
  225. PosixAlnum (Perl extension). [0-9A-Za-z]
  226. PosixAlpha (Perl extension). [A-Za-z]
  227. PosixBlank (Perl extension). [\t\x20]
  228. PosixCntrl (Perl extension). ASCII control
  229. characters. ACK, BEL, BS, CAN, CR, DC1,
  230. DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
  231. ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
  232. NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
  233. PosixDigit (Perl extension). [0-9]
  234. PosixGraph (Perl extension).
  235. [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  236. Z\[\\\]\^_`a-z\{\|\}~]
  237. PosixLower (Perl extension). [a-z]
  238. PosixPrint (Perl extension). [\x20-\x7e]
  239. PosixPunct (Perl extension).
  240. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  241. >?\@\[\\\]\^_`\{\|\}~]
  242. PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
  243. PosixUpper (Perl extension). [A-Z]
  244. PosixWord (Perl extension). \w, restricted to
  245. ASCII. [0-9A-Z_a-z]
  246. PosixXDigit ASCII_Hex_Digit. (Perl extension).
  247. [0-9A-Fa-f]
  248. Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
  249. U+070F, U+08E2, U+110BD, U+110CD
  250. Present_In (Short: In). (Perl extension)
  251. Print XPosixPrint. (Perl extension)
  252. Punct General_Category=Punctuation. (Perl
  253. extension).
  254. [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
  255. \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
  256. U+037E, U+0387, U+055A..055F,
  257. U+0589..058A, U+05BE ...
  258. QMark Quotation_Mark
  259. Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
  260. U+2018..201F, U+2039..203A, U+2E42,
  261. U+300C..300F, U+301D..301F ...
  262. Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
  263. Regional_Indicator (Short: RI). U+1F1E6..1F1FF
  264. RI Regional_Indicator
  265. SB Sentence_Break
  266. Sc Script; NOT 'sc' meaning
  267. 'General_Category=Currency_Symbol'
  268. Scf Simple_Case_Folding
  269. Script (Short: sc)
  270. Script_Extensions (Short: scx)
  271. Scx Script_Extensions
  272. SD Soft_Dotted
  273. Sentence_Break (Short: SB)
  274. Sentence_Terminal (Short: STerm). [!.?], U+0589,
  275. U+061E..061F, U+06D4, U+0700..0702, U+07F9
  276. ...
  277. Sfc Simple_Case_Folding
  278. Simple_Case_Folding (Short: scf)
  279. Simple_Lowercase_Mapping (Short: slc)
  280. Simple_Titlecase_Mapping (Short: stc)
  281. Simple_Uppercase_Mapping (Short: suc)
  282. Slc Simple_Lowercase_Mapping
  283. Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
  284. U+0268, U+029D, U+02B2 ...
  285. Space White_Space
  286. SpacePerl XPosixSpace. (Perl extension)
  287. Stc Simple_Titlecase_Mapping
  288. STerm Sentence_Terminal
  289. Suc Simple_Uppercase_Mapping
  290. Tc Titlecase_Mapping
  291. Term Terminal_Punctuation
  292. Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
  293. U+0589, U+05C3, U+060C ...
  294. Title Titlecase. (Perl extension)
  295. Titlecase (Short: Title). (Perl extension). (=
  296. \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
  297. U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
  298. Titlecase_Mapping (Short: tc)
  299. Uc Uppercase_Mapping
  300. UIdeo Unified_Ideograph
  301. Unicode Any. (Perl extension)
  302. Unicode_1_Name (Short: na1)
  303. Unified_Ideograph (Short: UIdeo). U+3400..4DB5,
  304. U+4E00..9FEF, U+FA0E..FA0F, U+FA11,
  305. U+FA13..FA14, U+FA1F ...
  306. Upper Uppercase
  307. Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
  308. U+0100, U+0102, U+0104, U+0106, U+0108 ...
  309. Uppercase_Mapping (Short: uc)
  310. Variation_Selector (Short: VS). U+180B..180D, U+FE00..FE0F,
  311. U+E0100..E01EF
  312. Vertical_Orientation (Short: vo)
  313. VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
  314. U+2028..2029
  315. Vo Vertical_Orientation
  316. VS Variation_Selector
  317. WB Word_Break
  318. White_Space (Short: WSpace).
  319. [\t\n\cK\f\r\x20\x85\xa0], U+1680,
  320. U+2000..200A, U+2028..2029, U+202F, U+205F
  321. ...
  322. Word XPosixWord. (Perl extension)
  323. Word_Break (Short: WB)
  324. WSpace White_Space
  325. XDigit XPosixXDigit (=Hex_Digit). (Perl
  326. extension)
  327. XID_Continue (Short: XIDC). [0-9A-Z_a-
  328. z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
  329. \xff], U+0100..02C1, U+02C6..02D1,
  330. U+02E0..02E4, U+02EC, U+02EE ...
  331. XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
  332. \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
  333. U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
  334. ...
  335. XIDC XID_Continue
  336. XIDS XID_Start
  337. XPerlSpace XPosixSpace. (Perl extension)
  338. XPosixAlnum (Short: Alnum). (Perl extension).
  339. Alphabetic and (decimal) Numeric. [0-9A-
  340. Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
  341. \xff], U+0100..02C1, U+02C6..02D1,
  342. U+02E0..02E4, U+02EC, U+02EE ...
  343. XPosixAlpha Alphabetic. (Perl extension). [A-Za-
  344. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
  345. U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
  346. U+02EC, U+02EE ...
  347. XPosixBlank (Short: Blank). (Perl extension). \h,
  348. Horizontal white space. [\t\x20\xa0],
  349. U+1680, U+2000..200A, U+202F, U+205F,
  350. U+3000
  351. XPosixCntrl General_Category=Control (Short: Cntrl).
  352. (Perl extension). Control characters.
  353. [\x00-\x1f\x7f-\x9f]
  354. XPosixDigit General_Category=Decimal_Number (Short:
  355. Digit). (Perl extension). [0-9] + all
  356. other decimal digits. [0-9],
  357. U+0660..0669, U+06F0..06F9, U+07C0..07C9,
  358. U+0966..096F, U+09E6..09EF ...
  359. XPosixGraph (Short: Graph). (Perl extension).
  360. Characters that are graphical.
  361. [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
  362. Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
  363. U+0100..0377, U+037A..037F, U+0384..038A,
  364. U+038C, U+038E..03A1 ...
  365. XPosixLower Lowercase. (Perl extension). [a-
  366. z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
  367. U+0103, U+0105, U+0107, U+0109 ...
  368. XPosixPrint (Short: Print). (Perl extension).
  369. Characters that are graphical plus space
  370. characters (but no controls). [\x20-
  371. \x7e\xa0-\xff], U+0100..0377,
  372. U+037A..037F, U+0384..038A, U+038C,
  373. U+038E..03A1 ...
  374. XPosixPunct (Perl extension). \p{Punct} + ASCII-range
  375. \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
  376. >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
  377. \xb7\xbb\xbf], U+037E, U+0387,
  378. U+055A..055F, U+0589..058A, U+05BE ...
  379. XPosixSpace (Perl extension). \s including beyond
  380. ASCII and vertical tab.
  381. [\t\n\cK\f\r\x20\x85\xa0], U+1680,
  382. U+2000..200A, U+2028..2029, U+202F, U+205F
  383. ...
  384. XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
  385. \xd6\xd8-\xde], U+0100, U+0102, U+0104,
  386. U+0106, U+0108 ...
  387. XPosixWord (Short: Word). (Perl extension). \w,
  388. including beyond ASCII; = \p{Alnum} + \pM
  389. + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-
  390. z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
  391. U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
  392. U+02EC, U+02EE ...
  393. XPosixXDigit Hex_Digit (Short: XDigit). (Perl
  394. extension). [0-9A-Fa-f], U+FF10..FF19,
  395. U+FF21..FF26, U+FF41..FF46

Properties accessible through other means

Certain properties are accessible also via core function calls. These are:

  1. Lowercase_Mapping lc() and lcfirst()
  2. Titlecase_Mapping ucfirst()
  3. Uppercase_Mapping uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F transliteration escape, and the fc operator.

And, the Name and Name_Aliases properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions; and functions charnames::viacode() , charnames::vianame() , and charnames::string_vianame() (which require a use charnames (); to be specified.

Finally, most properties related to decomposition are accessible via Unicode::Normalize.

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses). As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Extended_Pictographic (XPG)

    Not part of the Unicode Character Database

  • Grapheme_Link (Gr_Link)

    Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib} /unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib} /unicore/mktables and then re-compiling and installing. (%Config is available from the Config module).

Also, perl can be recompiled to operate on an earlier version of the Unicode standard. Further information is at $Config{privlib} /unicore/README.perl.

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation Tests

  • BidiCharacterTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • extracted/DName.txt

    This file adds no new information not already present in other files

  • Index.txt

    Alphabetical index of Unicode characters

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.html

    Describes the format and contents of NamesList.txt

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • NushuSources.txt

    Specifies source material for Nushu characters

  • ReadMe.txt

    Documentation

  • StandardizedVariants.html

    Obsoleted as of Unicode 9.0, but previously provided a visual display of the standard variant sequences derived from StandardizedVariants.txt.

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base http://www.unicode.org/ivd

  • TangutSources.txt

    Specifies source mappings for Tangut ideographs and components. This data file also includes informative radical-stroke values that are used internally by Unicode

  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

  • USourceGlyphs.pdf

    Pictures of the characters in USourceData.txt

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode