=begin comment # !!!!!!! DO NOT EDIT THIS FILE !!!!!!! # This file is machine-generated by lib/unicore/mktables from the Unicode # database, Version 14.0.0. Any changes made here will be lost! To change this file, edit lib/unicore/mktables instead. =end comment =head1 NAME perluniprops - Index of Unicode Version 14.0.0 character properties in Perl =head1 DESCRIPTION This document provides information about the portion of the Unicode database that deals with character properties, that is the portion that is defined on single code points. (L below briefly mentions other data that Unicode provides.) Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See L.) For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard L module, as described in the section L. Perl also provides some additional extensions and short-cut synonyms for Unicode properties. This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see L for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in L, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is L. Note that you can define your own properties; see L. =head1 Properties accessible through C<\p{}> and C<\P{}> The Perl regular expression C<\p{}> and C<\P{}> constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms. B consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against, for example, C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean to match characters whose Script_Extensions property value is Greek. (C is an improved version of the C