Expression | Syntax | Description |
---|
Uppercase letter | :Lu | Matches any one capital letter. For example, :Luhe matches "The" but not "the". |
Lowercase letter | :Ll | Matches any one lower case letter. For example, :Llhe matches "the" but not "The". |
Title case letter | :Lt | Matches characters that combine an uppercase letter with a lowercase letter, such as Nj and Dz. |
Modifier letter | :Lm | Matches letters or punctuation, such as commas, cross accents, and double prime, used to indicate modifications to the preceding letter. |
Other letter | :Lo | Matches other letters, such as gothic letter ahsa. |
Decimal digit | :Nd | Matches decimal digits such as 0-9 and their full-width equivalents. |
Letter digit | :Nl | Matches letter digits such as roman numerals and ideographic number zero. |
Other digit | :No | Matches other digits such as old italic number one. |
Open punctuation | :Ps | Matches opening punctuation such as open brackets and braces. |
Close punctuation | :Pe | Matches closing punctuation such as closing brackets and braces. |
Initial quote punctuation | :Pi | Matches initial double quotation marks. |
Final quote punctuation | :Pf | Matches single quotation marks and ending double quotation marks. |
Dash punctuation | :Pd | Matches the dash mark. |
Connector punctuation | :Pc | Matches the underscore or underline mark. |
Other punctuation | :Po | Matches (,), ?, ", !, @, #, %, &, *, \, (:), (;), ', and /. |
Space separator | :Zs | Matches blanks. |
Line separator | :Zl | Matches the Unicode character U+2028. |
Paragraph separator | :Zp | Matches the Unicode character U+2029. |
Non-spacing mark | :Mn | Matches non-spacing marks. |
Combining mark | :Mc | Matches combining marks. |
Enclosing mark | :Me | Matches enclosing marks. |
Math symbol | :Sm | Matches +, =, ~, |, <, and >. |
Currency symbol | :Sc | Matches $ and other currency symbols. |
Modifier symbol | :Sk | Matches modifier symbols such as circumflex accent, grave accent, and macron. |
Other symbol | :So | Matches other symbols, such as the copyright sign, pilcrow sign, and the degree sign. |
Other control | :Cc | Matches Unicode control characters such as TAB and NEWLINE. |
Other format | :Cf | Formatting control character such as the bi-directional control characters. |
Surrogate | :Cs | Matches one half of a surrogate pair. |
Other private-use | :Co | Matches any character from the private-use area. |
Other not assigned | :Cn | Matches characters that do not map to a Unicode character. |