Regular Expression Cheat Sheet (.NET)

Metacharacters Defined

MChar    Definition
^        Start of a string.
$        End of a string.
.        Any character (except \n newline).
|        Alternation.
{...}    Explicit quantifier notation.
[...]    Explicit set of characters to match.
(...)    Logical grouping of part of an expression.
*        0 or more of the previous expression.
+        1 or more of the previous expression.
?        0 or 1 of the previous expression; placed after another quantifier, it forces minimal matching when an expression might match several strings within a search string.
\        Preceding one of the above, it makes it a literal instead of a special character. Preceding a special matching character, see below.
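The two less obvious rows above, backslash escaping and minimal matching with `?`, can be tried out with a short C# sketch. It assumes a console project that allows top-level statements (C# 9 or later); the input strings and variable names are illustrative only.

```csharp
using System;
using System.Text.RegularExpressions;

// Verbatim strings (@"...") keep backslashes intact, so the pattern reads as written.
// "." is a metacharacter; "\." is a literal period.
Console.WriteLine(Regex.IsMatch("1x5", @"^1.5$"));   // True: "." matches any character
Console.WriteLine(Regex.IsMatch("1x5", @"^1\.5$"));  // False: "\." matches only "."
Console.WriteLine(Regex.IsMatch("1.5", @"^1\.5$"));  // True

// Regex.Escape inserts the backslashes needed to treat text literally.
Console.WriteLine(Regex.Escape("a.c"));              // a\.c

// Greedy vs. minimal matching: "?" after a quantifier makes it lazy.
string input = "<b>one</b><b>two</b>";
string greedy = Regex.Match(input, @"<b>.*</b>").Value;
string lazy = Regex.Match(input, @"<b>.*?</b>").Value;
Console.WriteLine(greedy); // <b>one</b><b>two</b>
Console.WriteLine(lazy);   // <b>one</b>
```

The greedy pattern runs to the last `</b>` in the input, while the lazy one stops at the first.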

Metacharacter Examples

Pattern     Sample Matches
^abc        abc, abcdefg, abc123, ...
abc$        abc, endsinabc, 123abc, ...
a.c         abc, aac, acc, adc, aec, ...
bill|ted    ted, bill
ab{2}c      abbc
a[bB]c      abc, aBc
(abc){2}    abcabc
ab*c        ac, abc, abbc, abbbc, ...
ab+c        abc, abbc, abbbc, ...
ab?c        ac, abc
a\sc        a c
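The sample matches above can be checked mechanically. The sketch below, again assuming C# top-level statements, pairs each pattern (written with backslashes, as .NET expects) with one sample from the table and one counterexample; the counterexamples are not from the original and were chosen only for illustration.

```csharp
using System;
using System.Text.RegularExpressions;

// Each pattern from the table, with one sample that should match
// and one illustrative string that should not.
var cases = new (string Pattern, string Match, string NoMatch)[]
{
    (@"^abc",     "abcdefg",   "xabc"),
    (@"abc$",     "endsinabc", "abcx"),
    (@"a.c",      "adc",       "ac"),
    (@"bill|ted", "ted",       "bob"),
    (@"ab{2}c",   "abbc",      "abc"),
    (@"a[bB]c",   "aBc",       "aXc"),
    (@"(abc){2}", "abcabc",    "abc"),
    (@"ab*c",     "ac",        "adc"),
    (@"ab+c",     "abbbc",     "ac"),
    (@"ab?c",     "abc",       "abbc"),
    (@"a\sc",     "a c",       "abc"),
};

int failures = 0;
foreach (var (pattern, match, noMatch) in cases)
{
    // Regex.IsMatch searches anywhere in the input unless the pattern is anchored.
    bool ok = Regex.IsMatch(match, pattern) && !Regex.IsMatch(noMatch, pattern);
    if (!ok) failures++;
    Console.WriteLine($"{pattern,-10} matches \"{match}\", rejects \"{noMatch}\": {ok}");
}
Console.WriteLine($"failures: {failures}");
```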

 

Character Escapes http://msdn.microsoft.com/library/en-us/cpgenref/html/cpconcharacterescapes.asp

Escaped Char           Description
ordinary characters    Characters other than . $ ^ { [ ( | ) ] } * + ? \ match themselves.
\a                     Matches a bell (alarm) \u0007.
\b                     Matches a backspace \u0008 if in a [] character class; otherwise matches a word boundary (between \w and \W characters).
\t                     Matches a tab \u0009.
\r                     Matches a carriage return \u000D.
\v                     Matches a vertical tab \u000B.
\f                     Matches a form feed \u000C.
\n                     Matches a new line \u000A.
\e                     Matches an escape \u001B.
\040                   Matches an ASCII character as octal (up to three digits); numbers with no leading zero are backreferences if they have only one digit or if they correspond to a capturing group number. (For more information, see Backreferences.) For example, the character \040 represents a space.
\x20                   Matches an ASCII character using hexadecimal representation (exactly two digits).
\cC                    Matches an ASCII control character; for example, \cC is control-C.
\u0020                 Matches a Unicode character using a hexadecimal representation (exactly four digits).
\*                     When followed by a character that is not recognized as an escaped character, matches that character. For example, \* is the same as \x2A.
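A few of these escapes are easier to trust after seeing them run. The following C# sketch (top-level statements assumed, inputs illustrative) shows the three numeric spellings of a space and the two meanings of `\b`, all written with backslashes as .NET expects.

```csharp
using System;
using System.Text.RegularExpressions;

// Three spellings of the same character: a space (U+0020).
Console.WriteLine(Regex.IsMatch("a b", @"a\x20b"));   // True (hex, two digits)
Console.WriteLine(Regex.IsMatch("a b", @"a\u0020b")); // True (Unicode, four digits)
Console.WriteLine(Regex.IsMatch("a b", @"a\040b"));   // True (octal)

// Outside a character class, \b is a word boundary...
Console.WriteLine(Regex.Matches("cat concat cat", @"\bcat\b").Count); // 2
// ...but inside [] it is the backspace character \u0008.
Console.WriteLine(Regex.IsMatch("a\u0008b", @"a[\b]b")); // True

// \cC is control-C (\u0003); \e is escape (\u001B).
Console.WriteLine(Regex.IsMatch("\u0003", @"\cC")); // True
Console.WriteLine(Regex.IsMatch("\u001B", @"\e"));  // True

// An escaped metacharacter matches itself: \* is the same as \x2A.
Console.WriteLine(Regex.IsMatch("2*3", @"2\*3"));   // True
Console.WriteLine(Regex.IsMatch("2*3", @"2\x2A3")); // True
```

Note that the input strings use ordinary C# string escapes (`"\u0008"`), while the patterns use verbatim strings so the regex engine itself sees the backslash.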

 

Character Classes http://msdn.microsoft.com/library/en-us/cpgenref/html/cpconcharacterclasses.asp

Char Class     Description
.              Matches any character except \n. If modified by the Singleline option, a period character matches any character. For more information, see Regular Expression Options.
[aeiou]        Matches any single character included in the specified set of characters.
[^aeiou]       Matches any single character not in the specified set of characters.
[0-9a-fA-F]    Use of a hyphen (-) allows specification of contiguous character ranges.
\p{name}       Matches any character in the named character class specified by {name}. Supported names are Unicode groups and block ranges. For example, Ll, Nd, Z, IsGreek, IsBoxDrawing.
\P{name}       Matches text not included in groups and block ranges specified in {name}.
\w             Matches any word character. Equivalent to the Unicode character categories [\p{Ll}\p{Lu}\p{Lt}\p{Lo}\p{Nd}\p{Pc}]. If ECMAScript-compliant behavior is specified with the ECMAScript option, \w is equivalent to [a-zA-Z_0-9].
\W             Matches any nonword character. Equivalent to the Unicode categories [^\p{Ll}\p{Lu}\p{Lt}\p{Lo}\p{Nd}\p{Pc}]. If ECMAScript-compliant behavior is specified with the ECMAScript option, \W is equivalent to [^a-zA-Z_0-9].
\s             Matches any white-space character. Equivalent to the Unicode character categories [\f\n\r\t\v\x85\p{Z}]. If ECMAScript-compliant behavior is specified with the ECMAScript option, \s is equivalent to [ \f\n\r\t\v].
\S             Matches any non-white-space character. Equivalent to the Unicode character categories [^\f\n\r\t\v\x85\p{Z}]. If ECMAScript-compliant behavior is specified with the ECMAScript option, \S is equivalent to [^ \f\n\r\t\v].
\d             Matches any decimal digit. Equivalent to \p{Nd} for Unicode and [0-9] for non-Unicode, ECMAScript behavior.
\D             Matches any nondigit. Equivalent to \P{Nd} for Unicode and [^0-9] for non-Unicode, ECMAScript behavior.
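The option-dependent rows above (Singleline for the period, ECMAScript for `\d` and `\w`) and the named classes can be illustrated with a small C# sketch. It assumes top-level statements; the sample characters (an Arabic-Indic digit, a Greek alpha, an accented "e") are my own choices, not part of the original.

```csharp
using System;
using System.Text.RegularExpressions;

// "." stops at \n unless RegexOptions.Singleline is set.
Console.WriteLine(Regex.IsMatch("a\nb", @"a.b"));                          // False
Console.WriteLine(Regex.IsMatch("a\nb", @"a.b", RegexOptions.Singleline)); // True

// Named classes: \p{Lu} is an uppercase letter, \p{IsGreek} a character in the Greek block.
Console.WriteLine(Regex.Match("hello World", @"\p{Lu}").Value); // W
Console.WriteLine(Regex.IsMatch("\u03B1", @"\p{IsGreek}"));     // True (Greek small alpha)
Console.WriteLine(Regex.IsMatch("a", @"\P{IsGreek}"));          // True (not Greek)

// \d is Unicode-aware by default; the ECMAScript option narrows it to [0-9].
string arabicIndicThree = "\u0663"; // ARABIC-INDIC DIGIT THREE, category Nd
Console.WriteLine(Regex.IsMatch(arabicIndicThree, @"\d"));                          // True
Console.WriteLine(Regex.IsMatch(arabicIndicThree, @"\d", RegexOptions.ECMAScript)); // False

// The same applies to \w: U+00E9 is a letter (Ll) but not in [a-zA-Z_0-9].
string eAcute = "\u00E9";
Console.WriteLine(Regex.IsMatch(eAcute, @"\w"));                          // True
Console.WriteLine(Regex.IsMatch(eAcute, @"\w", RegexOptions.ECMAScript)); // False
```

When input validation should accept only ASCII digits, `[0-9]` or the ECMAScript option is the safer spelling than a bare `\d`.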

 