正则表达式是一种用于模式匹配或替换的规范。编程时经常遇到字符串查找,匹配,或者屏蔽特殊字符等等,如果采用正则表达式,则可以起到事半功倍的效果。
一、正则表达式基本语法。
1、.----句点符号
句点符号可以匹配任意单个字符,如a-z, 0-9,!,@,#,$等特殊字符,即通常意义上所说的通配符。
Pattern pattern = Pattern.compile( "leng" );
Matcher matcher = pattern.matcher( "len." );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
2、[]----方括号
.句点符号会匹配所有的字符,匹配范围太广。[]方括号可以限定匹配范围,即只能匹配[]中限定的字符。如[ade]、 [a-z]。
Pattern pattern = Pattern.compile( "l[abc]ng" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为false
3、|---或符号
|符号进行的是或运算,即匹配|符号连接的字符。如(a|bc|de)
Pattern pattern = Pattern.compile( "l(a|bc|de)ng" );
Matcher matcher = pattern.matcher( "ldeng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
4、匹配次数符号
*----表示匹配0次或多次
+----表示匹配1次或多次
?----表示匹配0次或1次
{n}----表示匹配n次(n为非负整数)
{m,n}----表示匹配最少m次,最多n次
Pattern pattern = Pattern.compile( "leng*" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
Pattern pattern = Pattern.compile( "leng+" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为false
Pattern pattern = Pattern.compile( "leng?" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
Pattern pattern = Pattern.compile( "leng{1}" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
Pattern pattern = Pattern.compile( "leng{1,3}" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
5、^符号
1) 如果^符号用在[]方括号内部,则表示否定,即不匹配方括号内限定的字符,如[^abc]。
Pattern pattern = Pattern.compile( "[^b]*" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
2)如果^符号用在[]方括号外部,则表示起始,即匹配以^符号引导的开始字符,如^leng.
Pattern pattern = Pattern.compile( "^eng" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( ); //输出为false
6、$符号
$符号用来匹配以$符号引导的结束字符,如eng$
Pattern pattern = Pattern.compile( ".*ang$" );
Matcher matcher = pattern.matcher( "lengxuegang" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
7、Java中常用的特殊符号
\d----表示数字:[0-9]
\D----表示非数字:[^0-9]
\w----表示字符:[a-zA-Z0-9]
\W----表示非字符:[^a-zA-Z0-9]
\s----表示空白字符:[\t\n\r\f]
\S----表示非空白字符:[^\t\n\r\f]
二、示例
1、字符串匹配。匹配以age结尾的字符串
Pattern pattern = Pattern.compile( ".*age$" );
Matcher matcher = pattern.matcher( "Java program language" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true
2、字符串分割。将字符串以某个或某几个字符进行拆分
Pattern pattern = Pattern.compile( "[.!?]" );
String[] array = pattern.split( "Java program language. I like it!" );
for ( int i = 0; i < array.length; i++ ) {
System.out.println( array[i].trim( ) );
}
输出为:
Java program language
I like it
3、字符串替换
1)首次替换
Pattern pattern = Pattern.compile( "Java" );
Matcher matcher = pattern.matcher( "Java program language. I like Java programming");
System.out.println(matcher.replaceFirst( "C++" ));
输出为:
C++ program language. I like Java programming
2)全部替换
Pattern pattern = Pattern.compile( "Java" );
Matcher matcher = pattern.matcher( "Java program language. I like Java programming" );
System.out.println(matcher.replaceAll( "C++" ));
输出为:
C++ program language. I like C++ programming