JAVA正则表达式

最新推荐文章于 2024-07-28 19:41:52 发布

pop子

最新推荐文章于 2024-07-28 19:41:52 发布

阅读量92

点赞数

文章标签： java

本文链接：https://blog.csdn.net/qq_40830043/article/details/118032568

版权

一、正则表达式简介

正则表达式定义了字符串的模式。

正则表达式可以用来搜索、编辑或处理文本。

正则表达式并不仅限于某一种语言，但是在每种语言中有细微的差别。

二、Pattern和Matcher

Pattern.complie(String regex)

Pattern类用于创建一个正则表达式,可以通过Pattern.complie(String regex)创建一个正则表达式.

Pattern p=Pattern.compile("\\w+"); 
p.pattern();						//返回 \w+

返回Pattern.complile(String regex)的regex参数

Pattern.split(CharSequence input)

Pattern有一个split(CharSequence input)方法,用于分隔字符串,并返回一个String[]

Pattern p=Pattern.compile("\\d+"); 
String[] str=p.split("我的QQ是:123456我的电话是:978654我的邮箱是:456852@qq.com");

结果:
str[0]=“我的QQ是:”
str[1]=“我的电话是:”
str[2]=“我的邮箱是:”
str[3]=“@qq.com”

Pattern.matcher(String regex,CharSequence input)用于快速匹配字符串,该方法适合用于只匹配一次,且匹配全部字符串.

Pattern.matches("\\d+","2333");		//返回true 
Pattern.matches("\\d+","2333qwq");	//返回false,需要匹配到所有字符串才能返回true,这里qwq不能匹配到

Pattern.matcher(CharSequence input)

Pattern.matcher(CharSequence input)返回一个Matcher对象.
Pattern类只能做一些简单的匹配操作,Matcher类提供了对正则表达式的分组支持,以及对正则表达式的多次匹配支持.

Pattern p=Pattern.compile("\\d+"); 
Matcher m=p.matcher("23qwq33"); 
m.pattern();		//返回p 也就是返回该Matcher对象是由哪个Pattern对象的创建的

Matcher.matches()/ Matcher.lookingAt()/ Matcher.find()

Matcher类提供三个匹配操作方法,三个方法均返回boolean类型,当匹配到时返回true没匹配到则返回false
matches()对整个字符串进行匹配,只有整个字符串都匹配了才返回true

Pattern p=Pattern.compile("\\d+"); 
Matcher m=p.matcher("23qwq33"); 
m.matches();		//返回false,因为qwq不能被\d+匹配,导致整个字符串匹配未成功. 
Matcher m2=p.matcher("2333"); 
m2.matches();		//返回true,因为\d+匹配到了整个字符串

我们现在回头看一下Pattern.matcher(String regex,CharSequence input),它与下面这段代码等价

Pattern.compile(regex).matcher(input).matches()

lookingAt()对前面的字符串进行匹配,只有匹配到的字符串在最前面才返回true

Pattern p=Pattern.compile("\\d+"); 
Matcher m=p.matcher("22bb23"); 
m.lookingAt();		//返回true,因为\d+匹配到了前面的22 
Matcher m2=p.matcher("aa2223"); 
m2.lookingAt();		//返回false,因为\d+不能匹配前面的aa

find()对字符串进行匹配,匹配到的字符串可以在任何位置.

Pattern p=Pattern.compile("\\d+"); 
Matcher m=p.matcher("23qwq33"); 
m.find();			//返回true 
Matcher m2=p.matcher("qwq"); 
m2.find();			//返回false

Mathcer.start()/ Matcher.end()/ Matcher.group()

当使用matches(),lookingAt(),find()执行匹配操作后,就可以利用以上三个方法得到更详细的信息.

start()返回匹配到的子字符串在字符串中的索引位置.
end()返回匹配到的子字符串的最后一个字符在字符串中的索引位置.
group()返回匹配到的子字符串

Pattern p=Pattern.compile("\\d+"); 
Matcher m=p.matcher("qwq2333xs"); 
m.find();			//匹配2333 
m.start();			//返回3 
m.end();			//返回7
m.group();			//返回2333 
 
Mathcer m2=m.matcher("2333qwq"); 
m.lookingAt();   	//匹配2333 
m.start();			//返回0
m.end();   			//返回4 
m.group(); 			//返回2333 
 
Matcher m3=m.matcher("2333qwq"); 
m.matches();		//匹配整个字符串 
m.start();   		//返回0
m.end();   			//返回7
m.group();   		//返回2333qwq

三、常用正则表达式

一、校验数字的表达式

1 数字：^[0-9]*$
2 n位的数字：^\d{n}$
3 至少n位的数字：^\d{n,}$
4 m-n位的数字：^\d{m,n}$
5 零和非零开头的数字：^(0|[1-9][0-9]*)$
6 非零开头的最多带两位小数的数字：^([1-9][0-9]*)+(.[0-9]{1,2})?$
7 带1-2位小数的正数或负数：^(\-)?\d+(\.\d{1,2})?$
8 正数、负数、和小数：^(\-|\+)?\d+(\.\d+)?$
9 有两位小数的正实数：^[0-9]+(.[0-9]{2})?$
10 有1~3位小数的正实数：^[0-9]+(.[0-9]{1,3})?$
11 非零的正整数：^[1-9]\d*$ 或 ^([1-9][0-9]*){1,3}$ 或 ^\+?[1-9][0-9]*$
12 非零的负整数：^\-[1-9][]0-9"*$ 或 ^-[1-9]\d*$
13 非负整数：^\d+$ 或 ^[1-9]\d*|0$
14 非正整数：^-[1-9]\d*|0$ 或 ^((-\d+)|(0+))$
15 非负浮点数：^\d+(\.\d+)?$ 或 ^[1-9]\d*\.\d*|0\.\d*[1-9]\d*|0?\.0+|0$
16 非正浮点数：^((-\d+(\.\d+)?)|(0+(\.0+)?))$ 或 ^(-([1-9]\d*\.\d*|0\.\d*[1-9]\d*))|0?\.0+|0$
17 正浮点数：^[1-9]\d*\.\d*|0\.\d*[1-9]\d*$ 或 ^(([0-9]+\.[0-9]*[1-9][0-9]*)|([0-9]*[1-9][0-9]*\.[0-9]+)|([0-9]*[1-9][0-9]*))$
18 负浮点数：^-([1-9]\d*\.\d*|0\.\d*[1-9]\d*)$ 或 ^(-(([0-9]+\.[0-9]*[1-9][0-9]*)|([0-9]*[1-9][0-9]*\.[0-9]+)|([0-9]*[1-9][0-9]*)))$
19 浮点数：^(-?\d+)(\.\d+)?$ 或 ^-?([1-9]\d*\.\d*|0\.\d*[1-9]\d*|0?\.0+|0)$


二、校验字符的表达式

1 汉字：^[\u4e00-\u9fa5]{0,}$
2 英文和数字：^[A-Za-z0-9]+$ 或 ^[A-Za-z0-9]{4,40}$
3 长度为3-20的所有字符：^.{3,20}$
4 由26个英文字母组成的字符串：^[A-Za-z]+$
5 由26个大写英文字母组成的字符串：^[A-Z]+$
6 由26个小写英文字母组成的字符串：^[a-z]+$
7 由数字和26个英文字母组成的字符串：^[A-Za-z0-9]+$
8 由数字、26个英文字母或者下划线组成的字符串：^\w+$ 或 ^\w{3,20}$
9 中文、英文、数字包括下划线：^[\u4E00-\u9FA5A-Za-z0-9_]+$
10 中文、英文、数字但不包括下划线等符号：^[\u4E00-\u9FA5A-Za-z0-9]+$ 或 ^[\u4E00-\u9FA5A-Za-z0-9]{2,20}$
11 可以输入含有^%&',;=?$\"等字符：[^%&',;=?$\x22]+
12 禁止输入含有~的字符：[^~\x22]+

四、正则表达式的应用

在爬虫中能有所作用，能提取目标信息

pop子

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
JAVA正则表达式

一、正则表达式简介正则表达式定义了字符串的模式。正则表达式可以用来搜索、编辑或处理文本。正则表达式并不仅限于某一种语言，但是在每种语言中有细微的差别。二、Pattern和MatcherPattern.complie(String regex)Pattern类用于创建一个正则表达式,可以通过Pattern.complie(String regex)创建一个正则表达式.Pattern p=Pattern.compile("\\w+"); p.pattern(); //返回 \w+
复制链接

扫一扫