Java：正则表达式

最新推荐文章于 2024-07-14 19:16:06 发布

微凉128

最新推荐文章于 2024-07-14 19:16:06 发布

阅读量265

点赞数

本文链接：https://blog.csdn.net/qq_41504815/article/details/113531390

版权

1、概述

正则表达式,定义了字符串的模式,可以用来搜索,编辑或处理文本,并不仅限于某一种语言在任何语言中都有,但是有细微的差别。

java中在1.4推出java.util.regex包,为我们提供了java使用正则表达式的应用平台
java中 \ 为转移符把有意义字符转换为无意义字符
但是在正则表达式中 \ 也是转移符,把有意义字符转换为无意义字符
在java中使用正则表达式中的 \ 的时候需要使用两个 \

2、语法

在这里插入图片描述

Java 源代码的字符串中的反斜线被解释为 Unicode 转义或其他字符转义。因此必须在字符串字面值中使用两个反斜线，表示正则表达式受到保护，不被 Java 字节码编译器解释。例如，当解释为正则表达式时，字符串字面值 “\b” 与单个退格字符匹配，而 “\b” 与单词边界匹配。字符串字面值 “(hello)” 是非法的，将导致编译时错误；要与字符串 (hello) 匹配，必须使用字符串字面值 “$hello$”。

在这里插入图片描述

3 () [] {}的区别

在这里插入图片描述
练习：

1 匹配整数和小数 ^\d+ (\.\d+)?

^\d 以数字打头
.匹配任意字符,需要转义 .
? 出现0次或1次
+出现1到n次
() 把.和\d+ 看做一个整体

2 匹配电话

^\d{11}$

^1([358][0-9]|4[579]|66|7[0135678]|9[89])[0-9]{8}$

4 Pattern

用于创建一个正则表达式，也可以说创建一个匹配模式，它的构造方法是私有的，不可以直接创建。
可以通过Pattern.complie(String regex)创建一个正则表达式。

使用：

Pattern.split(CharSequence input) ,成员方法, 用于分隔字符串
Pattern.matches (String regex,CharSequence input)，静态方法，用于快速匹配字符串,该方法适合用于只匹配一次,且匹配全部字符串。

实际操作中,有时候也是直接使用String中的方法,比如,分割,替换,验证。

boolean matches(String regex) 验证
String[] split(String regex) 分割
String replaceAll(String regex,String replacement) 替换

	public static  void test(){
		String str = "1.2.3.4.5";
		Pattern pattern = Pattern.compile("\\.");
		// 以 . 分割返回字符串数组
		String[] strs = pattern.split(str);
		for (String string : strs) {
			System.out.println(string);
		}
	}

5、Matcher

构造方法也是私有的，不能随意创建，只能通过Pattern.matcher(CharSequence input)方法得到该类的实例 Matcher m = p.matcher(“aaaaab”);
支持便捷强大的正则匹配操作，包括分组、多次匹配支持。

三个方法：

Matcher.matches()：对整个字符串进行匹配，只有整个字符串都匹配了才返回true
Matcher.lookingAt()：对前面的字符串进行匹配，只有匹配到的字符串在最前面才返回true
Matcher.find()：对字符串进行匹配,匹配到的字符串可以在任何位置

字符串匹配使用：

public static void test() {
		String regexTel = "\\d{11}";
		String tel = "13113113111a";
		// 引擎对象
		Pattern pattern = Pattern.compile(regexTel);
		// 匹配器对象
		Matcher matcher = pattern.matcher(tel);

		// 全词匹配 false
		System.out.println(matcher.matches());
		
		matcher = pattern.matcher(tel);
		// 从前开始匹配 true
		System.out.println(matcher.lookingAt());

		matcher = pattern.matcher(tel);
		// 任意位置 true
		System.out.println(matcher.find());
	}

字符串提取：

	public static void test() { 
		String regexTel = "((.{2,3})的电话是)(\\d{11})";
		String tel = "张三的电话是13113113111李小四的电话是15115115111";

		// 引擎对象
		Pattern pattern = Pattern.compile(regexTel);
		// 匹配器对象
		Matcher matcher = pattern.matcher(tel);

		while (matcher.find()) {
			// 0 和无参 是获取匹配到的数据
			// 1 就是第一组 2 就是第二组
			System.out.println(matcher.group(2) + " : " + matcher.group(3));
			// 匹配元素的起始索引
			System.out.println(matcher.start());
			// 匹配元素的结束索引
			System.out.println(matcher.end());
		}
	}

叠词去重：

public static void main(String[] args) {
		// 还原成 我要学编程
		String string = "我我我,,,我我,我,我要要要要,,,,,,要要,要学,学学学,学,编编程程程,程程程,,,,,程";
		//把逗号 先去掉
		string = string.replaceAll(",", "");

		/**
		 * 使用 find和group 获取数据
		 */
		
		// (.) 任意字符组成, \\1 捕获1次 , 叠词, $1 得到1组,取前面的组1次或多次
		
		// \\1 获取前面组中的数据
		// (\\d)\\1 : 表示连续出现的数字字符 , 比如 11,22,333,44444
		// (\\d)(a)\\1 : 匹配第一个是数字,第二个是a,第三个和第一个是相同的数字 ,比如 1a1 , 2a2, 4a4
		// (\\d)(a)\\2 : 匹配第一个是数字,第二个是a,第三个和第二个是相同的 , 比如 1aa , 2aa , 8aa
		
		// 对每个字进行分组
		String regex = "(.)(\\1+)";
		Pattern pattern = Pattern.compile(regex);
		Matcher matcher = pattern.matcher(string);
		while (matcher.find()) {
			// 第一组 我
			System.out.println(matcher.group(1));
			// 第二组 我我我我我我  比所有的少一个
			System.out.println(matcher.group(2));
		}
		
		/**
		 * 还原成 我要学编程
		 */
		 
		// $1 就等于 group(1)
		// $1 等于所有去重.因为$1就是group(1) 而我们这个正则表达式中 第一组 就是叠词中的一个
		string = string.replaceAll(regex, "$1");
		// $2 就是重复就删一个,因为第二组是叠词中删除一个相同的叠词
		// string = string.replaceAll(regex, "$2");
		System.out.println(string);
	}

微凉128

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Java：正则表达式

1、概述正则表达式,定义了字符串的模式,可以用来搜索,编辑或处理文本,并不仅限于某一种语言在任何语言中都有,但是有细微的差别。java中在1.4推出java.util.regex包,为我们提供了java使用正则表达式的应用平台java中 \ 为转移符把有意义字符转换为无意义字符但是在正则表达式中 \ 也是转移符,把有意义字符转换为无意义字符在java中使用正则表达式中的 \ 的时候需要使用两个 \2、语法Java 源代码的字符串中的反斜线被解释为 Unicode
复制链接

扫一扫