一次性解决正则表达式

最新推荐文章于 2024-09-14 18:46:05 发布

远方故人

最新推荐文章于 2024-09-14 18:46:05 发布

阅读量154

点赞数

分类专栏：计算机文章标签： java

本文链接：https://blog.csdn.net/qq_49027029/article/details/127878034

版权

计算机专栏收录该内容

11 篇文章 0 订阅

订阅专栏

本文详细介绍了Java中的正则表达式使用，包括字符匹配、预定义字符、数量表示、边界、分组与引用以及预查断言等核心概念，并通过实例展示了如何进行匹配、分割和替换操作。同时，讲解了Pattern类在处理正则表达式中的作用，如lookingAt()方法和find()方法的应用。

摘要由CSDN通过智能技术生成

1.正则表达式

字符匹配 matches

		"Dog".matches("(?i)dog");	// 忽略大小写 : true

		"我爱中国\\中国爱我\\".split("\\\\"); // [我爱中国, 中国爱我]

获得匹配器 Macher

字符

// [abc] {a或b或c 单个字符}
// [^abc] {非 a和b和c 的单个字符 : 补集}
// [a-z] {a到z 的任意单个字符都行}
// [a-c[e-f]] {a-c 或 e-f 单个字符 : 并集}
// [a-c&&[c-f]] {a-c 且 c-f 单个字符 : 交集}


// true
"T".matches("[\tT]"
// true            
"a".matches("[^\tT]"
// true
"a".matches("[a-c]"
// true            
"a".matches("[a-c[e-g]]
// true            
"c".matches("[a-c&&[c-f]]")

预定义字符

// . {排除 \r 和 \n 的 单个字符 : 补集 - 不要写在 [] 里面}
// \\w {[a-zA-Z0-9_]}
// \\W {[^a-zA-Z0-9_]}
// \\s {[\u000b\n\r\t\f]}
// \\S {非空白}
// \\d {[0-9]}
// \\D {[^0-9]}

// false            
"\r".matches(".")
"\n".matches(".")  

    
    
    
    
    
// false    
"罗".matches("\\w");
// true
"_".matches("\\w");
"a".matches("\\w");
"1".matches("\\w");

// true
"\t".matches("\\s");

// true
"1".matches("\\d");

数量表示

表达式	描述
`?`	匹配前面的表达式0个或1个。即表示可选项。
`+`	匹配前面的表达式至少1个。
`*`	匹配前面的表达式0个或多个。
`	`
`{m}`	匹配前面的表达式m个。
`{m,}`	匹配前面的表达式最少m个。
`{m,n}`	匹配前面的表达式最少m个，最多n个。

        // false
        System.out.println("".matches("\\d+"));
        // true
        System.out.println("".matches("\\d+|\\d?"));
        // true
        System.out.println("12".matches("\\d{1}|\\d{1,2}"));

        // 贪婪量词 : 吞下字符串, 从后面找到符合 foo的部分
        // 前面 xfooxxxxxx 符合 .* 返回 xfooxxxxxxfoo -> ORZ -> ORZ1
        "xfooxxxxxxfoo1".replaceAll(".*foo", "ORZ");

        // 逐步量词 : 找到 xfoo 符合 .*?foo
        // 剩下的部分 xxxxxxfoo 符合 .*?foo
        // xfoo -> ORZ xxxxxxfoo -> ORZ -> ORZORZ1
        "xfooxxxxxxfoo1".replaceAll(".*?foo", "ORZ");

        // 独吞量词 : 吞下字符串 发现符合 .*+ 
        // 没有剩余部分匹配 foo 所以无事发生 -> xfooxxxxxxfoo1
        "xfooxxxxxxfoo1".replaceAll(".*+foo", "ORZ");

边界

表达式	描述
`^`	匹配字符串或行开头。
`$`	匹配字符串或行结尾。
`\b`	匹配单词边界。比如`Sheep\b`可以匹配`CodeSheep`末尾的`Sheep`，不能匹配`CodeSheepCode`中的`Sheep`
`\B`	匹配非单词边界。比如`Code\B`可以匹配`HelloCodeSheep`中的`Code`，不能匹配`HelloCode`中的`Code`。

        // [justin ,  monica doggie Irendog]
        // 设置单词边界
        "justin dog monica doggie Irendog".split("\\bdog\\b");

        // [justin ,  monica  ,  dog ,  gie]
        // 设置单词边界
        "justin 罗 monica  罗 dog 罗 gie".split("\\b罗\\b");

        // [justin ,  monica  ,  dog ,  gie]
        // 设置单词边界
        "justin _ monica  _ dog _ gie".split("\\b_\\b");

        // [justin ,  monica  ,  dog ,  gie]
        // 设置单词边界
        "justin 1 monica  1 dog 1 gie".split("\\b1\\b");

        // [justin ,  monica  ,  dog ,  gie]
        // 非单词边界
        "justin \t monica  \t dog \t gie".split("\\B\\t\\B");

分组和引用

表达式	描述
`(expression)`	分组。匹配括号里的整个表达式。
`(?:expression)`	非捕获分组。匹配括号里的整个字符串但不获取匹配结果，拿不到分组引用。
`\num`	对前面所匹配分组的引用。比如`(\d)\1`可以匹配两个相同的数字，`(Code)(Sheep)\1\2`则可以匹配`CodeSheepCodeSheep`。

        // true
        "1111".matches("^(\\d*)|(\\D*)$");

        // true
        "wwww".matches("^((\\d\\d)|(\\D\\D))\\1$");

        // true 第一个分组是非捕获分组, 不算
        "1111".matches("^(?:(\\d\\d)|(\\D\\D))\\1$");

预查断言

表达式	描述
`(?=)`	正向预查。比如`Code(?=Sheep)`能匹配`CodeSheep`中的`Code`，但不能匹配`CodePig`中的`Code`。
`(?!)`	正向否定预查。比如`Code(?!Sheep)`不能匹配`CodeSheep`中的`Code`，但能匹配`CodePig`中的`Code`。
`(?<=)`	反向预查。比如`(?<=Code)Sheep`能匹配`CodeSheep`中的`Sheep`，但不能匹配`ReadSheep`中的`Sheep`。
`(?<!)`	反向否定预查。比如`(?<!Code)Sheep`不能匹配`CodeSheep`中的`Sheep`，但能匹配`ReadSheep`中的`Sheep`。

        // [Hello, World!!]
        "HellosplitWorld!!".split("split(?=World)");

        // [Hello, world!!]
        "Hellosplitworld!!".split("split(?!World)");

        // [hello, world!!]
        "hellosplitworld!!!".split("(?<=hello)split");

        // [Hello, world!!]
        "Hellosplitworld!!!".split("(?<!hello)split");

        // true
        "dogabcabc".matches("((?i)dog([a][b][c]))\\2");

忽略大小写

		"Dog".matches("(?i)dog");	// 忽略大小写 : true

		"我爱中国\\中国爱我\\".split("\\\\"); // [我爱中国, 中国爱我]

2.Pattern

代表 regex 对象

		// 获得 Pattern 对象
		Pattern compile = Pattern.compile(".*foo");
		// 获得比较器对象
        Matcher matcher = compile.matcher("xfoox");
        // ture 开头符合即可
        matcher.lookingAt();

        //                             贪婪量词,  独吞量词,  逐步量词
        String[] regexs = new String[]{".*foo", ".*+foo", ".*?foo"};

        // .*foo find  in xfooxxxxxxxxxxxxxxfoo
        // .*+foo find
        // .*?foo find  in xfoo in xxxxxxxxxxxxxxfoo
        for (String regex : regexs) {
            Pattern compile = Pattern.compile(regex);
            Matcher matcher = compile.matcher("xfooxxxxxxxxxxxxxxfoo");
            // 查询是否有符合的内容
            while (matcher.find()) {
                // 拿到符合的内容
                matcher.group();
            }
        }