Go常用正则函数整理

bobo-rs

于 2024-09-11 13:39:30 发布

阅读量1k

点赞数 11

分类专栏： Go技术分享文章标签： golang mysql 数据库正则 regexp

本文链接：https://blog.csdn.net/qq_35807303/article/details/142024639

版权

Go技术分享专栏收录该内容

15 篇文章 0 订阅

订阅专栏

前言

在Go语言中，标准库regexp提供了丰富的API来处理正则表达式，支持编译、匹配、查找、替换等操作。以下是一个Go标准库正则函数的常用大全教程及部分使用示例。
在这里插入图片描述

Go标准库常用正则函数

Compile：编译正则表达式，返回一个可用于匹配、查找、替换等操作的*Regexp对象。
MustCompile：功能与Compile相同，但在正则表达式无法编译时会引发panic。
Match：检查字节切片中是否存在匹配正则表达式的子序列。
MatchString：检查字符串中是否存在匹配正则表达式的子串。
Find：在字节切片中查找正则表达式的第一个匹配项，并返回匹配项的位置和子匹配项。
FindString：在字符串中查找正则表达式的第一个匹配项，并返回该匹配项。
FindAll：在字节切片中查找正则表达式的所有匹配项，并返回它们的位置和子匹配项。
FindAllString：在字符串中查找正则表达式的所有匹配项，并返回它们的切片。
ReplaceAll：在字节切片中查找正则表达式的所有匹配项，并将它们替换为指定的替换字符串，返回新的字节切片。
ReplaceAllString：在字符串中查找正则表达式的所有匹配项，并将它们替换为指定的替换字符串，返回新的字符串。
ReplaceAllLiteralString：与ReplaceAllString类似，但不考虑替换字符串中的任何正则表达式元字符。
Expand：根据正则表达式匹配的结果和模板字符串来构造一个新的字符串。
QuoteMeta：将字符串中的特殊正则表达式元字符转义，以便它们可以被视为普通字符。

使用示例

示例1：编译并匹配字符串

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re, err := regexp.Compile(`\d+`)
    if err != nil {
        panic(err)
    }
    fmt.Println(re.MatchString("Hello 123, this is a test.")) // 输出: true
}

示例2：查找并替换字符串中的数字

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`\d+`)
    result := re.ReplaceAllString("Price: 100 USD, Discount: 20%", "XXX")
    fmt.Println(result) // 输出: Price: XXX USD, Discount: XXX%
}

示例3：提取子匹配项

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`(\w+)-(\d+)`)
    str := "user-1234"
    matches := re.FindStringSubmatch(str)
    if len(matches) > 0 {
        fmt.Println("Full match:", matches[0])    // Full match: user-1234
        fmt.Println("User:", matches[1])          // User: user
        fmt.Println("ID:", matches[2])            // ID: 1234
    }
}

示例4：逐行匹配

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`(?m)^go.*$`)
    str := "go is fun\nhello\ngo again"
    matches := re.FindAllString(str, -1)
    fmt.Println(matches) // 输出: [go is fun go again]
}

示例5：使用`FindAllString`提取所有匹配项

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`\b\w+\b`) // 匹配单词边界之间的单词字符序列
    str := "Hello, world! This is a test."
    matches := re.FindAllString(str, -1) // -1 表示查找所有匹配项
    fmt.Println(matches) // 输出: [Hello world This is a test]
}

示例6：使用`Expand`根据模板替换

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`(\w+), (\w+)!`)
    src := "Alice, Bob! Hello, World!"
    result := re.ExpandString(nil, "Hi $2, $1!", re.FindStringSubmatch(src))
    fmt.Println(result) // 输出: Hi Bob, Alice!
}

示例7：使用`Split`按正则表达式分割字符串

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`\s+`) // 匹配一个或多个空白字符
    str := "Hello    world!    This   is   a   test."
    parts := re.Split(str, -1) // -1 表示分割所有匹配项
    fmt.Println(parts) // 输出: [Hello world! This is a test.]
}

示例8：使用`FindStringIndex`获取匹配项的位置

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`\d+`) // 匹配一个或多个数字
    str := "There are 123 apples and 456 oranges."
    loc := re.FindStringIndex(str)
    if loc != nil {
        fmt.Printf("Found at: %d to %d\n", loc[0], loc[1])
        fmt.Printf("Matched text: %s\n", str[loc[0]:loc[1]])
    }
    // 输出:
    // Found at: 7 to 10
    // Matched text: 123
}

示例9：使用`FindAllStringIndex`获取所有匹配项的位置

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`\b\w+\b`) // 匹配单词
    str := "Hello, world! This is a test."
    indices := re.FindAllStringIndex(str, -1)
    for _, loc := range indices {
        fmt.Printf("Found at: %d to %d\n", loc[0], loc[1])
    }
    // 输出所有单词的位置
}

示例10：使用`CompilePOSIX`编译POSIX风格的正则表达式

虽然CompilePOSIX在现代Go程序中使用较少，因为它限制了正则表达式的功能，但以下是一个示例：

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re, err := regexp.CompilePOSIX(`^[a-zA-Z0-9]+$`)
    if err != nil {
        panic(err)
    }
    matched := re.MatchString("username123")
    fmt.Println(matched) // 输出: true
}

请注意，CompilePOSIX使用的是POSIX BRE（基本正则表达式）语法，与Go标准库中的Compile（使用Perl兼容的正则表达式语法）不同。
当然可以，以下是更多关于Go标准库regexp包的使用示例，涵盖了更广泛的功能和场景：

示例11：忽略大小写匹配

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`(?i)hello`) // 使用(?i)标志来忽略大小写
    str := "Hello, world! hello again."
    matched := re.MatchString(str)
    fmt.Println(matched) // 输出: true
}

注意：虽然Go的regexp包本身没有直接的忽略大小写编译选项（如Python的re.IGNORECASE），但你可以通过在正则表达式内部使用(?i)标志（如果正则表达式引擎支持的话，但请注意Go的标准库并不直接支持(?i)，这里只是为了说明概念。在Go中，你应该使用regexp.Compile时传入的regexp.IgnoreCase选项）。然而，Go的标准做法是在编译正则表达式时传递regexp.IgnoreCase选项：

re := regexp.MustCompile(`hello`, regexp.IgnoreCase)

示例12：使用`NamedCaptureGroups`捕获命名组

Go标准库的regexp包不直接支持命名捕获组（如Python的(?P<name>pattern)），但你可以通过索引来访问捕获组。不过，你可以通过命名你的正则表达式组来在代码中实现类似的功能，尽管这不是由正则表达式引擎直接支持的。

示例13：匹配电子邮件地址

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`^[\w-\.]+@([\w-]+\.)+[\w-]{2,4}$`)
    emails := []string{
        "example@example.com",
        "invalid-email",
        "another.valid.email@domain.co.uk",
    }
    for _, email := range emails {
        if re.MatchString(email) {
            fmt.Println(email, "is a valid email")
        } else {
            fmt.Println(email, "is not a valid email")
        }
    }
}

示例14：替换字符串中的URL

package main

import (
    "fmt"
    "regexp"
)

func main() {
    re := regexp.MustCompile(`https?://[\w./?=%&\-]+`)
    str := "Visit https://example.com or http://another.example.org for more info."
    result := re.ReplaceAllString(str, "[URL]")
    fmt.Println(result) // 输出: Visit [URL] or [URL] for more info.
}

示例15：使用`SubexpNames`（虽然Go标准库不直接支持）

由于Go标准库的regexp包不支持命名捕获组，因此没有SubexpNames方法。但是，你可以通过手动维护一个与你的正则表达式中捕获组相对应的名称映射来实现类似的功能。

示例16：使用`ReplaceFunc`进行更复杂的替换

package main

import (
    "fmt"
    "regexp"
    "strings"
)

func main() {
    re := regexp.MustCompile(`\b\w+\b`)
    str := "Hello, world! This is a test."
    result := re.ReplaceAllStringFunc(str, func(match string) string {
        return strings.ToUpper(match)
    })
    fmt.Println(result) // 输出: HELLO, WORLD! THIS IS A TEST.
}

这个示例展示了如何使用ReplaceAllStringFunc方法来对每个匹配项执行一个自定义的替换函数，这里是将所有单词转换为大写。

总结

这些示例应该能够为你提供关于如何在Go中使用regexp包的更多想法和技巧。记得在实际应用中根据你的具体需求选择合适的函数和方法。在实际开发中，你可以根据具体需求选择合适的函数和方法。请注意，这里只提供了部分常用函数和示例。在实际应用中，regexp包的功能远不止于此，您还可以根据需要使用Split、FindIndex、FindStringIndex等其他函数。此外，正则表达式的性能也是一个需要注意的问题。在设计正则表达式时，应尽量避免复杂的嵌套和回溯，以提高匹配效率。同时，对于需要频繁使用的正则表达式，建议使用Compile或MustCompile进行预编译，以提高执行效率。