python_re模块 —— 正则表达式操作

最新推荐文章于 2023-07-03 18:20:59 发布

qqgirllianxin

最新推荐文章于 2023-07-03 18:20:59 发布

阅读量457

点赞数

分类专栏： Python 文章标签：正则表达式

本文链接：https://blog.csdn.net/qqgirllianxin/article/details/39578271

版权

Python 专栏收录该内容

7 篇文章 0 订阅

订阅专栏

1. 正则表达式语法

2. 常用的正则表达式处理函数

re.match尝试从字符串的开始匹配一个模式

import re
text = "Elaine is a beautiful girl, she is cool,clever, and so on..."

#match从字符串的开始匹配一个模式
m = re.match(r"(\w+)\s",text)

if m:
    print m.group()
else:
    print 'not match'

re.search 从字符串的任一位置去匹配一个模式

re.match("c","abcdef")    # No Match
re.search("c", "abcdef")  # Match

re.sub 替换字符串中的匹配项

re.sub(r'\s+','-',string)

#将string串中的空格替换成"-"

re.split 分割字符串

#将字符串按空格分割成一个单词列表
re.split(r'\s+',text)

re.findall 获取字符串中所有匹配的串

#匹配所有包含o的单词
re.findall(r’\w*o\w*',text)

>>>['cool', 'so', 'on']

re.complie 把正则表达式编译成一个正则表达式对象，可以复用该正则表达式

import re

text = "Elaine is a beautiful girl, she is cool,clever, and so on..."
pattern = re.compile(r'\w*o\w*')
newstring = lambda m: '[' + m.group(0) + ']'
print pattern.findall(text)   
print pattern.sub(newstring, text) 
#将单词中包含o的用“[]"括起来

4. 正则表达式实例

实例1.

teststr1 = "800-123-4234"

pattern1 = re.compile(r'^\d{3}-\d{3}-\d{4}$')

print pattern1.findall(teststr1)

实例2.

teststr2 = "email: elaine@163.com, email: xx_xx@sina.org"

pattern2 = re.compile(r"\w+:\s+\w+@\w+\.(?:org|com|net)") #（）中的？：表示括号内的内容不做为分组

print pattern2.findall(teststr2)

实例3.

teststring=["HELLO world","hello world!","hello world"]

expressions = ["hello","HELLO"]

for string in teststring:

for expression in expressions:

if re.search(expression,string):

print expression , "found in string" ,string

else:

print expression , "not found in string" ,string