Python web scraping, data parsing: regular expressions and matching multiple characters with re.match()

1.  * matches the preceding pattern zero or more times; \d* matches zero or more digit characters

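import re

text = "123h1ello world"
text1 = "123Hello world456"
text2 = "hello world"

# Raw strings (r"...") keep \d from being read as a string escape.
# re.match() only tries to match at the start of the string.
res = re.match(r"\d*", text)
res1 = re.match(r"\d*", text1)
res2 = re.match(r"\d*", text2)  # no leading digits: matches the empty string

print(res.group())
print(res1.group())
print(res2.group())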

Output (the third line is empty because \d* matched zero digits at the start of text2):

123
123


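
Because * also allows zero repetitions, re.match() with \d* never returns None; it returns a match object whose text may be empty. The sketch below illustrates that pitfall. The helper name leading_digits is hypothetical and not part of the original post.

```python
import re

def leading_digits(s):
    """Return the run of digits at the start of s ('' if there is none)."""
    # \d* can match zero digits, so re.match() always succeeds here.
    return re.match(r"\d*", s).group()

m = re.match(r"\d*", "hello world")
print(m is not None)    # True: a zero-length match is still a match
print(repr(m.group()))  # ''
print(m.span())         # (0, 0)
print(leading_digits("123Hello world456"))  # 123
```

If an empty result should count as "no match", check the matched text itself rather than testing only whether a match object was returned.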

2.  + matches the preceding pattern one or more times; \w+ matches one or more letters, digits, or underscores

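import re

text = "he+llo world"
text1 = "Hello world456"
text2 = "+hello world"

# \w+ needs at least one word character at the start of the string.
res = re.match(r"\w+", text)    # stops at "+", which is not a word character
res1 = re.match(r"\w+", text1)  # stops at the space
res2 = re.match(r"\w+", text2)  # starts with "+", so there is no match

print(res.group())
print(res1.group())
print(res2)  # res2 is None, so calling res2.group() would raise AttributeError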

Output:

he
Hello
None

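
When + finds nothing to match at the start of the string, re.match() returns None, and calling .group() on None raises AttributeError. This is why the example prints res2 directly. A minimal sketch of guarding against that case follows; the helper name first_word is hypothetical.

```python
import re

def first_word(s):
    """Return the leading run of word characters, or None if s does not start with one."""
    m = re.match(r"\w+", s)
    # re.match() returns None on failure, so test the result before calling .group().
    return m.group() if m else None

print(first_word("he+llo world"))  # he
print(first_word("+hello world"))  # None
```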

3.  ? matches the preceding pattern zero or one time; \w? matches zero or one letter, digit, or underscore

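import re

text = "he+llo world"
text1 = "Hello world456"
text2 = "+hello world"

# \w? takes at most one word character from the start of the string.
res = re.match(r"\w?", text)
res1 = re.match(r"\w?", text1)
res2 = re.match(r"\w?", text2)  # "+" is not a word character: matches the empty string

print(res.group())
print(res1.group())
print(res2.group())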

Output (the third line is empty because \w? matched zero characters at the start of text2):

h
H


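
A common use of ? is to make a single character optional in front of a required part. The sketch below assumes a pattern for an optional minus sign followed by digits; the pattern and the helper name signed_prefix are illustrative and not from the original post.

```python
import re

def signed_prefix(s):
    """Return a leading integer with an optional minus sign, or None."""
    # -? makes the minus sign optional; \d+ then requires at least one digit.
    m = re.match(r"-?\d+", s)
    return m.group() if m else None

for s in ["-42 degrees", "42 degrees", "--42"]:
    print(repr(s), "->", signed_prefix(s))
```

The last input fails because ? allows at most one minus sign, and the character after it must be a digit.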

4.  {m} matches the preceding pattern exactly m times; \d{11} matches exactly 11 digit characters

import re

text = "he+llo world"
text1 = "Hello world456"
text2 = "hello world"

# \w{m} takes exactly m word characters from the start of the string.
res = re.match(r"\w{2}", text)
res1 = re.match(r"\w{3}", text1)
res2 = re.match(r"\w{4}", text2)

print(res.group())
print(res1.group())
print(res2.group())

Output:

he
Hel
hell

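
Note that re.match() only anchors the pattern at the start of the string, so \d{11} also matches the first 11 digits of a longer number. If the whole string has to be exactly 11 digits, re.fullmatch() is one option. A minimal sketch, with the hypothetical helper name is_11_digits:

```python
import re

def is_11_digits(s):
    """True only if s consists of exactly 11 digits and nothing else."""
    return re.fullmatch(r"\d{11}", s) is not None

s = "123456789012"  # 12 digits
print(re.match(r"\d{11}", s).group())  # 12345678901 (first 11 digits only)
print(is_11_digits(s))                 # False
print(is_11_digits("12345678901"))     # True
```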

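5.  {m,n} matches the preceding pattern from m to n times, taking as many as possible; \w{2,4} matches 2 to 4 letters, digits, or underscores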

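import re

text = "hello world"
text1 = "Helloworld456"
text2 = "hello world"

# {m,n} is greedy: it takes up to n repetitions when they are available.
res = re.match(r"\w{2,5}", text)
res1 = re.match(r"\w{6,8}", text1)
res2 = re.match(r"\w{20,25}", text2)  # fewer than 20 word characters: no match

print(res.group())
print(res1.group())
print(res2)  # None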

Output:

hello
Hellowor
None

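
The output "Hellowor" shows that {m,n} is greedy: it takes the maximum of 8 characters when that many are available. Appending ? to the quantifier makes it lazy, so it stops at the minimum. The sketch below contrasts the two on the same text1 string; the variable names are illustrative.

```python
import re

text = "Helloworld456"

greedy = re.match(r"\w{6,8}", text)  # takes as many as allowed (8)
lazy = re.match(r"\w{6,8}?", text)   # takes as few as allowed (6)

print(greedy.group())  # Hellowor
print(lazy.group())    # Hellow

# With fewer than n characters available, the greedy form takes what is there.
short = re.match(r"\w{2,5}", "hi there")
print(short.group())   # hi
```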
