sre_constants.error: nothing to repeat at position 2 正则表达式

最新推荐文章于 2023-07-26 15:17:13 发布

想穿红色学位服的狐狸

最新推荐文章于 2023-07-26 15:17:13 发布

阅读量4.7k

点赞数

分类专栏： # Python学习笔记 Python 文章标签：正则表达式

本文链接：https://blog.csdn.net/dkolli/article/details/102636355

版权

Python 同时被 2 个专栏收录

17 篇文章 2 订阅

订阅专栏

Python学习笔记

16 篇文章 2 订阅

订阅专栏

	for temperature in dls:
        temperature_pattern = re.compile('<ddclass="txt2">(.*?)</dd>')
        temperature_dd = re.findall(temperature_pattern, temperature)
        # print(temperature_dd)

        #获取最低温对应的正则表达式
        low_temperature_pattern =re.compile('^(*.?)~')
        #^:匹配输入字符串的开始位置。  在源代码中~前的都是最低气温
        low_li = re.findall(low_temperature_pattern, temperature_dd)  
        #利用low_temperature_pattern这个正则表达式，在temperature_dd中寻找，
        #因为是从温度数据中找最低温和最高温
        #我们希望处理的是第一个元素，所以是temperature_dd[0]
        
		#获取最高温对应的正则表达式
        high_temperature_pattern = re.compile('<b>(*.?)</b>')
        high_li = re.findall(high_temperature_pattern, temperature_dd)

提示错误
在这里插入图片描述
错误说是没有可以重复到的，先打印了更往上的“temperature_dd”发现没有问题，那么问题就出在了正则表达式上。

根据上一条打印测试的数据发现开头结尾没有问题。
问题出在了表达式：

(*.?)   #错误的
(.*?)	#正确的

在正则表达式中：

#三个字符分别是这样的含义
.  #匹配除“\n”之外的任何单个字符。
*  #匹配前面的子表达式零次或多次。
?  #当该字符紧跟在任何一个其他限制符（*,+,?，{n}，{n,}，{n,m}）后面时，匹配模式是非贪婪的。--->
		#--->非贪婪模式尽可能少的匹配所搜索的字,而默认的贪婪模式则尽可能多的匹配所搜索的字符串。

修改之后，数据提取正确。
在这里插入图片描述

想穿红色学位服的狐狸

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
sre_constants.error: nothing to repeat at position 2 正则表达式

for temperature in dls: temperature_pattern = re.compile('<ddclass="txt2">(.*?)</dd>') temperature_dd = re.findall(temperature_pattern, temperature) # print(temper...
复制链接

扫一扫