[Issue log] Error in re string handling: '_sre.SRE_Match' object has no attribute 'split'

Latest recommended article published on 2024-08-16 13:50:18

abylee

Category: Web scraping. Tags: python, regular expressions, web scraping

Article link: https://blog.csdn.net/abylee/article/details/94289142

While writing a web scraper, I used re.search to extract some information and got this error: '_sre.SRE_Match' object has no attribute 'split'

Original code:
The text on the page looks roughly like 东城区（144） (Dongcheng District, 144). The goal is to output 东城区 and 144 separately.

    for span in page_content.find_all('h3', class_='u-title-3'):  # locate the district/county names of the municipality
        district = span.find_all('span')
        for i in range(len(district)):
            district_str = district[i].get_text()
            district_name = district_str.split('(')[0]
            info1 = re.search(regex, district_str).split('(')[1]  # the line that raises the error
            hos_num = re.sub(r'\)', '', info1)

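Problem: re.search returns a match object, not a string, so it has no split method. To get the matched string, call the match object's group() method.
Modified code: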

            for i in range(len(district)):
                district_str = district[i].get_text()
                district_name = district_str.split('(')[0]
                # group(0) returns the full matched text as a str
                info1 = re.search(regex, district_str).group(0)
                info2 = info1.split('(')[1]
                hos_num = re.sub(r'\)', '', info2)
                print(district_name, hos_num)
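
As a possible simplification of the fix above, capturing groups can pull out the name and the number in a single match, which avoids the later split and re.sub steps. This is only a sketch under assumptions: the post never shows its `regex`, so the pattern `DISTRICT_RE` and the helper `parse_district` below are hypothetical, and the pattern accepts both ASCII and full-width parentheses because the post does not make clear which one the page uses. It also guards against re.search returning None when nothing matches. On newer Python versions the same mistake is reported against `re.Match` rather than `_sre.SRE_Match`.

```python
import re

# Hypothetical pattern (the post does not show its `regex`):
# a name, then a number inside ASCII or full-width parentheses.
DISTRICT_RE = re.compile(r'^(?P<name>[^(（]+)[(（](?P<count>\d+)[)）]$')


def parse_district(text):
    """Return (name, count) for text like '东城区(144)', or None if it does not match."""
    match = DISTRICT_RE.search(text.strip())
    if match is None:  # re.search returns None when nothing matches
        return None
    # group() turns the match object into plain strings
    return match.group('name'), int(match.group('count'))


print(parse_district('东城区(144)'))
print(parse_district('no count here'))
```

Inside the scraper loop, `parse_district(district[i].get_text())` would stand in for the split and re.sub calls, with the None case handled as the page requires.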

Output: (shown as a screenshot in the original post; the image is not preserved)