Python正则表达式及re模块使用

1、 正则表达式

元字符:具有固定含义的特殊符号:
常用元字符:

. 匹配除了换行以外的任意字符
\w 匹配字母或者数字或下划线
\s 匹配任意的空白符
\d 匹配数字
\n 匹配一个换行符
\t 匹配一个制表符

^ 匹配字符串的开始
$ 匹配字符串的结尾

\W 匹配非字母或数字或下划线
\D 匹配非数字
\S 匹配非空白符
a|b 匹配字符a或字符b
() 匹配括号内的表达式,也表示一个组
[…] 匹配字符组中的字符
[^…] 匹配除了字符组中的所有字符

量词:控制前面的元字符出现的次数

* 重复零次或者更多次
+ 重复一次或更多次
? 重复零次或者一次
{n} 重复n次
{n,} 重复n次或者更多次
{n,m} 重复n到m次

贪婪匹配和惰性匹配
.* 贪婪匹配,尽可能多的去匹配结果
.*? 惰性匹配,尽可能少的去匹配结果----->回溯

正则匹配测试地址:
tool.oschina.net/regex

2、re模块使用

2.1 查找

结果=re.findall(正则,字符串),建议在正则前使用r,防止转义,其中结果返回的是列表

res1=re.findall(r'\d+',"我今年35了,年薪40w。")
print(res1)
结果
['35', '40']

结果=re.finditer(正则,字符串),建议在正则前使用r,防止转义,其中结果返回的是迭代器,迭代器中返回的是一个Match对象,从每一个macth对象中通过group分组中拿取结果。

res2=re.finditer(r'\d+',"我今年35了,年薪40w。")
print(res1)
for it in res2:
    print(it)
    print(it.group())
 
 结果:
<re.Match object; span=(3, 5), match='35'>
35
<re.Match object; span=(9, 11), match='40'>
40

结果=re.search(正则,字符串) 建议在正则前使用r,防止转义,其中结果返回的是是一个Match对象,从每一个macth对象中通过group分组中拿取结果。search方法进行全局搜索,搜索到第一个结果了,就会返回,不再继续向下搜索。

res3=re.search(r'\d+',"我今年35了,年薪40w。")
print(res3)
print(res3.group())
结果
<re.Match object; span=(3, 5), match='35'>
35

obj=re.compile(正则表达式) 编译好一个正则表达式,然后通过obj.findall(字符串)进行快速匹配,可以提升效率,

我们还是拿上述示例进行测试

str="我今年35了,年薪40w。"
obj=re.compile(r'\d+')
res1=obj.findall(str)
res2=obj.finditer(str)
res3=obj.search(str)
print(res1)
for it in res2:
    print(it)
    print(it.group())
print(res3)
print(res3.group())

结果:
['35', '40']
<re.Match object; span=(3, 5), match='35'>
35
<re.Match object; span=(9, 11), match='40'>
40
<re.Match object; span=(3, 5), match='35'>
35

    <div id="screening" class="s" data-dstat-areaid="70" data-dstat-mode="click,expose">
        <div class="screening-hd">
                <div class="ui-slide-control">
                    <span class="prev-btn"><a class="btn-prev" href="javascript:void(0)"></a></span>
                    <span class="next-btn"><a class="btn-next" href="javascript:void(0)"></a></span>
                </div>
                <div class="slide-tip"><span class="ui-slide-index">1</span> / <span class="ui-slide-max">3</span></div>
            <h2>正在热映<span><a onclick="moreurl(this, {from:'mv_l_a'})" href="/cinema/nowplaying/">全部正在热映&raquo;</a></span><span><a onclick="moreurl(this, {from:'mv_l_w'})" href="./later/">即将上映&raquo;</a></span></h2>
        </div>
        <div class="screening-bd">
            <ul class="ui-slide-content" data-slide-index="1" data-index-max="3">
                        
                        
                    <li class="ui-slide-item s" data-dstat-areaid=70_1 data-dstat-mode=click,expose data-dstat-watch=.ui-slide-content data-dstat-viewport=.screening-bd data-title="神奇动物:邓布利多之谜 Fantastic Beasts: The Secrets of Dumbledore" data-release="2022" data-rate="6.2" data-star="35" data-trailer="https://movie.douban.com/subject/26147418/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=26147418" data-duration="142分钟" data-region="英国" data-director="大卫·叶茨" data-actors="埃迪·雷德梅恩 / 裘德·洛 / 麦斯·米科尔森" data-intro="" data-enough="true" data-rater="54539">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/26147418/?from=showing">
                                    <img src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p2871106106.webp" alt="神奇动物:邓布利多之谜" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/26147418/?from=showing" class="">神奇动物:邓...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.2</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=26147418" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="新蝙蝠侠 The Batman" data-release="2022" data-rate="7.6" data-star="40" data-trailer="https://movie.douban.com/subject/6424756/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=6424756" data-duration="175分钟" data-region="美国" data-director="马特·里夫斯" data-actors="罗伯特·帕丁森 / 佐伊·克罗维兹 / 保罗·达诺" data-intro="" data-enough="true" data-rater="159306">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/6424756/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2868425292.webp" alt="新蝙蝠侠" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/6424756/?from=showing" class="">新蝙蝠侠</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar40"></span><span class="subject-rate">7.6</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=6424756" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="月球陨落 Moonfall" data-release="2022" data-rate="5.4" data-star="30" data-trailer="https://movie.douban.com/subject/26825482/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=26825482" data-duration="130分钟" data-region="美国" data-director="罗兰·艾默里奇" data-actors="哈莉·贝瑞 / 帕特里克·威尔森 / 约翰·布莱德利" data-intro="" data-enough="true" data-rater="61959">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/26825482/?from=showing">
                                    <img src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p2869422067.webp" alt="月球陨落" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/26825482/?from=showing" class="">月球陨落</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar30"></span><span class="subject-rate">5.4</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=26825482" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="密室逃生2 Escape Room  2" data-release="2021" data-rate="6.2" data-star="35" data-trailer="https://movie.douban.com/subject/30469922/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=30469922" data-duration="88分钟(院线版)" data-region="美国" data-director="亚当·罗比特尔" data-actors="泰勒·拉塞尔 / 洛根·米勒 / 霍兰·罗登" data-intro="" data-enough="true" data-rater="58604">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/30469922/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2870297032.webp" alt="密室逃生2" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/30469922/?from=showing" class="">密室逃生2</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.2</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=30469922" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="精灵旅社4:变身大冒险 Hotel Transylvania 4: Transformania" data-release="2022" data-rate="6.3" data-star="35" data-trailer="https://movie.douban.com/subject/30472643/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=30472643" data-duration="88分钟" data-region="美国" data-director="德里克·德莱蒙" data-actors="布赖恩·哈尔 / 安迪·萨姆伯格 / 赛琳娜·戈麦斯" data-intro="" data-enough="true" data-rater="21500">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/30472643/?from=showing">
                                    <img src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p2870571717.webp" alt="精灵旅社4:变身大冒险" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/30472643/?from=showing" class="">精灵旅社4:...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.3</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=30472643" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item s" data-dstat-areaid=70_2 data-dstat-mode=click,expose data-dstat-watch=.ui-slide-content data-dstat-viewport=.screening-bd data-title="神秘海域 Uncharted" data-release="2022" data-rate="6.3" data-star="35" data-trailer="https://movie.douban.com/subject/3822687/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=3822687" data-duration="116分钟" data-region="美国" data-director="鲁本·弗雷斯彻" data-actors="汤姆·赫兰德 / 马克·沃尔伯格 / 索菲娅·阿里" data-intro="" data-enough="true" data-rater="38054">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/3822687/?from=showing">
                                    <img src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p2869751944.webp" alt="神秘海域" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/3822687/?from=showing" class="">神秘海域</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.3</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=3822687" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="花束般的恋爱 花束みたいな恋をした" data-release="2021" data-rate="8.6" data-star="45" data-trailer="https://movie.douban.com/subject/34874432/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=34874432" data-duration="124分钟" data-region="日本" data-director="土井裕泰" data-actors="菅田将晖 / 有村架纯 / 细田佳央太" data-intro="" data-enough="true" data-rater="394348">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/34874432/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2868462052.webp" alt="花束般的恋爱" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/34874432/?from=showing" class="">花束般的恋爱...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar45"></span><span class="subject-rate">8.6</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=34874432" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="这个杀手不太冷静" data-release="2022" data-rate="6.3" data-star="35" data-trailer="https://movie.douban.com/subject/35505100/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35505100" data-duration="109分钟" data-region="中国大陆" data-director="邢文雄" data-actors="马丽 / 魏翔 / 陈明昊" data-intro="" data-enough="true" data-rater="383169">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35505100/?from=showing">
                                    <img src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2814949620.webp" alt="这个杀手不太冷静" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35505100/?from=showing" class="">这个杀手不太...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.3</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35505100" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="卧鼠藏虫" data-release="2022" data-rate="" data-star="00" data-trailer="https://movie.douban.com/subject/35390245/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35390245" data-duration="96分钟" data-region="中国大陆" data-director="文松" data-actors="文松 / 贾冰 / 崔志佳" data-intro="" data-enough="false" data-rater="391">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35390245/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2829062662.webp" alt="卧鼠藏虫" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35390245/?from=showing" class="">卧鼠藏虫</a>
                            </li>
                            <li class="rating">
                                    

                                        <span class="text-tip">暂无评分</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35390245" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="狙击手" data-release="2022" data-rate="7.7" data-star="40" data-trailer="https://movie.douban.com/subject/35215390/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35215390" data-duration="96分钟" data-region="中国大陆" data-director="张艺谋" data-actors="陈永胜 / 章宇 / 张译" data-intro="" data-enough="true" data-rater="228589">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35215390/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2738601191.webp" alt="狙击手" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35215390/?from=showing" class="">狙击手</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar40"></span><span class="subject-rate">7.7</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35215390" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item s" data-dstat-areaid=70_3 data-dstat-mode=click,expose data-dstat-watch=.ui-slide-content data-dstat-viewport=.screening-bd data-title="诺亚方舟漂流记2 Ooops! The Adventure Continues" data-release="2020" data-rate="" data-star="00" data-trailer="https://movie.douban.com/subject/35210321/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35210321" data-duration="86分钟" data-region="德国" data-director="托比·格恩科尔" data-actors="Max / 德莫特·莫根尼斯 / Ava" data-intro="" data-enough="false" data-rater="242">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35210321/?from=showing">
                                    <img src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p2869809668.webp" alt="诺亚方舟漂流记2" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35210321/?from=showing" class="">诺亚方舟漂流...</a>
                            </li>
                            <li class="rating">
                                    

                                        <span class="text-tip">暂无评分</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35210321" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="奇迹·笨小孩" data-release="2022" data-rate="7.4" data-star="40" data-trailer="https://movie.douban.com/subject/35312437/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35312437" data-duration="106分钟" data-region="中国大陆" data-director="文牧野" data-actors="易烊千玺 / 田雨 / 陈哈琳" data-intro="" data-enough="true" data-rater="316259">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35312437/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2842327103.webp" alt="奇迹·笨小孩" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35312437/?from=showing" class="">奇迹·笨小孩...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar40"></span><span class="subject-rate">7.4</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35312437" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="长津湖之水门桥" data-release="2022" data-rate="7.2" data-star="35" data-trailer="https://movie.douban.com/subject/35613853/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35613853" data-duration="149分钟" data-region="中国大陆" data-director="徐克" data-actors="吴京 / 易烊千玺 / 朱亚文" data-intro="" data-enough="true" data-rater="240778">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35613853/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2846021991.webp" alt="长津湖之水门桥" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35613853/?from=showing" class="">长津湖之水门...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">7.2</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35613853" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="熊出没·重返地球" data-release="2022" data-rate="6.5" data-star="35" data-trailer="https://movie.douban.com/subject/35377026/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=35377026" data-duration="99分钟" data-region="中国大陆" data-director="林汇达" data-actors="张秉君 / 张伟 / 谭笑" data-intro="" data-enough="true" data-rater="20405">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/35377026/?from=showing">
                                    <img src="https://img2.doubanio.com/view/photo/s_ratio_poster/public/p2856825681.webp" alt="熊出没·重返地球" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/35377026/?from=showing" class="">熊出没·重返...</a>
                            </li>
                            <li class="rating">
                                        <span class="rating-star allstar35"></span><span class="subject-rate">6.5</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=35377026" target="_blank">选座购票</a></span></li>
                        </ul>
                        
                        
                    <li class="ui-slide-item"  data-title="安魂" data-release="2022" data-rate="" data-star="00" data-trailer="https://movie.douban.com/subject/34808495/trailer" data-ticket="https://movie.douban.com/ticket/redirect/?movie_id=34808495" data-duration="107分钟" data-region="中国大陆" data-director="日向寺太郎" data-actors="巍子 / 陈瑾 / 强宇" data-intro="" data-enough="false" data-rater="176">
                        <ul class="">
                            <li class="poster">
                                <a onclick="moreurl(this, {from:'mv_a_pst'})" href="https://movie.douban.com/subject/34808495/?from=showing">
                                    <img src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2851606190.webp" alt="安魂" rel="nofollow" class="" />
                                </a>
                            </li>
                            <li class="title">
                                <a onclick="moreurl(this, {from:'mv_a_tl'})" href="https://movie.douban.com/subject/34808495/?from=showing" class="">安魂</a>
                            </li>
                            <li class="rating">
                                    

                                        <span class="text-tip">暂无评分</span>
                            </li>
                                <li class="ticket_btn"><span><a onclick="moreurl(this, {from:'mv_b_tc'})" href="https://movie.douban.com/ticket/redirect/?movie_id=34808495" target="_blank">选座购票</a></span></li>
                        </ul>
            </ul>
        </div>
    </div>

爬取data-title

obj=re.compile(r'data-title="(?P<data_title>.*?)"',re.S)
res=obj.finditer(str)
for it in res:
    print(it.group("data_title"))

结果:
神奇动物:邓布利多之谜 Fantastic Beasts: The Secrets of Dumbledore
新蝙蝠侠 The Batman
月球陨落 Moonfall
密室逃生2 Escape Room  2
精灵旅社4:变身大冒险 Hotel Transylvania 4: Transformania
神秘海域 Uncharted
花束般的恋爱 花束みたいな恋をした
这个杀手不太冷静
卧鼠藏虫
狙击手
诺亚方舟漂流记2 Ooops! The Adventure Continues
奇迹·笨小孩
长津湖之水门桥
熊出没·重返地球
安魂
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值