练习

最新推荐文章于 2022-05-30 11:05:22 发布

元永真

最新推荐文章于 2022-05-30 11:05:22 发布

阅读量270

点赞数

本文链接：https://blog.csdn.net/weixin_37855495/article/details/67639919

版权

class HtmlParser(object):
    def _get_new_urls(self,page_url,soup):
        new_urls = set()
        normaltitle_data = {}
        '''
        <h3 class="normaltitle">北京行政区酒店</h3>
       '''
        normaltitleS = soup.find_all('h3',class_="normaltitle")
        for normaltitle in normaltitleS:
            normaltitle =normaltitle.get_text()
            if('北京行政区酒店'is normaltitle):
                links = soup.find_all("a",href=re.compile(r"/html5/hotel/sitemap-beijing1/location"))
                for link in links:
                     title = link.get_text()
                     normaltitle_data['title'] =  link['href']
        #            print(normaltitle_data)
        return normaltitle_data

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

元永真

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
练习

class HtmlParser(object): def _get_new_urls(self,page_url,soup): new_urls = set() normaltitle_data = {} ''' 北京行政区酒店 ''' normaltitleS = soup.find_all(
复制链接

扫一扫