Python爬虫-7-BeautifulSoup简单案例

最新推荐文章于 2022-10-05 22:53:06 发布

VIP文章 karry_孙二

最新推荐文章于 2022-10-05 22:53:06 发布

阅读量831

点赞数

分类专栏： Python爬虫

本文链接：https://blog.csdn.net/qq_39620483/article/details/83141571

版权

以爬取简书首页标题为例

# coding:utf-8
import requests
from bs4 import BeautifulSoup

# 简书首页title爬取
class SoupSpider:
    def __init__(self):
        self.session = requests.Session()

    def jian_shu_spider(self, url, headers):
        response = requests.get(url, headers=headers).text
        # 将获取到的内容转换成BeautifulSoup格式
        soup = BeautifulSoup(response, "lxml")
        # 查找所有class="title"的语句
        title_list = soup.find_all(class_= "title")
        for tit in title_list:
            title = tit.text
            print("文章标题：{}".format(title))

if __name__ == '__main__':
    soup_spider = SoupSpider()
    soup_spider.jian_shu_spider(
        "http://www.jianshu.com",
        {
        "Referer": "https://www.jianshu.com/",
        "User-Agent": "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML

最低0.47元/天解锁文章

karry_孙二

关注

0
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
Python爬虫-7-BeautifulSoup简单案例

以爬取简书首页标题为例# coding:utf-8import requestsfrom bs4 import BeautifulSoup# 简书首页title爬取class SoupSpider: def __init__(self): self.session = requests.Session() def jian_shu_spider(s...
复制链接

扫一扫