Fetching Douban's New Books List with Python

Latest recommended article published on 2024-05-08 09:02:47

寂寞灵魂

Category: Python. Tags: python

Article link: https://blog.csdn.net/riverflowrand/article/details/51008533

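# -*- encoding:utf-8 -*-

from bs4 import BeautifulSoup
import urllib.request

URL = 'https://book.douban.com/latest?icn=index-latestbook-all'

# Assumption: Douban may reject requests that carry urllib's default User-Agent,
# so a browser-like value is sent here; adjust it as needed.
request = urllib.request.Request(URL, headers={'User-Agent': 'Mozilla/5.0'})
with urllib.request.urlopen(request) as response:
    page = response.read().decode('utf-8')

soup = BeautifulSoup(page, 'html.parser')
# "class" is a reserved word in Python, so passing it as a keyword argument is a syntax error.
# Since Beautiful Soup 4.1.1, the class_ argument searches for tags with a given CSS class name.
books = soup.find_all('div', class_='detail-frame')
for item in books:
    # The <h2> holds the title; get_text() also handles a title wrapped in a nested tag.
    title = item.find('h2')
    if title is not None:
        print(title.get_text(strip=True))
    # The <p> tags hold the remaining details of the book.
    for paragraph in item.find_all('p'):
        print(paragraph.get_text(strip=True))
    print('*********************************')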
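
The parsing step can be tried without touching the network. The following is a minimal sketch, not the page's actual markup: SAMPLE_HTML is made-up sample data that only mimics the 'detail-frame' blocks the script searches for, and parse_books is a hypothetical helper name introduced here for illustration. It shows the class_ search returning structured data instead of printing.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; the real Douban page may be structured differently.
SAMPLE_HTML = """
<div class="detail-frame">
  <h2><a href="#">Sample Book One</a></h2>
  <p class="color-gray">Author A / Publisher X / 2016-3</p>
  <p class="detail">A short description.</p>
</div>
<div class="detail-frame">
  <h2><a href="#">Sample Book Two</a></h2>
  <p class="color-gray">Author B / Publisher Y / 2016-4</p>
</div>
<div class="other">Not a book entry</div>
"""


def parse_books(html):
    """Return one dict per 'detail-frame' block: its title and paragraph texts."""
    soup = BeautifulSoup(html, 'html.parser')
    books = []
    # class_ (with a trailing underscore) avoids the reserved word "class".
    for frame in soup.find_all('div', class_='detail-frame'):
        heading = frame.find('h2')
        books.append({
            'title': heading.get_text(strip=True) if heading else '',
            'details': [p.get_text(strip=True) for p in frame.find_all('p')],
        })
    return books


if __name__ == '__main__':
    for book in parse_books(SAMPLE_HTML):
        print(book['title'], book['details'])
```

Returning a list of dicts keeps the extraction separate from the output, so the same function could be pointed at the downloaded page text and the results saved or filtered rather than only printed.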
