【PythonPlanet】Web Scraper: Movies

Latest recommended article published on 2024-07-12 16:16:27

海林Lin

Category column: PythonPlanet  Tags: python html

Article link: https://blog.csdn.net/weixin_42814182/article/details/107816156

Douban Movie Top 250

import requests
from bs4 import BeautifulSoup

# Send a browser-like User-Agent instead of the default one used by requests
headers = {'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.181 Safari/537.36'}

# The Top 250 list spans 10 pages of 25 movies each (start=0, 25, ..., 225)
for n in range(10):
    # Fetch the page
    url_page = 'https://movie.douban.com/top250?start=' + str(n * 25) + '&filter='
    res_moves = requests.get(url_page, headers=headers)
    # Parse the HTML
    bs_moves = BeautifulSoup(res_moves.text, 'html.parser')
    top_moves = bs_moves.find('ol', class_='grid_view')
    for tops in top_moves.find_all('li'):
        num = tops.find('em').text
        title = tops.find('span', class_='title').text
        # Not every movie has a one-line quote, so fall back to an empty string
        inq = tops.find('span', class_='inq')
        comment = inq.text if inq else ''
        score = tops.find('span', class_='rating_num').text
        url_move = tops.find('a')['href']
        print(num + '.' + title + '--' + comment + '--' + score + '--' + url_move)
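
The parsing step of the script above can be tried without touching the network. The following is only a minimal sketch: the function name `parse_movies` and the `SAMPLE_HTML` snippet are hypothetical, made up to mimic the tags and class names the script looks for (`ol.grid_view`, `em`, `span.title`, `span.inq`, `span.rating_num`). It is not the real Douban markup.

```python
from bs4 import BeautifulSoup

# Hypothetical sample that imitates the structure the selectors expect;
# the real page may differ.
SAMPLE_HTML = """
<ol class="grid_view">
  <li>
    <em>1</em>
    <a href="https://example.com/movie/1"><span class="title">Movie A</span></a>
    <span class="rating_num">9.7</span>
    <span class="inq">A one-line quote</span>
  </li>
  <li>
    <em>2</em>
    <a href="https://example.com/movie/2"><span class="title">Movie B</span></a>
    <span class="rating_num">9.6</span>
  </li>
</ol>
"""


def parse_movies(html):
    """Return one dict per <li> inside <ol class="grid_view">."""
    soup = BeautifulSoup(html, 'html.parser')
    grid = soup.find('ol', class_='grid_view')
    if grid is None:
        # The list is missing, e.g. the request was blocked or the layout changed
        return []
    movies = []
    for item in grid.find_all('li'):
        quote = item.find('span', class_='inq')
        movies.append({
            'num': item.find('em').text,
            'title': item.find('span', class_='title').text,
            'comment': quote.text if quote else '',  # the quote is optional
            'score': item.find('span', class_='rating_num').text,
            'url': item.find('a')['href'],
        })
    return movies


for movie in parse_movies(SAMPLE_HTML):
    print(movie)
```

Keeping the parsing in its own function makes it easy to check the selectors against a saved page before running the full loop over every page.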
