Python Crawler: Scraping Douban Top 250 Movie Data

Latest recommended article published on 2024-08-08 14:47:02

Vivinia_Vivinia

Category: python. Tags: python, Douban movie crawler

Article link: https://blog.csdn.net/hester_hester/article/details/106183699

Target result:

Code:
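Before the full listing, here is a minimal sketch of what the compiled patterns defined at the top of the script extract. The five patterns are copied from the listing. The HTML fragment `SAMPLE_ITEM`, the `example.com` URLs, and the helper `parse_item` are hypothetical, invented only for illustration, and are not taken from the real Douban page.

```python
import re

# Patterns copied from the listing in this article.
findLink = re.compile(r'<a href="(.*?)">')
findImgSrc = re.compile(r'<img.*src="(.*?)"', re.S)
findTitle = re.compile(r'<span class="title">(.*)</span>')
findRating = re.compile(r'<span class="rating_num" property="v:average">(.*)</span>')
findJudge = re.compile(r'<span>(\d*)人评价</span>')

# Hypothetical fragment for one movie entry (an assumption, not real page source).
SAMPLE_ITEM = '''
<div class="item">
  <a href="https://example.com/subject/1/">
    <img alt="Example Movie" src="https://example.com/poster1.jpg" class="">
  </a>
  <span class="title">Example Movie</span>
  <span class="rating_num" property="v:average">9.0</span>
  <span>12345人评价</span>
</div>
'''


def parse_item(html):
    """Apply each pattern to one entry's HTML and keep the first match."""
    return {
        "link": findLink.findall(html)[0],
        "image": findImgSrc.findall(html)[0],
        "title": findTitle.findall(html)[0],
        "rating": findRating.findall(html)[0],
        "votes": int(findJudge.findall(html)[0]),
    }


record = parse_item(SAMPLE_ITEM)
print(record)
```

Note that `findImgSrc` combines a greedy `.*` with `re.S`, so on a string containing several `<img>` tags it would span from the first `<img` to the last `src="`. For that reason the sketch applies the patterns to a single entry's HTML at a time.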

# -*- coding: utf-8 -*-
################# Import modules #################
from bs4 import BeautifulSoup  # HTML parsing and data extraction
import re   # regular expressions for text matching
import urllib.request,urllib.error   # specify the URL and fetch the page data
import xlwt   # write Excel files

################# Define variables #################
findLink=re.compile(r'<a href="(.*?)">')   # compiled regex object: movie link pattern
findImgSrc=re.compile(r'<img.*src="(.*?)"',re.S)   # poster image link pattern
findTitle=re.compile(r'<span class="title">(.*)</span>')   # movie title
findRating=re.compile(r'<span class="rating_num" property="v:average">(.*)</span>')   # movie rating
findJudge=re.compile(r'<span>(\d*)人评价</span>')   # number of ratings ("人评价" is the page text for "people rated")
findInq=re.compile(r'&l