编写一个爬取影评的爬虫程序
用的是python3.8、pycharm
先附上代码
import requests,re
def getOneSortList():
response = requests.get('https://www.sbkk88.com/a/dianyingpinglun/')
response.encoding = 'utf-8'
html = response.text
reg = r'<a href="(.*?)"><div class="x6"'
onesortlist = re.findall(reg,html)
return onesortlist
def getContent(url):
response = requests.get(url)
response.encoding = 'utf-8'
html = response.text
reg = r' <div class="content_left">([\S\s]*?)<div class="zk3"></div>'
return re.findall(reg,html)
for contenturl in getOneSortList():
num = contenturl.split('.')
num[0] = 'https://www'
url1 = num[0] + '.' + num[1] + '.' + num[2] + '.' + num[3]
contenturl = url1
for novelcontent in getContent(contenturl):
print(novelcontent)
用到了正则,导入了request和re模块。
运行结果
第一次发博不太熟悉
注意:本内容为本人原创,转载需标注来源