Web scraper: Douban Music Top 250

Scrape the Douban Music Top 250 chart and write the results to a file (an .xls spreadsheet).

# Only the modules the script actually uses are imported.
from urllib.request import Request, urlopen
import bs4
import xlwt

# Create the workbook and write the header row (row 0).
workbook = xlwt.Workbook(encoding='utf-8')
worksheet = workbook.add_sheet('sheet1', cell_overwrite_ok=True)
worksheet.write(0, 0, label='Title')
worksheet.write(0, 1, label='Artist')
worksheet.write(0, 2, label='Release date')
worksheet.write(0, 3, label='Genre')
worksheet.write(0, 4, label='Rating')
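z = 1  # next worksheet row to write (row 0 holds the headers)
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.105 Safari/537.36'}

# Assumes the chart is paginated 25 entries per page, so `start`
# steps through 0, 25, ..., 225 to cover all 250 entries.
for start in range(0, 250, 25):
    url = 'https://music.douban.com/top250?start={}'.format(start)
    ret = Request(url=url, headers=header)
    html = urlopen(ret)
    bs = bs4.BeautifulSoup(html, 'html.parser')

    # Each entry's details sit in a <div class="pl2"> block.
    alls = bs.find_all('div', {'class': 'pl2'})
    for x in alls:
        # Collapse the newlines and padding inside the link text.
        music_name = ' '.join(x.find('a').get_text().split())
        # The <p> line is slash-separated: first field is the artist,
        # second the release date, last the genre.
        leixing = [part.strip() for part in x.find('p').get_text().split('/ ')]
        score = x.find('span', {'class': 'rating_nums'}).get_text()
        singer = leixing[0]
        release_time = leixing[1]
        music_type = leixing[-1]  # renamed so the builtin `type` is not shadowed

        worksheet.write(z, 0, music_name)
        worksheet.write(z, 1, singer)
        worksheet.write(z, 2, release_time)
        worksheet.write(z, 3, music_type)
        worksheet.write(z, 4, score)
        z += 1

workbook.save('yinyue.xls')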
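
The script reads the artist, release date, and genre by position from a slash-separated line, which fails with an IndexError if an entry has fewer fields than expected. A minimal sketch of a more defensive split is shown below. The helper name `split_info` and the sample string are hypothetical and used only for illustration; they do not come from the original script or from real Douban data.

```python
def split_info(text):
    """Split a slash-separated info line into (singer, release_time, music_type).

    Hypothetical helper: missing fields come back as empty strings
    instead of raising IndexError.
    """
    # Split on '/ ' as the original script does, then drop padding and empty parts.
    parts = [part.strip() for part in text.split('/ ') if part.strip()]
    singer = parts[0] if parts else ''
    release_time = parts[1] if len(parts) > 1 else ''
    music_type = parts[-1] if len(parts) > 2 else ''
    return singer, release_time, music_type


# Made-up sample line, padded the way scraped text often is.
sample = "\n    Some Artist / 2008-06-17 / Album / CD / Pop\n  "
singer, release_time, music_type = split_info(sample)
print(singer, release_time, music_type)
```

Inside the loop, `split_info(x.find('p').get_text())` could stand in for the three index lookups, so a malformed entry would produce blank cells rather than stop the run.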