Python爬虫——下载音乐
1.调用需要库
安装库:
Windows + R 输入cmd 后 执行下面这两行
pip install requests -i https://pypi.tuna.tsinghua.edu.cn/simple ( -i 和后面的链接有没有都可以)
pip install pyexecjs -i https://pypi.tuna.tsinghua.edu.cn/simple
import requests # 请求模块
import execjs # 调用js代码模块
import time # 时间模块
2. 完整代码
代码中的 md5.js 是加mi 文件,必须要有这个文件代码才能正常执行
import requests
import time
import execjs
time_stamp = int(time.time()*1000)
music = input("请输入要下载的歌名:")
json = f"NVPh5oo715z5DIWAeQlhMDsWXXQV4hwtbitrate=0callback=callback123clienttime={time_stamp}clientver=2000dfid=-inputtype=0iscorrection=1isfuzzy=0keyword={music}mid={time_stamp}page=1pagesize=30platform=WebFilterprivilege_filter=0srcappid=2919token=userid=0uuid={time_stamp}NVPh5oo715z5DIWAeQlhMDsWXXQV4hwt"
node = execjs.get()
ctx = node.compile(open('md5.js', mode='r', encoding='utf-8').read())
funcname = f'getpwd("{json}")'
md5 = ctx.eval(funcname)
url_search = f'https://complexsearch.kugou.com/v2/search/song?callback=callback123&keyword={music}&page=1&pagesize=30&bitrate=0&isfuzzy=0&inputtype=0&platform=WebFilter&userid=0&clientver=2000&iscorrection=1&privilege_filter=0&token=&srcappid=2919&clienttime={time_stamp}&mid={time_stamp}&uuid={time_stamp}&dfid=-&signature={md5}'
headers = {
'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.0.0 Safari/537.36',
'referer': 'https://www.kugou.com/',
}
resp = requests.get(url_search, headers=headers).text
str_call = resp.strip('callback123')
dic = eval(str_call)
try:
for i in range(100):
Hash = dic['data']['lists'][i]['FileHash']
AlbumID = dic['data']['lists'][i]['AlbumID']
ID = dic['data']['lists'][i]['ID']
FileName = dic['data']['lists'][i]['FileName']
music_url = f'https://wwwapi.kugou.com/yy/index.php?r=play/getdata&hash={Hash}&dfid=3x7tbK1hO4na41SmwR3COpKL&appid=1014&mid=097671c0efd4b8b448626c3399dad797&platid=4&album_id={AlbumID}&album_audio_id={ID}&_={time_stamp}'
print(FileName, '\n' + music_url, '\n')
except:
pass
download_music = input('请选择要下载的歌曲链接复制到这里:')
resp_music = requests.get(download_music)
dic = resp_music.json()
name = dic['data']['audio_name']
href = dic['data']['play_url']
resp_mp3 = requests.get(href, headers=headers)
with open(r'歌曲/' + name + '.mp3', mode='wb') as f:
f.write(resp_mp3.content)
print('下载完成')
3.分析代码
第一步
定义时间戳
第二步
破解MD5 加密,然后把破解出来的MD5数据拼接到url链接中
第三步
对拼接好的url地址进行请求,请求后拿到每一首歌曲的hash值和id
第四步
把hash值和id拼接到要下载歌曲url中进行请求,就可以拿到MP3地址
第五步
最后保存歌曲
4.结语
- 有些歌曲只能试听60秒,所以下载下来的歌曲也是60秒,其它歌曲正常下载哦
- 有没有大佬知道怎么破解,欢迎在下面留言,嘿嘿