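By chance I joined a WeChat group and sat in on a few open classes. They gave me a new understanding of web crawlers. I have been studying them for the past few days and have made a little progress.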
Here is a bit of pyth
# https://music.163.com/#/discover/toplist
# <li><a href="/song?id=2602622179">青春的颜色 (Live版)</a>
# The page to process, and a sample of the list items it contains.

# Import modules.
import os

import requests
import bs4

# Disguise the request as coming from a normal browser.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.6261.95 Safari/537.36',
}

# The download function.
def down_song(songname, songid):
    baseurl = 'http://music.163.com/song/media/outer/url?id=' + songid
    # Named resp rather than re, so it cannot shadow the standard re module.
    resp = requests.get(url=baseurl, headers=headers)
    with open(f'./music/{songname}.mp3', 'wb') as file:
        file.write(resp.content)

# Drop the '/#' so the request reaches /discover/toplist itself.
url = 'https://music.163.com/#/discover/toplist'
url1 = url.replace('/#', '')

res = requests.get(url1, headers=headers)
res.encoding = 'utf-8'

# Create the output directory if it does not exist yet.
os.makedirs('./music', exist_ok=True)

soup = bs4.BeautifulSoup(res.text, 'lxml')
song_list = soup.select("ul.f-hide li a")
for song in song_list:
    songid = song['href'].split('=')[-1]
    songname = song.text
    down_song(songname, songid)
This finds each song's ID and name.
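The ID and name step can be made a little more defensive. What follows is only a minimal sketch, not the method used above: `extract_song_id` and `safe_filename` are hypothetical helper names of my own, and they use only the standard library. The first reads the `id` query parameter instead of splitting on `=`, and the second replaces characters that cannot appear in a file name, since a song title is used directly as the `.mp3` file name.

```python
import re
from urllib.parse import urlparse, parse_qs


def extract_song_id(href):
    """Return the value of the `id` query parameter, or None if it is absent."""
    query = parse_qs(urlparse(href).query)
    ids = query.get('id')
    return ids[0] if ids else None


def safe_filename(name):
    """Replace characters that are not allowed in common file systems with '_'."""
    return re.sub(r'[\\/:*?"<>|]', '_', name).strip()


if __name__ == '__main__':
    # Sample href taken from the list item shown at the top of the script.
    print(extract_song_id('/song?id=2602622179'))
    # A made-up title containing characters that would break open().
    print(safe_filename('AC/DC: Live?'))
```

In the loop, one could then skip any link for which `extract_song_id` returns `None` and pass `safe_filename(song.text)` to `down_song`.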
Let's learn and grow together.