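I am writing a script that crawls a site and downloads mp3 files. It uses BeautifulSoup and {} to follow every link into the individual artist pages. However, some of the scraped URLs contain more than one "." and when I call request.get(url, header=) on them, the file is not downloaded. How can I fix this?
For example: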
^{pr2}$
Code:

import os

import fake_useragent


def download_mp3(url_list_file, download_dir):
    # Create the user-agent provider once instead of once per URL
    ua = fake_useragent.UserAgent(verify_ssl=False)
    with open(url_list_file, 'r', encoding='utf-8') as urls:
        for line in urls:
            # Strip the trailing newline so it ends up in neither the URL nor the file name
            url = line.strip()
            if not url:
                continue
            #header = {'User-Agent': str(ua.chrome.random)}
            # Assumes the artist is path segment 4 and the song is segment 6 of the split URL
            artist_dir = url.split('/')[4]
            song_name = url.split('/')[6]
            #corrected_url = ('path:url')
            download_to = os.path.join(download_dir, artist_dir, song_name)
            save_path = os.path.join(download_dir, artist_dir)
            print(url)
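Extra dots are legal in a URL path, so they are probably not the cause by themselves. Two things that may be worth ruling out are the trailing newline that each line read from the file carries, and unescaped characters such as spaces in the song name. Below is a minimal sketch of cleaning a URL before it is requested, using only the standard library. The helper names `normalize_url` and `target_path`, the `example.com` URL, and the assumed path layout (artist in the second path segment, song in the last) are all hypothetical and need adjusting to the real site.

```python
import os
from urllib.parse import quote, unquote, urlsplit, urlunsplit


def normalize_url(raw_url):
    """Strip surrounding whitespace and percent-encode unsafe path characters."""
    parts = urlsplit(raw_url.strip())
    # Unquote first so an already-encoded URL is not encoded a second time
    safe_path = quote(unquote(parts.path), safe="/")
    return urlunsplit(
        (parts.scheme, parts.netloc, safe_path, parts.query, parts.fragment)
    )


def target_path(raw_url, download_dir):
    """Build the local file path from the artist and song path segments."""
    # Assumed layout: /<section>/<artist>/<section>/<song>
    segments = urlsplit(raw_url.strip()).path.strip("/").split("/")
    artist_dir = unquote(segments[1])
    song_name = unquote(segments[-1])
    return os.path.join(download_dir, artist_dir, song_name)


# Hypothetical line as it would be read from the URL list file
line = "http://example.com/artists/Some.Artist/songs/My Song.feat.X.mp3\n"
print(normalize_url(line))
print(target_path(line, "downloads"))
```

The cleaned URL can then be passed to the HTTP call, and the directory part of the target path created with `os.makedirs(..., exist_ok=True)` before the file is written.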