爬取豆瓣评论
BeautifulSoup得到网页信息;
get_url进行翻页操作;
jieba进行单词分割;
wordcloud进行词云处理;
matplotlib进行图形绘制;
from bs4 import BeautifulSoup
import requests
import pandas
def get_url(n):
url="https://music.douban.com/subject/35093585/comments/hot?p="
url=url+str(n)
return url
l=[]
for page in range(1,26):
r=requests.get(get_url(page),headers={
"User-agent":