python简单爬取b站视频弹幕
url:"https://comment.bilibili.com/139527441.xml"
代码:
import imageio
import jieba as jieba
import requests
import pandas as pd
from lxml import etree
url = "https://comment.bilibili.com/139527441.xml"
# 发送请求
response = requests.get(url)
xml = etree.fromstring(response.content)
# 解析数据
barrage = xml.xpath("/i/d/text()")
# 把列表转换成DataFrame
barrage_df = pd.DataFrame(barrage, columns=['弹幕内容'])
# 保存到本地
barrage_df.to_csv("./data/Barrage.csv", encoding='utf_8_sig')
将爬取的弹幕内容保存至csv文件