加油鸭接着上个内容,做了一点改进,可以批量爬取知乎用户信息。
实现思路:加一个id数组,循环访问获取id,然后输入到网址中,读取信息。
import requests
# 将浏览器复制出来的requests headers参数赋值给一个变量,一般直接用
headers = {
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/61.0.3163.100 Safari/537.36",
"Connection": "keep-alive",
"Accept": "text/html,application/json,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
"Accept-Language": "zh-CN,zh;q=0.8"}
# 你想要爬取的用户名列表
id_list = ['li-er-dan-12-76','yi-mei-cheng-xu-yuan-89','lao-bai-63-17','qi-e-chi-he-zhi-nan-66']
# 循环读取
for i in range(len(id_list)):
user_id = id_list[i]
# 爬取的API网址
zhihu_api = f'https://www.zhih