python爬虫入门笔记--知乎发现（爬取失败了）

最新推荐文章于 2024-04-30 13:30:30 发布

绿头龙

最新推荐文章于 2024-04-30 13:30:30 发布

阅读量1.7k

点赞数

分类专栏：爬虫文章标签： python 爬虫

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_40927436/article/details/92699860

版权

爬虫专栏收录该内容

6 篇文章 0 订阅

订阅专栏

import urllib.request
import urllib.parse
#401 Unauthorized客户试图未经授权访问受密码保护的页面 所以爬取失败了
url = 'https://www.zhihu.com/api/v3/feed/topstory/recommend?session_token=5ad2f1226d859b5abf6d7d214140e78f&desktop=true&page_number=4&limit=6&action=down&after_id=17 '
page = int(input('请输入你要查找的页数：'))

# page  5    2
# page  11   3
# page  17   4
# page  23   5


form_data={
    'session_token': '5ad2f1226d859b5abf6d7d214140e78f',
    'desktop': 'true',
    'page_number': page+1,
    'limit': '6',
    'action': 'down',
    'after_id': (page*6)-1,
}
headers={
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.131 Safari/537.36',

}

request = urllib.request.Request(url = url,headers = headers)
form_data = urllib.parse.urlencode(form_data).encode()
response = urllib.request.urlopen(request,data=form_data)
print(response.read().decode())
# with open('北京大学微博.html','wb')as fp:
#     fp.write(response.read())

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

绿头龙 CSDN认证博客专家 CSDN认证企业博客

码龄7年

256: 原创

23万+: 周排名

150万+: 总排名

29万+: 访问

: 等级

4468: 积分

93: 粉丝

186: 获赞

54: 评论

691: 收藏

私信

关注

热门文章

分类专栏

Charles 2篇
SonarQube 2篇
Docker 3篇
杂谈 5篇
Maven 1篇
Java 40篇
SSH 1篇
Tomcat 2篇
Jenkins 1篇
Locust 1篇
JVM 16篇
小工具 1篇
设计模式 5篇
IDEA 6篇
计算机网络 3篇
SpringBoot 11篇
MySql 6篇
数据结构 2篇
Java多线程 21篇
Spring5 22篇
JavaSE 29篇
MyBatis 17篇
SpringMVC 4篇
Mybatis-plus 3篇
Redis 5篇
操作系统 2篇
爬虫 6篇
遇到的异常/错误 57篇
算法 14篇

最新评论

Ubuntu使用apt-get的时候提示Package jenkins is not available, but is referred to by another package.
蕞簡單dē漩嵂: 报错了 Ign:1 https://pkg.jenkins.io/debian-stable binary/ InRelease Hit:2 http://kali.download/kali kali-rolling InRelease Get:3 https://pkg.jenkins.io/debian-stable binary/ Release [2,044 B] Get:4 https://pkg.jenkins.io/debian-stable binary/ Release.gpg [833 B] Ign:4 https://pkg.jenkins.io/debian-stable binary/ Release.gpg Reading package lists... Done W: GPG error: https://pkg.jenkins.io/debian-stable binary/ Release: The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 5BA31D57EF5975CA E: The repository 'http://pkg.jenkins.io/debian-stable binary/ Release' is not signed. N: Updating from such a repository can't be done securely, and is therefore disabled by default. N: See apt-secure(8) manpage for repository creation and user configuration details.
如何判断一个对象是否可以被回收？
2301_78415972: 引用计数器算法，引用失效计数器值-1吧
Springboot配置suffix指定mvc视图的后缀
征途黯然.: 感谢博主，你的文章让我得到一些收获！(￣ˇ￣)
.getServletContext()爆红
m0_63305279: 真的有用hhh好厉害
Exception in thread “main“ io.grpc.StatusRuntimeException: UNAVAILABLE: io exception
刘二火: public MyClient(String host, int port) { this(ManagedChannelBuilder.forAddress(host, port) .usePlaintext() .build()); }这里吧，不过我还是报错

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。