爬虫-python网络数据收集学习-第3章-心得随笔20180419

白夜繁华尽

于 2018-04-19 21:25:11 发布

阅读量488

点赞数

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_38264564/article/details/80011120

版权

按书上运行程序时出现问题：

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("https://www.baidu.com/")
bsObj = BeautifulSoup(html)
for link in bsObj.findAll("a"):
    if 'href' in link.atters:
        print(link.atters['href'])

报错为：

E:\practicework\scrapingtest\ll_env\lib\site-packages\bs4\__init__.py:181: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

The code that caused this warning is on line 5 of the file test20170419.py. To get rid of this warning, change code that looks like this:

 BeautifulSoup(YOUR_MARKUP})

to this:

 BeautifulSoup(YOUR_MARKUP, "html.parser")

在bsObj行添加"html.parser"才能解决问题。

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("https://www.baidu.com/")
bsObj = BeautifulSoup(html,'html.parser')
for link in bsObj.findAll("a"):
    if 'href' in link.atters:
        print(link.atters['href'])

这是用来指定解析器。

遇到错误如：

TypeError: argument of type 'NoneType' is not iterable
'NoneType' object has no attribute 'findAll'

都有可能意味着，没有找到相应的目标而返回了None。

白夜繁华尽

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
爬虫-python网络数据收集学习-第3章-心得随笔20180419

按书上运行程序时出现问题：from urllib.request import urlopenfrom bs4 import BeautifulSouphtml = urlopen("https://www.baidu.com/")bsObj = BeautifulSoup(html)for link in bsObj.findAll("a"): if 'href' in lin...
复制链接

扫一扫

白夜繁华尽 CSDN认证博客专家 CSDN认证企业博客

码龄7年

21: 原创

52万+: 周排名

5万+: 总排名

4万+: 访问

: 等级

650: 积分

56: 粉丝

48: 获赞

61: 评论

156: 收藏

私信

关注

分类专栏

最新评论

Android Studio关于Error：moudle not specified
Zhave123: Sync Project with Gradle File同步怎么同步？
简单小爬虫爬取招标信息
m0_68248391: 为什么我这里显示是时间错误呢？我不是特别懂
Flutter Json解析工具
CSDN-Ada助手: 恭喜您撰写了第20篇博客！标题为"Flutter Json解析工具"的博客非常引人注目。您对于Flutter Json解析的工具应用进行了深入而清晰的探讨，真是令人赞叹。在接下来的创作中，我建议您可以考虑探讨如何优化Json解析过程中的性能问题，或者分享一些实用的技巧，帮助读者更好地理解和应用这些工具。当然，这只是一个建议，您的博客已经非常出色了！继续保持创作的热情和耐心，期待您的下一篇博客！
win10系统配置Mask DINO经验总结
CSDN-Ada助手: 恭喜您写了第19篇博客！标题中的"win10系统配置Mask DINO经验总结"听起来非常有趣和实用。您的持续创作非常值得赞赏，非常感谢您分享关于系统配置和Mask DINO的经验总结。在下一步的创作中，我建议您可以考虑深入探讨Mask DINO的更多细节，比如它与其他系统配置的比较、如何解决可能遇到的问题以及一些高级技巧。这样的创作将进一步丰富读者的知识，帮助他们更好地理解和使用Mask DINO。再次感谢您的分享，期待您未来更多的博客文章！
Flutter2+Dart爬坑之高德地图api导入和初级使用（更新：适配3.0.0插件使用）
elfaman01: 挺好用

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。