[PY4E] Scraping HTML Data with BeautifulSoup

最新推荐文章于 2024-09-09 23:28:21 发布

ChaChi0327

最新推荐文章于 2024-09-09 23:28:21 发布

阅读量292

点赞数 1

分类专栏： PY4E 文章标签： python

本文链接：https://blog.csdn.net/weixin_40299908/article/details/107781382

版权

PY4E 专栏收录该内容

3 篇文章 0 订阅

订阅专栏

from urllib.request import urlopen
from bs4 import BeautifulSoup
import ssl

# Ignore SSL certificate errors
ctx = ssl.create_default_context()
ctx.check_hostname = False
ctx.verify_mode = ssl.CERT_NONE

url = 'http://py4e-data.dr-chuck.net/comments_745662.html'
html = urlopen(url, context=ctx).read()
soup = BeautifulSoup(html, "html.parser")
tags = soup('span')

lst = list()
for tag in tags:
# Look at the parts of a tag
    num = int(tag.contents[0])
    lst.append(num)
    
print(sum(lst))

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

ChaChi0327

关注关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
[PY4E] Scraping HTML Data with BeautifulSoup

from urllib.request import urlopenfrom bs4 import BeautifulSoupimport ssl# Ignore SSL certificate errorsctx = ssl.create_default_context()ctx.check_hostname = Falsectx.verify_mode = ssl.CERT_NONEurl = 'http://py4e-data.dr-chuck.net/comments_745662
复制链接

扫一扫