How do I extract text from an HTML paragraph? Why can't I get the text out of a P paragraph?

扬云飞

Published 2021-05-31 22:51:46


# coding:utf8
from bs4 import BeautifulSoup
import re

html_doc = """
<html><head><title>The Dormouse's story</title></head>
<body>
<p class="title"><b>The Dormouse's story</b></p>
<p class="story">Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.</p>
<p class="story">...</p>
"""

soup = BeautifulSoup(html_doc, 'html.parser', from_encoding='utf-8')

print 'Get all links'
links = soup.find_all('a')
for link in links:
    print link.name, link['href'], link.get_text()

print 'Get the lacie link'
link_node = soup.find('a', href='http://example.com/lacie')
print link_node.name, link_node['href'], link_node.get_text()

print 'Regex match'
link_node = soup.find('a', href=re.compile(r"ill"))
print link_node.name, link_node['href'], link_node.get_text()

print 'p paragraph text'
p_node = soup.find('a', class_="title")
print p_node.name, p_node.get_text()

Running it raises the following error:

Get all links
a http://example.com/elsie Elsie
a http://example.com/lacie Lacie
a http://example.com/tillie Tillie
Get the lacie link
a http://example.com/lacie Lacie
Regex match
a http://example.com/tillie Tillie
p paragraph text
Traceback (most recent call last):
  File "C:\Users\Administrator\workspace\2.7\66\test_bs4.py", line 34, in <module>
    print p_node.name, p_node.get_text()
AttributeError: 'NoneType' object has no attribute 'name'
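
The traceback suggests a likely cause: the last lookup calls soup.find('a', class_="title"), and find() returns None when no tag matches. The sketch below assumes the markup is the standard sample from the Beautiful Soup documentation, where class="title" sits on a <p> tag rather than on an <a> tag. It is written for Python 3 (print as a function), unlike the 2.7 script above, and the variable names missing and paragraph_texts are only illustrative.

```python
from bs4 import BeautifulSoup

# Assumed markup: a trimmed copy of the Beautiful Soup documentation sample.
html_doc = """
<html><head><title>The Dormouse's story</title></head>
<body>
<p class="title"><b>The Dormouse's story</b></p>
<p class="story">Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.</p>
</body></html>
"""

# html_doc is already a str here, so from_encoding is not passed.
soup = BeautifulSoup(html_doc, 'html.parser')

# No <a> tag carries class="title", so find() returns None.
missing = soup.find('a', class_='title')

# Searching for a <p> tag with that class finds the paragraph.
p_node = soup.find('p', class_='title')

# Guard against None before touching attributes of the result.
if p_node is not None:
    print(p_node.name, p_node.get_text())

# Text of every <p> paragraph in the document, in order.
paragraph_texts = [p.get_text() for p in soup.find_all('p')]
```

Checking the result of find() against None before using it avoids the AttributeError even when the selector is wrong, which makes this kind of mismatch easier to spot.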
