Python 3 - Getting text from a tag in BeautifulSoup

Latest recommended article published on 2021-03-25 22:36:49

weixin_39617502

Tags: python is beautiful

I am using BeautifulSoup to extract data from a website. The text on that page changes every time it is reloaded, so I want to locate the element by its class name, which stays the same, rather than by its text, which does not.

import requests
from bs4 import BeautifulSoup

url = 'xxxxxxxxxxx'
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')
class2 = soup.find_all(True, class_="template_title")
print(class2)

This prints the whole matching element, tag and attributes included.

When the page reloads, the class name still selects the same area, but I do not know how to print only the text (which in this case is: 4).

Once this is figured out, I have another question: if several tags share the class, is there a way to match on more of the static attributes, so that only the text I am looking for is printed and nothing else? (I have the class, but could I use height="50" valign="bottom" width="535" as well?)

Solution

You can use the text or string attribute of the element.

elems = soup.find_all(True, class_='template_title')
print([elem.string for elem in elems])
# prints `['4']` for the given html snippet
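The two attributes behave differently once a tag holds more than plain text, so a small comparison may help. The sketch below runs on a made-up HTML string rather than the real page: the `<td>` tag name and the second cell are assumptions added for illustration, and only the class name and the text 4 come from the question.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the page; the <td> tag name is assumed.
html = """
<table><tr>
  <td class="template_title" height="50" valign="bottom" width="535">4</td>
  <td class="template_title"><b>Total:</b> 7</td>
</tr></table>
"""

soup = BeautifulSoup(html, "html.parser")
elems = soup.find_all(True, class_="template_title")

# .string is None when a tag has more than one child node.
strings = [elem.string for elem in elems]

# get_text() collects every text node inside the tag; here the pieces are
# stripped and joined with a single space.
texts = [elem.get_text(" ", strip=True) for elem in elems]

print(strings)
print(texts)
```

If the target tag might contain nested markup, `get_text()` is probably the safer choice, since `.string` quietly returns `None` in that case.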

Specify more attributes as needed:

elems = soup.find_all(True, class_='template_title',
                      height='50', valign='bottom', width='535')
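To see the attribute filter above narrow a result set, here is a minimal sketch on invented markup. The `<td>` tag name and the second row are assumptions; only the class and the three attribute values come from the question. It passes the same constraints as an `attrs` dictionary and also as a CSS selector through `select()`, which is an alternative to the keyword form used in the answer, not a replacement for it.

```python
from bs4 import BeautifulSoup

# Hypothetical markup; only the class and attribute values come from the question.
html = """
<table>
  <tr><td class="template_title" height="50" valign="bottom" width="535">4</td></tr>
  <tr><td class="template_title" height="20" valign="top" width="535">ignore me</td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")

# attrs maps each attribute name to the value it must have (compared as strings).
wanted = {
    "class": "template_title",
    "height": "50",
    "valign": "bottom",
    "width": "535",
}
elems = soup.find_all(True, attrs=wanted)
texts = [elem.get_text(strip=True) for elem in elems]

# The same filter written as a CSS selector.
css_elems = soup.select('.template_title[height="50"][valign="bottom"][width="535"]')
css_texts = [elem.get_text(strip=True) for elem in css_elems]

print(texts, css_texts)
```

Both calls skip the second cell because its height and valign differ, even though it shares the class.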
