python xpath text与attrib

最新推荐文章于 2023-09-22 11:49:30 发布

白叔King

最新推荐文章于 2023-09-22 11:49:30 发布

阅读量957

点赞数

分类专栏： python 文章标签： python 开发语言后端

本文链接：https://blog.csdn.net/weixin_37254196/article/details/121763255

版权

python 专栏收录该内容

85 篇文章 6 订阅

订阅专栏

废话不多说，直接开干！

说明

0.text获取标签包裹数据
或者解释用于html元素文本内容的存取
eg:element.text
1.attrib获取标签内的元素
eg:element.attrib['title'],element.attrib['href']

直接看代码

import asyncio
from pyppeteer import launch
from lxml import etree


async def main():
    browser = await launch()
    page = await browser.newPage()
    await page.goto('https://movie.douban.com/chart')
    await page.waitForXPath('//table//a[@title]')
    doc = etree.HTML(await page.content())
    # t = etree.tostring(doc, encoding="utf-8", pretty_print=True)
    # print(t.decode("utf-8"))
    for element in doc.xpath('//table//p[@class]'):
        print(element.attrib['text'])

    names = [element.attrib['title'] for element in doc.xpath('//table//a[@title]')]

    print('Names: ', names)
    await browser.close()


asyncio.get_event_loop().run_until_complete(main())

在这里插入图片描述

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

白叔King

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
python xpath text与attrib

废话不多说，直接开干！说明0.text获取标签包裹数据或者解释用于html元素文本内容的存取eg:element.text1.attrib获取标签内的元素eg:element.attrib['title'],element.attrib['href']直接看代码import asynciofrom pyppeteer import launchfrom lxml import etreeasync def main(): browser = await launch()
复制链接

扫一扫