Scraping Dynamic Web Pages in Python with Selenium

Latest recommended article published on 2024-06-18 18:14:10

秦艽

Categories: install, python. Tags: python, selenium, html, css, edge

Article link: https://blog.csdn.net/qq_40279151/article/details/106319024

Install Selenium

Install the browser driver

https://www.cnblogs.com/wenchaoz/p/7875365.html

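Code

As an example, scrape a problem entry from the PTA website.

Fill in the path to your own browser driver.
Note that what you find is a WebElement object, not HTML.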

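import time

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.edge.service import Service

url = "https://pintia.cn/problem-sets/994805260223102976/problems/type/7"

# init browser: point Service at your own msedgedriver.exe (Selenium 4 style)
driver = webdriver.Edge(service=Service(executable_path=r'E:\VirtualDesktop\code\pyCode\msedgedriver.exe'))
driver.get(url)
time.sleep(3)  # crude fixed wait so the JavaScript-rendered table can load

# get data: find_element returns a WebElement, not an HTML string
html = driver.find_element(By.CSS_SELECTOR, "div.DataTableContainer_3cQiI > table > tbody > tr:nth-child(1) > td:nth-child(3)")
print(type(html))  # the WebElement class, despite the variable name
print(html.text)   # visible text of the matched table cell

driver.quit()  # close the browser when done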
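
The fixed time.sleep(3) can be too short on a slow connection and wasteful on a fast one. Below is a minimal sketch of a polling wait, together with a helper that makes the WebElement-versus-HTML distinction concrete. The names wait_for_element and text_and_html are hypothetical helpers, not part of Selenium. They rely only on driver.find_elements(by, value) and element.get_attribute("outerHTML"). The plain string "css selector" is the value that By.CSS_SELECTOR stands for, used here so the sketch has no import beyond the standard library. Selenium also ships its own WebDriverWait, which is probably the better choice in real code.

```python
import time

# Locator strategy string; selenium's By.CSS_SELECTOR has this same value.
CSS = "css selector"


def wait_for_element(driver, selector, timeout=10.0, poll=0.5):
    """Poll the page until `selector` matches, then return the first match.

    Hypothetical helper: raises TimeoutError if nothing matches in time.
    """
    deadline = time.monotonic() + timeout
    while True:
        # find_elements returns an empty list instead of raising when nothing matches
        matches = driver.find_elements(CSS, selector)
        if matches:
            return matches[0]
        if time.monotonic() >= deadline:
            raise TimeoutError(f"no element matched {selector!r} within {timeout}s")
        time.sleep(poll)


def text_and_html(element):
    """Return (visible text, HTML markup) for a WebElement.

    The element itself is a handle to a node in the live page; the markup
    has to be requested explicitly through the outerHTML attribute.
    """
    return element.text, element.get_attribute("outerHTML")
```

With the driver from the code above, this could be used as cell = wait_for_element(driver, "td:nth-child(3)") in place of the sleep followed by find_element, and then text, markup = text_and_html(cell). The shortened selector here is only for illustration.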
