python获取html中js,如何通过Selenium / Python获取JavaScript编写的html内容[复制]

Tatasisy

于 2021-06-09 14:33:30 发布

阅读量427

点赞数

文章标签： python获取html中js

参见英文答案 >

Get HTML Source of WebElement in Selenium WebDriver using Python 13个

我正在使用Selenium进行网络爬行,我希望在Selenium模拟点击虚假链接后获得由JavaScript编写的元素(例如链接).

我尝试了get_html_source(),但它不包含JavaScript编写的内容.

我编写的代码：

def test_comment_url_fetch(self):

sel = self.selenium

sel.open("/rmrb")

url = sel.get_location()

#print url

if url.startswith('http://login'):

sel.open("/rmrb")

i = 1

while True:

try:

if i == 1:

sel.click("//div[@class='WB_feed_type SW_fun S_line2']/div/div/div[3]/div/a[4]")

print "click"

else:

XPath = "//div[@class='WB_feed_type SW_fun S_line2'][%d]/div/div/div[3]/div/a[4]"%i

sel.click(XPath)

print "click"

except Exception, e:

print e

break

i += 1

html = sel.get_html_source()

html_file = open("tmp\\foo.html", 'w')

html_file.write(html.encode('utf-8'))

html_file.close()

我使用while循环来点击一系列虚假链接,触发js-actions来显示额外的内容,而这些内容就是我想要的.但是sel.get_html_source()没有给出我想要的东西.

有人可以帮忙吗？非常感谢.

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python获取html中js,如何通过Selenium / Python获取JavaScript编写的html内容[复制]

参见英文答案 >Get HTML Source of WebElement in Selenium WebDriver using Python13个我正在使用Selenium进行网络爬行,我希望在Selenium模拟点击虚假链接后获得由JavaScript编写的元素(例如链接).我尝试了get_html_source(...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。