python requests dynamic loading: wait for the page to load, then get the data with requests.get in Python 3...

Latest recommended article published on 2023-10-16 16:16:34

weixin_39654352

Article tags: python requests dynamic loading

I have a page whose source I need to parse with BS4, but the middle of the page takes about 1 second (maybe less) to load its content, and requests.get captures the page source before that section loads. How can I wait a second before getting the data?

r = requests.get(URL + self.search, headers=USER_AGENT, timeout=5 )

soup = BeautifulSoup(r.content, 'html.parser')

a = soup.find_all('section', 'wrapper')

Solution

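This doesn't look like a waiting problem. It looks like the element is being created by JavaScript, and requests can't handle elements that JavaScript generates dynamically: it only returns the HTML the server sends and never runs scripts. A suggestion is to use Selenium with PhantomJS to get the rendered page source, then parse it with BeautifulSoup: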

from bs4 import BeautifulSoup

from selenium import webdriver

url = "http://legendas.tv/busca/walking%20dead%20s03e02"

browser = webdriver.PhantomJS()

browser.get(url)

html = browser.page_source

browser.quit()  # close the browser once the rendered source has been captured

soup = BeautifulSoup(html, 'lxml')

a = soup.find('section', 'wrapper')

Also, there's no need to use .findAll (find_all) if you are looking for only one element; .find returns the first match.
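
PhantomJS support was removed in Selenium 4, so webdriver.PhantomJS() is not available on current Selenium releases. The following is a minimal sketch of the same idea with headless Chrome, assuming Chrome and a recent Selenium are installed. The helper names count_sections and fetch_rendered_html are hypothetical, and the section.wrapper selector is inferred from the question's code. count_sections uses only the standard library and lets you confirm that the section is missing from the raw requests response before reaching for a browser. fetch_rendered_html uses an explicit wait, which returns as soon as the section appears rather than sleeping for a fixed second.

```python
from html.parser import HTMLParser


class _SectionCounter(HTMLParser):
    """Counts <section> start tags whose class attribute contains class_name."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.count = 0

    def handle_starttag(self, tag, attrs):
        if tag != "section":
            return
        # The class attribute may be missing or valueless (None).
        classes = (dict(attrs).get("class") or "").split()
        if self.class_name in classes:
            self.count += 1


def count_sections(html, class_name="wrapper"):
    """Return how many <section class="..."> tags in html carry class_name.

    A result of 0 for the text returned by requests.get suggests the
    section is added later by JavaScript.
    """
    parser = _SectionCounter(class_name)
    parser.feed(html)
    parser.close()
    return parser.count


def fetch_rendered_html(url, timeout=10):
    """Load url in headless Chrome and return the source once the section exists.

    Assumption: the target element matches the CSS selector "section.wrapper".
    """
    # Imported here so count_sections stays usable without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        # Blocks until the element is in the DOM; raises TimeoutException
        # if it has not appeared after `timeout` seconds.
        WebDriverWait(driver, timeout).until(
            EC.presence_of_element_located((By.CSS_SELECTOR, "section.wrapper"))
        )
        return driver.page_source
    finally:
        driver.quit()
```

A typical flow would be to call count_sections(r.text) on the requests response first and, if it returns 0, pass the string returned by fetch_rendered_html(url) to BeautifulSoup exactly as in the answer's code.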
