Web Crawlers
SOPHIA16527
Fetching dynamic web pages with selenium
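Installing selenium. Reference: https://blog.csdn.net/SOPHIA16527/article/details/118446491?spm=1001.2014.3001.5501
1. Install selenium: pip install selenium
2. Check the Chrome version by entering chrome://version/ in the browser's address bar
3. Download the driver from http://npm.taobao.org/mirrors/chromedriver/, choosing the version that matches the browser, for example: chromedriver_win32
Original · 2021-07-04 00:18:23 · 296 views · 0 comments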
selenium.common.exceptions.WebDriverException: Message: 'chromedriver' executable needs to be in PAT
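Problem
Install selenium: pip install selenium
Creating a Chrome object and running the script raises an error:
    from selenium.webdriver import Chrome
    brower = Chrome()
The error is the WebDriverException shown in the title.
Cause
The browser driver that selenium needs is not installed.
Solution
1. Check the Chrome version: chrome://version/
2. Download the matching driver from http://npm.taobao.org/mirrors/chromedriver/ (chromedriver_win32 was downloaded here)
Original · 2021-07-03 23:22:14 · 253 views · 0 comments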
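The error message says the driver executable must be discoverable on PATH, so a quick check before creating the Chrome object can confirm whether that is the case. The sketch below uses only the standard library; the helper name find_driver and its extra_dir parameter are hypothetical and not part of selenium.

```python
import os
import shutil


def find_driver(name="chromedriver", extra_dir=None):
    """Return the full path of the driver executable, or None if it is not found.

    extra_dir is a hypothetical folder where the downloaded driver was unzipped;
    it is searched before the directories already listed in PATH.
    """
    search_path = os.environ.get("PATH", "")
    if extra_dir:
        search_path = extra_dir + os.pathsep + search_path
    # shutil.which applies the same lookup rules the operating system uses
    return shutil.which(name, path=search_path)


location = find_driver()
if location is None:
    print("chromedriver was not found on PATH")
else:
    print("chromedriver found at", location)
```

If the helper returns None, adding the folder that contains the unzipped driver to PATH (or passing that folder as extra_dir to verify the location first) is the usual next step.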
python3 requests POST request returns a 400 error
Assigning the POST parameters to data returns 400; passing them as json instead returns a normal 200.
    # coding:utf-8
    import requests
    url = r'http://**/**'
    data = {
        'fq': 'false',
        'limit': 10,
        'page': 1
    }
    headers = {
        'User-Agent': 'Mozilla/5.0 (Linux; Android 4.0.4; Galaxy Nexus Build/IMM76B) Apple
Original · 2020-12-22 16:26:33 · 7982 views · 9 comments
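A likely explanation for the difference is the request body format: requests form-encodes a dict passed as data and JSON-serializes a dict passed as json, so a server that only parses JSON bodies may reject the former with 400. The sketch below reproduces the two encodings with the standard library only, as an illustration of what each body looks like rather than of how requests is implemented; the variable names are assumptions.

```python
import json
from urllib.parse import urlencode

# Same parameters as in the post
payload = {"fq": "false", "limit": 10, "page": 1}

# What a dict passed as data= looks like on the wire
# (Content-Type: application/x-www-form-urlencoded)
form_body = urlencode(payload)

# What a dict passed as json= looks like on the wire
# (Content-Type: application/json)
json_body = json.dumps(payload)

print(form_body)
print(json_body)
```

Note that the form-encoded body turns every value into a string, while the JSON body keeps 10 and 1 as numbers, which is another way the two requests can differ from the server's point of view.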
requests urllib3.exceptions.ReadTimeoutError: HTTPConnectionPool(host='*', port=80): Read timed out
Problem
A crawler using python requests fails with a timeout error, as follows:
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 421, in _make_request
    six.raise_from(e, None)
  File "<string>", line 3, in raise_from
  File "/usr/lib/p
Original · 2020-09-18 09:38:08 · 3554 views · 2 comments
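The excerpt is cut off before any fix is described, so the following is not the author's solution. It is a minimal sketch of one common way to cope with occasional read timeouts: retrying the call with an increasing delay. The helper fetch_with_retry, its parameters, and the flaky stand-in function are all hypothetical; the built-in TimeoutError stands in for the library's timeout exception so the example runs without network access.

```python
import time


def fetch_with_retry(fetch, attempts=3, backoff=0.5, retry_on=(TimeoutError,)):
    """Call fetch() until it succeeds or the attempts are used up.

    The delay doubles after each failure: backoff, 2 * backoff, 4 * backoff, ...
    The last failure is re-raised so the caller still sees the original error.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except retry_on:
            if attempt == attempts:
                raise
            time.sleep(backoff * 2 ** (attempt - 1))


# Stand-in for a network call: times out twice, then succeeds
calls = []


def flaky():
    calls.append(1)
    if len(calls) < 3:
        raise TimeoutError("simulated read timeout")
    return "ok"


print(fetch_with_retry(flaky, backoff=0))
```

With requests, the same helper could wrap a call such as requests.get(url, timeout=10) and be given retry_on=(requests.exceptions.ReadTimeout,), where url and the timeout value are placeholders.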
requests.exceptions.ConnectionError: HTTPConnectionPool(host='****, port=80): Max retries exceeded w
When crawling with python requests.get, the following error is raised after a few items have been fetched:
requests.exceptions.ConnectionError: HTTPConnectionPool(host='****, port=80): Max retries exceeded with url: /beijing/file/4023E7D190674D26934AED5F4306DBC0/B76E1E3D9DF842D6ACBC63978C3A89FE/977FE5585248441E86067EFBF097E587/z
Original · 2020-09-17 17:35:46 · 22889 views · 2 comments