python爬取网页实例(二)－selenium设置Firefox代理

三爷麋了鹿

于 2019-01-14 18:17:21 发布

阅读量534

点赞数

分类专栏： Python

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/u800820/article/details/86481499

版权

Python 专栏收录该内容

10 篇文章 0 订阅

订阅专栏

依旧先上代码。

# -*- coding:utf-8 -*-
from lxml import etree
from fake_useragent import UserAgent
from selenium import webdriver

ua = UserAgent()
ua_header = {
    'User-Agent': ua.random,
    'Cookie': ''
}


def conn_weibo():
    index_url = "https://www.weibo.com/"
    proxy = {
        'host': '172.17.18.80',
        'port': 8080
    }
    profile = webdriver.FirefoxProfile()
    profile.set_preference('network.proxy.type', 1)
    profile.set_preference('network.proxy.http', proxy['host'])
    profile.set_preference('network.proxy.http_port', proxy['port'])
    profile.set_preference('network.proxy.ssl', proxy['host'])
    profile.set_preference('network.proxy.ssl_port', proxy['port'])
    profile.update_preferences()

    driver = webdriver.Firefox(profile)
    driver.get(index_url)


if __name__ == '__main__':
    conn_weibo()

这里看上去就只有几句有用的代码，但是实际运用的时候对于初学者埋了不少坑，我把遇到的问题和解决方式记录下。

安装geckodriver

selenium.common.exceptions.WebDriverException: Message: 'geckodriver' executable needs to be in PATH

ubuntu16.04环境下解决方法：

* 下载 geckodriver，地址： https://github.com/mozilla/geckodriver/releases
* 解压后将geckodriver 存放至 /usr/local/bin/ 路径下即可

2. Windows环境下:

* 下载 geckodriver，地址： https://github.com/mozilla/geckodriver/releases

　　* 将geckodriver.exe放到Firefox的安装目录下(如D:\Program Files\Mozilla Firefox）

　　* 将火狐安装目录（如D:\Program Files\Mozilla Firefox）添加到环境变量Path中

* 重启IDE

selenium配置Firefox代理

注意端口号是整数；
ssl和ssl_port是针对https请求设置的，但是这里不用判断请求方式，以防后面的请求变成http后无法使用代理访问。

三爷麋了鹿

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python爬取网页实例(二)－selenium设置Firefox代理

依旧先上代码。# -*- coding:utf-8 -*-from lxml import etreefrom fake_useragent import UserAgentfrom selenium import webdriverua = UserAgent()ua_header = { 'User-Agent': ua.random, 'Cookie': '...
复制链接

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。