python

超轻量级php框架startmvc

Selenium chrome配置代理Python版的方法

更新时间:2020-06-13 10:48:01 作者:startmvc
环境:windows7+Python3.5.2+Selenium3.4.2+ChromeDriver2.29+Chrome58.0.3029.110(64-bit)Selenium官方给的Firefox代理

环境: windows 7 + Python 3.5.2 + Selenium 3.4.2 + Chrome Driver 2.29 + Chrome 58.0.3029.110 (64-bit)

Selenium官方给的Firefox代理配置方式并不起效,也没看到合适的配置方式,对于Chrome Selenium官方没有告知如何配置,但以下两种方式是有效的:

1. 连接无用户名密码认证的代理


chromeOptions = webdriver.ChromeOptions()
chromeOptions.add_argument('--proxy-server=http://ip:port') 
driver = webdriver.Chrome(chrome_options=chromeOptions)

2. 有用户名和密码的连接


from selenium import webdriverdef create_proxyauth_extension(proxy_host, proxy_port,
 proxy_username, proxy_password,
 scheme='http', plugin_path=None):
 """Proxy Auth Extension

 args:
 proxy_host (str): domain or ip address, ie proxy.domain.com
 proxy_port (int): port
 proxy_username (str): auth username
 proxy_password (str): auth password
 kwargs:
 scheme (str): proxy scheme, default http
 plugin_path (str): absolute path of the extension 

 return str -> plugin_path
 """
 import string
 import zipfile

 if plugin_path is None:
 plugin_path = 'd:/webdriver/vimm_chrome_proxyauth_plugin.zip'

 manifest_json = """
 {
 "version": "1.0.0",
 "manifest_version": 2,
 "name": "Chrome Proxy",
 "permissions": [
 "proxy",
 "tabs",
 "unlimitedStorage",
 "storage",
 "<all_urls>",
 "webRequest",
 "webRequestBlocking"
 ],
 "background": {
 "scripts": ["background.js"]
 },
 "minimum_chrome_version":"22.0.0"
 }
 """

 background_js = string.Template(
 """
 var config = {
 mode: "fixed_servers",
 rules: {
 singleProxy: {
 scheme: "${scheme}",
 host: "${host}",
 port: parseInt(${port})
 },
 bypassList: ["foobar.com"]
 }
 };

 chrome.proxy.settings.set({value: config, scope: "regular"}, function() {});

 function callbackFn(details) {
 return {
 authCredentials: {
 username: "${username}",
 password: "${password}"
 }
 };
 }

 chrome.webRequest.onAuthRequired.addListener(
 callbackFn,
 {urls: ["<all_urls>"]},
 ['blocking']
 );
 """
 ).substitute(
 host=proxy_host,
 port=proxy_port,
 username=proxy_username,
 password=proxy_password,
 scheme=scheme,
 )
 with zipfile.ZipFile(plugin_path, 'w') as zp:
 zp.writestr("manifest.json", manifest_json)
 zp.writestr("background.js", background_js)

 return plugin_path

proxyauth_plugin_path = create_proxyauth_extension(
 proxy_host="proxy.crawlera.com",
 proxy_port=8010,
 proxy_username="fea687a8b2d448d5a5925ef1dca2ebe9",
 proxy_password=""
)


co = webdriver.ChromeOptions()
co.add_argument("--start-maximized")
co.add_extension(proxyauth_plugin_path)


driver = webdriver.Chrome(chrome_options=co)
driver.get(http://www.amazon.com/)

以上直接通过python代码生成chrome所需的zip插件文件,IP端口用户名密码写上自己的,原文出处:

https://vimmaniac.com/blog/bangal/selenium-chrome-driver-proxy-with-authentication/

插件源代码 https://github.com/RobinDev/Selenium-Chrome-HTTP-Private-Proxy

以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持脚本之家。

Selenium chrome配置代理 Selenium chrome 代理