1、获得页面的源码
driver.get('http://www.baidu.com')
source_code = driver.getPageSource()
如果想将获得的网页源码保存成文件的话:
f = open(r'c:\html_source_code.html','w')
f.write(source_code.encode('utf-8'))
f.close()
2、获得页面元素的某个属性(title,target ....)
element = driver.find_element_by_id('#id')
element.get_attribute('&&&')
3、判断某个元素的值是否为某种类型 (正则表达式根据具体的类型去写):
self.assertRegex(driver.find_element_by_class_name("className").text,r"^[\s\S]*[Tt]ext*$")
4、cookie处理
driver.get_cookies() 获取cookie信息
add_cookie(cookie_dict) 向cookie中添加会话信息
delete_cookie(name) 删除特定的cookie
delete_all_cookies() 删除所有cookie
driver.get("http://www.baidu.com")
driver.add_cookie({'name':'aaa', 'value':'bbb'})如果想把所有cookies 保存成txt文件以便以后使用,可以使用python pickle modulefor cookie in driver.get_cookies(): print "%s -> %s" % (cookie['name'], cookie['value'])
import pickle import selenium.webdriver driver = selenium.webdriver.Firefox() driver.get("http://www.google.com") pickle.dump( driver.get_cookies() , open("cookies.pkl","wb"))
#and later to add them back: import pickle import selenium.webdriver driver = selenium.webdriver.Firefox() driver.get("http://www.google.com") cookies = pickle.load(open("cookies.pkl", "rb")) for cookie in cookies: driver.add_cookie(cookie)