爬虫
tianyuan233
这个作者很懒,什么都没留下…
展开
-
UserWarning: Selenium support for PhantomJS has been deprecated, please use headless versions of Chr
In [1]: from selenium import webdriver In [2]: driver = webdriver.PhantomJS() G:\Anaconda3\lib\sitepackages\selenium\webdriver\phantomjs\webdriver.py:49: UserWarning: Selenium support for PhantomJS h...原创 2018-03-07 22:38:34 · 4225 阅读 · 0 评论 -
正则表达式re库学习笔记
import re content = 'Hello 123 4567 World_This is a Demo'泛匹配# result = re.match('^Hello\s\d',content) # print(result) # print(result.group()) # # result1 = re.match('^Hello(.*)mo$',content) # print(res原创 2018-03-14 22:08:29 · 375 阅读 · 1 评论 -
BeautifulSoup库学习笔记
import requests from bs4 import BeautifulSoup import lxml # data = requests.get('https://book.douban.com/').textdata = ''' <ul> <li class=""><a data-moreurl-dict='{"from":"top-nav-click-main","uid":"0"原创 2018-03-14 22:12:19 · 232 阅读 · 0 评论 -
pyquery学习笔记
from pyquery import PyQuery as pqdata = ''' <ul class="qqq"> <li class="1"><a data-moreurl-dict='{"from":"top-nav-click-main","uid":"0"}' href="https://www.douban.com" target="_blank">豆瓣</a></li> <li c原创 2018-03-15 22:38:27 · 347 阅读 · 0 评论 -
selenlenium基本用法学习笔记
from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.common.keys import Keys from selenium.webdriver.support import expected_conditions as EC from sele原创 2018-03-18 16:59:19 · 270 阅读 · 0 评论