爬虫
Gonietz123
这个作者很懒,什么都没留下…
展开
-
XPath 爬取糗事百科
import urllib.requestimport urllib.parseimport http.cookiejarfrom lxml import etreedef create_request(page): url = 'https://www.qiushibaike.com/text/page/' + str(page) + '/' headers = {...原创 2019-02-15 17:20:28 · 269 阅读 · 0 评论 -
python爬虫bs4抓取股票信息
import urllib.requestimport urllib.parsefrom bs4 import BeautifulSoupfrom day04.Item import Stockurl = 'http://quote.stockstar.com/'headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; ...原创 2019-02-15 17:25:32 · 406 阅读 · 0 评论 -
python爬虫使用json handler opener
import urllib.requestimport urllib.parseimport http.cookiejarimport jsonpathfrom lxml import etreeimport jsonclass Job: def __init__(self,name,jobname,salary): self.name = name ...原创 2019-02-15 17:30:21 · 242 阅读 · 0 评论