爬虫
Gonietz123
这个作者很懒,什么都没留下…
展开
-
XPath 爬取糗事百科
import urllib.request import urllib.parse import http.cookiejar from lxml import etree def create_request(page): url = 'https://www.qiushibaike.com/text/page/' + str(page) + '/' headers = { ...原创 2019-02-15 17:20:28 · 283 阅读 · 0 评论 -
python爬虫bs4抓取股票信息
import urllib.request import urllib.parse from bs4 import BeautifulSoup from day04.Item import Stock url = 'http://quote.stockstar.com/' headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; ...原创 2019-02-15 17:25:32 · 422 阅读 · 0 评论 -
python爬虫使用json handler opener
import urllib.request import urllib.parse import http.cookiejar import jsonpath from lxml import etree import json class Job: def __init__(self,name,jobname,salary): self.name = name ...原创 2019-02-15 17:30:21 · 253 阅读 · 0 评论