![](https://img-blog.csdnimg.cn/20201014180756754.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
python爬虫
vanranger
千里之行始于足下
展开
-
requests爬取小猪租房--记录
#!/usr/bin/env python#-*- coding:utf-8 -*-import requests,timefrom bs4 import BeautifulSoupheaders = { "User-Agent":'Mozilla/5.0 (X11; U; Linux x86_64; zh-CN; rv:1.9.2.10) Gecko/20100922 Ubuntu/1原创 2018-01-02 12:38:37 · 395 阅读 · 0 评论 -
requests爬取图片保存--记录
#!/usr/bin/env python#-*- coding:utf-8 -*-import requests,time,osfrom bs4 import BeautifulSoupheaders = { "User-Agent":'Mozilla/5.0 (X11; U; Linux x86_64; zh-CN; rv:1.9.2.10) Gecko/20100922 Ubunt原创 2018-01-02 16:39:30 · 2713 阅读 · 0 评论 -
requests爬小猪租房存入Mongodb--记录
#!/usr/bin/env python#-*- coding:utf-8 -*-import requests,time,pymongofrom bs4 import BeautifulSoupheaders = { "User-Agent":'Mozilla/5.0 (X11; U; Linux x86_64; zh-CN; rv:1.9.2.10) Gecko/20100922原创 2018-01-03 14:38:58 · 337 阅读 · 0 评论 -
requests爬猫眼电影 -- 记录
#!/usr/bin/env python#-*- coding:utf-8 -*-import requestsimport refrom requests.exceptions import RequestExceptionfrom json import dumpsdef get_one_page(url): try: response = requests.转载 2018-01-13 02:13:48 · 294 阅读 · 0 评论 -
requests多进程爬取今日头条街拍--记录
spider.py#!/usr/bin/env python#-*- coding:utf-8 -*-import requestsimport reimport jsonimport pymongoimport osfrom requests.exceptions import RequestExceptionfrom urllib.parse import urlen原创 2018-01-13 07:03:36 · 1354 阅读 · 0 评论