爬虫
winnertakeall
这个作者很懒,什么都没留下…
展开
-
豆瓣爬虫
import requestsfrom lxml import etreeheaders = { 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/72.0.3626.96 Safari/537.36', 'Host'...原创 2019-02-17 23:51:11 · 244 阅读 · 0 评论 -
爬虫之中国天气网
import requestsfrom bs4 import BeautifulSoupfrom pyecharts import BarALL_DATA = []headers = { "User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko...原创 2019-02-24 23:23:47 · 855 阅读 · 0 评论 -
电影天堂爬虫
from lxml import etreeimport requestsBASE_DOMIN = "http://dytt8.net"url = "http://dytt8.net/html/gndy/dyzz/list_23_1.html"headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) ...原创 2019-02-19 23:39:36 · 4229 阅读 · 0 评论 -
多线程
单线程的方式import timedef coding(): for x in range(3): print("正在写代码%s"%x) time.sleep(1) def drawing(): for x in range(3): print("正在画图%s"%x) time.sleep(1...原创 2019-02-26 20:54:40 · 126 阅读 · 0 评论 -
爬虫之静态网页
import requestsfrom lxml import etreefrom urllib import requestimport collectionsimport timeimport osimport randomimport datetimeimport pandas as pddef getUA(): user_agent_list = [ \ ...原创 2019-04-07 20:30:24 · 949 阅读 · 0 评论