爬虫
wilson_go
这个作者很懒,什么都没留下…
展开
-
QQ空间爬虫
参考原创 2018-10-06 22:41:02 · 1076 阅读 · 0 评论 -
python中爬虫通用方法
import os url = 'http://www.**.net/images/logo.gif'filename = os.path.basename(url)print(filename)python 从url中提取文件名原创 2019-03-21 10:31:32 · 294 阅读 · 0 评论 -
python中的ThreadPoolExecutor
#!/bin/env python3import requestsimport datetimeimport threadingimport csvimport jsonimport randomfrom concurrent.futures import ThreadPoolExecutorpool = ThreadPoolExecutor(16)threadLocal =...原创 2019-09-27 23:05:22 · 618 阅读 · 0 评论 -
python爬虫使用Cookie登录
将Cookie写在header头部# coding:utf-8import requestsfrom bs4 import BeautifulSoupcookie = '''cisession=19dfd70a27ec0eecf1fe3fc2e48b7f91c7c83c60;CNZZDATA1000201968=1815846425-1478580135-https%253A%252F%...原创 2019-03-28 17:36:38 · 1601 阅读 · 0 评论