- 博客(3)
- 资源 (68)
- 问答 (1)
- 收藏
- 关注
原创 requests+selenium 爬虫项目和 scrapy 爬虫项目的区别
爬虫项目request+selenium爬虫项目周期项目介绍爬了XXXXX,XXX,XXX,等网站,获取网站上的XXX,XXX,XXX,数据,每个月定时抓取XXX数据,使用该数据实现了XXX,XXX,XX,开发环境linux+pycharm+requests+mongodb+redis+crontab+scrapy_redis+ scarpy + mysql+gevent+celery+threading使用技术使用requests…把数据存储在mongodb中使用cron
2020-07-23 23:55:37 1518 1
原创 通用爬虫思路总结
通用爬虫思路1. 通用爬虫思路1. 准备URL准备start_urlurl地址规律不明显,总数不确定通过代码查找下一页urlxpath定位不明显,寻找url地址,部分参数可能放在当前的响应中(比如当前页码数和总页码数会在当前响应中)准备url_list页码总数明确url地址规律明显2. 发送请求,获取响应添加随机的User-Agent,反反爬虫添加随机代理的IP,建立ip代理池,反反爬虫在对方判断我们是爬虫后,应该添加更多的headers字段,包括cook
2020-07-23 23:51:23 376
原创 原生 JavaScript 中 window.onload 全局加载模块中定义的函数 不能执行的一些问题
一个关于在window.onload里面定义函数,然后在html里面调用函数时出现错误。具体见下面<!DOCTYPE html><html lang="en"><head> <meta charset="UTF-8"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <meta http-equiv="X-UA-Com
2020-07-23 23:50:16 1337 1
virtualenv-20.0.5-py2.py3-none-any.whl
2020-02-23
virtualenv-20.0.4-py2.py3-none-any.whl
2020-02-23
virtualenv-20.0.3-py2.py3-none-any.whl
2020-02-23
selenium-3.14.1-py2.py3-none-any.whl
2020-02-23
Scrapy-1.8.0-py2.py3-none-any.whl
2020-02-23
Scrapy-1.7.4-py2.py3-none-any.whl
2020-02-23
Scrapy-1.7.3-py2.py3-none-any.whl
2020-02-23
Scrapy-1.7.2-py2.py3-none-any.whl
2020-02-23
requests-2.23.0-py2.py3-none-any.whl
2020-02-23
requests-2.21.0-py2.py3-none-any.whl
2020-02-23
pytz-2019.2-py2.py3-none-any.whl
2020-02-23
pytz-2019.1-py2.py3-none-any.whl
2020-02-23
pyproj-2.5.0-cp38-cp38-win32.whl
2020-02-23
pyproj-2.5.0-cp38-cp38-win_amd64.whl
2020-02-23
pyproj-2.5.0-cp38-cp38-macosx_10_9_x86_64.whl
2020-02-23
pyproj-2.5.0-cp37-cp37m-win32.whl
2020-02-23
pyproj-2.5.0-cp37-cp37m-win_amd64.whl
2020-02-23
pyproj-2.5.0-cp36-cp36m-win32.whl
2020-02-23
pyproj-2.5.0-cp36-cp36m-win_amd64.whl
2020-02-23
pygame-1.9.6-cp37-cp37m-win32.whl
2020-02-23
pygal-2.3.0-py2.py3-none-any.whl
2020-02-23
plotly-4.5.1-py2.py3-none-any.whl
2020-02-23
plotly-4.5.0rc1-py2.py3-none-any.whl
2020-02-23
pandas-1.0.1-cp38-cp38-win32.whl
2020-02-23
pandas-1.0.1-cp38-cp38-macosx_10_9_x86_64.whl
2020-02-23
pandas-1.0.1-cp37-cp37m-win_amd64.whl
2020-02-23
numpy-1.18.1-cp38-cp38-win32.whl
2020-02-23
numpy-1.18.1-cp38-cp38-macosx_10_9_x86_64.whl
2020-02-23
matplotlib-3.1.3-cp36-cp36m-win_amd64.whl
2020-02-23
matplotlib-3.1.3-cp36-cp36m-win32.whl
2020-02-23
matplotlib-3.1.3-cp37-cp37m-win32.whl
2020-02-23
matplotlib-3.1.3-cp38-cp38-macosx_10_9_x86_64.whl
2020-02-23
matplotlib-3.1.3-cp38-cp38-win32.whl
2020-02-23
greenlet-0.4.15-cp37-cp37m-win_amd64.whl
2020-02-23
greenlet-0.4.15-cp37-cp37m-win32.whl
2020-02-23
greenlet-0.4.15-cp38-cp38-win_amd64.whl
2020-02-23
greenlet-0.4.15-cp38-cp38-win32.whl
2020-02-23
gevent-1.5a3-cp38-cp38-win_amd64.whl
2020-02-23
gevent-1.5a3-cp37-cp37m-win_amd64.whl
2020-02-23
gevent-1.4.0-cp37-cp37m-win_amd64.whl
2020-02-23
TA创建的收藏夹 TA关注的收藏夹
TA关注的人