Scrapy 是一种用于抓取网站和提取结构化数据的应用程序框架,可用于广泛的有用应用程序,如数据挖掘、信息处理或历史存档等。
安装 Scrapy
从 PyPI 安装:
pip install Scrapy
使用 Anaconda 或 Miniconda 安装:
conda install -c conda-forge scrapy
安装后可在命令行查看是否成功:
> scrapy
Scrapy 1.6.0 - no active project # 因为尚未新建 scrapy 项目
Usage:
scrapy <command> [options] [args]
Available commands:
bench Run quick benchmark test
fetch Fetch a URL using the Scrapy downloader
genspider Generate new spider using pre-defined templates
runspider Run a self-contained spider (without creating a project)
settings Get settings values
shell Interactive scraping console
startproject Create new project
version Print Scrapy version
view Open URL in browser, as seen by Scr