docker 构建scrapyd

8 篇文章 0 订阅
4 篇文章 0 订阅

docker 构建scrapyd

Dockerfile

`

FROM selenium/standalone-chrome:85.0-chromedriver-85.0-20200907
USER root

RUN apt-get update && \
  apt-get install -y xvfb && \
  apt-get install -y python3-distutils && \
  curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py && \
  python3 get-pip.py
ENV TimeZone=Asia/Shanghai   
RUN ln -snf /usr/share/zoneinfo/$TimeZone /etc/localtime && echo $TimeZone > /etc/timezone
WORKDIR /app
COPY requirements.txt .
COPY ./scrapyd.conf /etc/scrapyd/
EXPOSE 6800
RUN python3 -m pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple/
COPY . .
CMD scrapyd

`

requirements.txt

`

attrs
Automat
backports.zoneinfo
certifi
cffi
charset-normalizer
colorama
constantly
cryptography
cssselect
dateparser
Deprecated
Distance
environs
Faker
filelock
gerapy-auto-extractor
gne
hyperlink
idna
incremental
itemadapter
itemloaders
jmespath
joblib
loguru
lxml
marshmallow
numpy
packaging
parsel
Protego
pyasn1
pyasn1-modules
pycparser
PyDispatcher
pymongo
pyOpenSSL
pyparsing
python-dateutil
python-dotenv
pytz
pytz-deprecation-shim
PyYAML
queuelib
redis
regex
requests
requests-file
scikit-learn
scipy
Scrapy
scrapy-redis
service-identity
six
threadpoolctl
tldextract
Twisted
typing_extensions
tzdata
tzlocal
urllib3
w3lib
win32-setctime
wrapt
zope.interface
requests
selenium
aiohttp
beautifulsoup4
pyquery
pymysql
redis
pymongo
flask
django
scrapy
scrapyd
scrapyd-client
scrapy-redis
scrapy-splash

`

scrapyd.conf

`

[scrapyd]
eggs_dir    = eggs
logs_dir    = logs
items_dir   =
jobs_to_keep = 5
dbs_dir     = dbs
max_proc    = 0
max_proc_per_cpu = 10
finished_to_keep = 100
poll_interval = 5.0
bind_address = 0.0.0.0
http_port   = 6800
debug       = off
runner      = scrapyd.runner
application = scrapyd.app.application
launcher    = scrapyd.launcher.Launcher
webroot     = scrapyd.website.Root

[services]
schedule.json     = scrapyd.webservice.Schedule
cancel.json       = scrapyd.webservice.Cancel
addversion.json   = scrapyd.webservice.AddVersion
listprojects.json = scrapyd.webservice.ListProjects
listversions.json = scrapyd.webservice.ListVersions
listspiders.json  = scrapyd.webservice.ListSpiders
delproject.json   = scrapyd.webservice.DeleteProject
delversion.json   = scrapyd.webservice.DeleteVersion
listjobs.json     = scrapyd.webservice.ListJobs
daemonstatus.json = scrapyd.webservice.DaemonStatus

`

创建scrapyd文件夹,将三个文件放进去,然后编译就可以了

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值