scrapy批量插入mysql

最新推荐文章于 2024-03-17 10:08:37 发布

蛊i

最新推荐文章于 2024-03-17 10:08:37 发布

阅读量1.4k

点赞数

分类专栏： scrapy框架

本文链接：https://blog.csdn.net/qq_36197940/article/details/84790143

版权

本文介绍了如何配置和使用Scrapy爬虫框架，配合items.py定义数据模型，settings.py配置数据库连接，并通过pipeline.py处理数据，实现从网页抓取信息并批量插入到MySQL数据库的过程。

摘要由CSDN通过智能技术生成

spider.py

sql='insert into tiebadata(publish_name,publish_time,publish_url,publish_content,comment_content,comment_time,comment_name,keyword,app_name,run_time)VALUES (%s,%s,%s,%s,%s,%s,%s,%s,%s,%s)'
data=(item['publish_name'], item['publish_time'], item['publish_url'], item['publish_content'], item['comment_content'], item['comment_time'], item['comment_name'], item['keyword'], item['app_name'], item['run_time'])
item['zong']=[sql,data]

items.py

storage_type = scrapy.Field()  # 存储类型
analysis_type = scrapy.Field()  # 解析网站
zong = scrapy.Field()#数据汇总

setting.py

MYSQL_DB_NAME='**'
MYSQL_HOST='**'
MYSQL_USER='**'
MYSQL_PASSW

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

蛊i

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
scrapy批量插入mysql

spider.pysql='insert into tiebadata(publish_name,publish_time,publish_url,publish_content,comment_content,comment_time,comment_name,keyword,app_name,run_time)VALUES (%s,%s,%s,%s,%s,%s,%s,%s,%s,%s)'...
复制链接

扫一扫