![](https://img-blog.csdnimg.cn/20201014180756724.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
python
犹大的誓言
功不唐捐,玉汝于成
展开
-
python课程设计----简单爬虫
import GetHtml as g,SaveData as sif __name__ == '__main__': # 入口地址 address = ['http://news.zzu.edu.cn/mtzd.htm'] # 用来存储已经爬取过的地址,防止重复爬取 bin = [] # 队列 存放的是爬取过的url地址 while len(address) != 0: get = g.GetHtml() htmls = []原创 2022-03-16 19:53:26 · 1159 阅读 · 0 评论 -
python实现博客爬虫
python实现博客爬虫有序的存到word中# -*- coding:utf-8 -*-from bs4 import BeautifulSoupimport urllib.request, urllib.response, urllib.error, urllib.parsefrom docx import Documentfrom docx.shared import Inchesimport re# 爬取网页函数def request(url): html = ""原创 2021-06-23 20:38:03 · 436 阅读 · 0 评论