Web Scraping
I learned this from other sources as well.
不同林
To exchange ideas, add me on QQ: 421065484
2021-03-24
Scraping Douyu and Charting the Data
from selenium import webdriver import time import matplotlib.pyplot as plt import jieba from wordcloud import WordCloud from pyecharts.charts import Bar from pyecharts import options as opts class DouYuSpider(): def __init__(self): s
Original · 2021-03-24 22:47:46 · 173 views · 0 comments
2021-03-24
Scraping Romance of the Three Kingdoms (三国演义)
import requests from bs4 import BeautifulSoup import urllib.parse import os class SanGuoSpider(): def __init__(self): self.headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like
Original · 2021-03-24 22:46:13 · 167 views · 0 comments
2021-03-24
Scraping XiangHa Recipes: Worry-Free Home Cooking
import requests from lxml import etree import re import openpyxl import os class XiangHaSpider(): def __init__(self): self.url = 'https://www.xiangha.com/caipu/c-jiachang/hot-{}/' self.headers = { 'User-Age
Original · 2021-03-24 22:45:14 · 124 views · 0 comments
2021-03-24
Scraping Douban
from selenium import webdriver import time from pyecharts.charts import Bar from pyecharts import options as opts class DouBanSpider(): def __init__(self): self.driver = webdriver.Chrome() self.url = 'https://movie.douban.com/'
Original · 2021-03-24 22:42:08 · 82 views · 0 comments
2021-03-24
Scraping a Novel: Saving It as a txt File
Read books without depending on anyone. Packages to install: pip3 install beautifulsoup4 -i https://pypi.douban.com/simple pip3 install requests -i https://pypi.douban.com/simple Code: import requests from bs4 import BeautifulSoup import os import time # BeautifulSoup needs html5lib installed for parsing # Use
Reposted · 2021-03-24 22:10:41 · 128 views · 0 comments