python beautifulsoup4示例_Python爬虫实现使用beautifulSoup4爬取名言网功能案例

最新推荐文章于 2022-03-31 14:06:23 发布

贺清春

最新推荐文章于 2022-03-31 14:06:23 发布

阅读量182

点赞数

文章标签： python beautifulsoup4示例

本文链接：https://blog.csdn.net/weixin_29613721/article/details/111967518

版权

本文实例讲述了Python爬虫实现使用beautifulSoup4爬取名言网功能。分享给大家供大家参考，具体如下：

爬取名言网top10标签对应的名言，并存储到mysql中，字段(名言，作者，标签)

#! /usr/bin/python3

# -*- coding:utf-8 -*-

from urllib.request import urlopen as open

from bs4 import BeautifulSoup

import re

import pymysql

def find_top_ten(url):

response = open(url)

bs = BeautifulSoup(response,'html.parser')

tags = bs.select('span.tag-item a')

top_ten_href = [tag.get('href') for tag in tags]

top_ten_tag = [tag.text for tag in tags]

# print(top_ten_href)

# print(top_ten_tag)

return top_ten_href

def insert_into_mysql(records):

con = pymysql.connect(host='localhost',user='root',

password='root',database='quotes',charset='utf8

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注