Crawling a Web Page, Parsing Its Content, and Storing It in a MySQL Database
Third-party libraries used:
- BeautifulSoup: parses the page content. Suggested installation:
  pip install beautifulsoup4
- pymysql: works with the database. Suggested installation:
  pip install pymysql
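As a rough illustration of how the two libraries divide the work, here is a minimal sketch, not the script's actual implementation. The function names `extract_links` and `save_links`, the `^/item/` link pattern, and the `urls` table with its `urlname` and `urlhref` columns are all assumptions made for this example. BeautifulSoup pulls the links out of an HTML string, and a pymysql connection (opened elsewhere with `pymysql.connect(...)`) writes them to the database.

```python
import re

from bs4 import BeautifulSoup


def extract_links(html, pattern=r"^/item/"):
    """Return (text, href) pairs for every <a> tag whose href matches pattern.

    The default pattern is an assumption for this sketch; adjust it to the
    links you actually want to keep.
    """
    soup = BeautifulSoup(html, "html.parser")
    links = []
    # Passing a compiled regex as href keeps only anchors whose href matches;
    # anchors without an href attribute are skipped.
    for anchor in soup.find_all("a", href=re.compile(pattern)):
        links.append((anchor.get_text(strip=True), anchor["href"]))
    return links


def save_links(connection, links):
    """Insert (text, href) pairs using an already-open pymysql connection.

    The table `urls` and its columns `urlname` and `urlhref` are hypothetical;
    create them in your own schema before running this.
    """
    sql = "INSERT INTO urls (urlname, urlhref) VALUES (%s, %s)"
    with connection.cursor() as cursor:
        # Parameterized queries let the driver escape the values safely.
        cursor.executemany(sql, links)
    connection.commit()


# A small static page stands in for a downloaded document.
sample_html = (
    '<a href="/item/Python">Python</a>'
    '<a href="https://example.com/">External</a>'
    '<a>No href</a>'
)
sample_links = extract_links(sample_html)
```

Keeping the parsing step separate from the database step means the parser can be checked against a static HTML string without a network connection or a running MySQL server.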
import re
from urllib.request import urlopen
from bs4 import BeautifulSoup
import pymysql.cursors
if __name__ == '__main__':
    url = 'https://baike.baidu.com/item/%E7%99%BE%E5%BA%A6/6699?fr=aladdin'
    # Crawl all link URLs found on the page
    response = urlo