Getting Started with the BeautifulSoup Library
The main job of the BeautifulSoup library is to parse HTML and XML documents and to provide parsing functionality for them.
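import requests
from bs4 import BeautifulSoup as bs
r = requests.get("https://www.python123.io/ws/demo.html", timeout=10)
r.raise_for_status()  # stop early if the request failed
demo = r.text  # HTML source of the page as a string
soup = bs(demo, "html.parser")  # the second argument selects the parser
print(soup)
print(soup.prettify())  # adds line breaks and indentation so the markup is easier to read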
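The same steps can be tried without a network connection by parsing a string literal. The sketch below is only an illustration: the `sample_html` markup is made up and stands in for the downloaded page, and the variable names are not from the original notes.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, standing in for the page fetched above.
sample_html = (
    "<html><head><title>Demo page</title></head>"
    '<body><p class="intro">Hello, <b>soup</b>!</p></body></html>'
)

soup = BeautifulSoup(sample_html, "html.parser")

title_text = soup.title.string   # text inside the <title> tag
intro_text = soup.p.get_text()   # all text inside the first <p>, nested tags included
pretty = soup.prettify()         # indented markup, one tag per line

print(title_text)
print(intro_text)
print(pretty)
```

Attribute-style access such as `soup.title` and `soup.p` returns the first matching tag, which is usually enough for a quick look at a document.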
BeautifulSoup Parsers
Parser | Usage | Requirement |
---|---|---|
bs4's HTML parser (Python's built-in html.parser) | BeautifulSoup(mk, "html.parser") | Install the bs4 library |
lxml HTML parser | BeautifulSoup(mk, "lxml") | pip install lxml |
lxml XML parser | BeautifulSoup(mk, "xml") | pip install lxml |
html5lib parser | BeautifulSoup(mk, "html5lib") | pip install html5lib |
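Because lxml and html5lib are optional installs, a script can try several parsers in turn and fall back to the built-in one. The following is a minimal sketch under that assumption; `make_soup` and its `preferred` parameter are hypothetical names introduced here, not part of bs4. It relies on bs4 raising `FeatureNotFound` when the requested parser is not installed.

```python
from bs4 import BeautifulSoup, FeatureNotFound


def make_soup(markup, preferred=("lxml", "html5lib", "html.parser")):
    """Parse markup with the first parser in `preferred` that is installed.

    Hypothetical helper: returns the soup together with the parser name used.
    """
    for name in preferred:
        try:
            return BeautifulSoup(markup, name), name
        except FeatureNotFound:
            # This parser is not installed; try the next one.
            continue
    raise RuntimeError("none of the requested parsers is available")


soup, parser_used = make_soup("<p>Unclosed paragraph")
print(parser_used)
print(soup.p.get_text())
```

The parsers can build slightly different trees from malformed markup (lxml and html5lib wrap the fragment in `<html>` and `<body>`, for example), so it may help to pin one parser once a script depends on the exact structure.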