beautifulsoup4
安装
sudo pip3 install beautifulsoup4
使用流程
from bs4 import BeautifulSoup
#1.创建解析对象
soup=BeautifulSoup(html,'lxml')
#2.调用find_all()方法
r_list=soup.find_all(节点,条件)
BeautifulSoup支持的解析库
1.lxml
2.html.parser
3.xml
常用方法
.1.find():找1个节点
2.find_all():列表
3.节点.get_text():文本内容
from bs4 import BeautifulSoup as bs
html = '''
<div class="test">熊好吧</div>
<div class="test">雄霸</div>
<div class="test">灭霸</div>
'''
soup = bs(html, 'lxml')
r_list = soup.find_all('div', attrs={'class': 'test'})
print(r_list)
for r in r_list:
print(r.get_text())