- select()标签名不加任何修饰,类名前加点,id名前加#,属性用[ 属性 = ‘*****’ ].
- select()以列表的形式返回。
- 导入形式from bs4 import BeautifulSoup(基于python3.6).
# 通过标签名查找
soup = BeautifulSoup(html, "lxml")
list = soup.select('div')
# 查找div标签。
#通过标类名查找
soup = BeautifulSoup(html, "lxml")
list = soup.select('.point')
#查找类名为point的类
#通过id查找
soup = BeautifulSoup(html, "lxml")
list = soup.select('#name')
#组合查找
soup = BeautifulSoup(html, "lxml")
list = soup.select('div #name')
#属性查找
soup = BeautifulSoup(html, "lxml")
list = soup.select('img[width="75"]')
#查找img标签下满足width=“75”的标签
list = soup.select('head>title')
#查找head下的title标签
#属性需要用中括号括起来,注意属性和标签属于同一节点,所以中间不能加空格,否则会无法匹配到