用BeautifulSoup 解析html和xml字符串
实例:
#!/usr/bin/python
# -*- coding: UTF-8 -*-
from bs4 import BeautifulSoup
import re
#待分析字符串
html_doc = """
The Dormouse's storyThe Dormouse's story
Once upon a time there were three little sisters; and their names were
and
and they lived at the bottom of a well.
...
"""
# html字符串创建Beautifu