我有一个HTML文档,如下所示,自我。汤是美化组对象。我试图从列表元素中获取数据。列表元素如下所示:
Name
Piter
Year
2017
python文件刮伤.py
^{pr2}$
第一次尝试specs = self.soup.select("ul.list-group li.list-group-item")
if len(specs) > 0 :
for data in specs :
text = data.get_text()
if need.has_key( data[0].strip()) :
if need[ data[0].strip() ] not in person or person[ need[ data[0].strip() ] ] == '':
person[ need[ text[0].strip() ] ] = text[1].strip()
第一个错误File "scraper.py", line 68, in scrape
if need.has_key( data[0].strip()) :
File "build/bdist.linux-x86_64/egg/bs4/element.py", line 1011, in__getitem__
KeyError: 0
第二次尝试specs = self.soup.select("ul.list-group li.list-group-item")
if len(specs) > 0 :
for data in specs :
text = data.get_text()
if need.has_key( data[0].strip()) :
if need[ data[0].strip() ] not in person or person[ need[ data[0].strip() ] ] == '':
person[ need[ text[0].strip() ] ] = text[1].strip()
第二个错误File "site_scrapers/v12software.scraper.py", line 66, in scrape
text = [ data.contents[0].get_text(), data.contents[1].get_text() ]
File "build/bdist.linux-x86_64/egg/bs4/element.py", line 737, in __getattr__
AttributeError: 'NavigableString' object has no attribute 'get_text'
我试图将上面的元素字符串转换为person数组。在
我需要这样的结果:print person['Name']
#output Piter
print person['Year']
#output 2017