Here is the code:
content = response.read().decode('gbk')
pattern = re.compile('<div*?"section.*?<div*?<h1.*?"srticle-title">(.*?)</h1>.*?'+
'<div.*?<span>(.*?).*?<span>(.*?).*?'+
'<div.*?article-text">(.*?)</div>.*?class="good.*?text">(.*?)</span>'+
'.*?class="bad.*?text">(.*?)',re.S)
items = pattern.findall(pattern,content)
for item in items:
    print(item[0],item[1],item[2],item[3],item[4],item[5])
Running it produces the output below. Why does this happen?
Traceback (most recent call last):
  File "D:/PycharmProjects/spiderstudy/test.py", line 21, in <module>
    items = pattern.findall(pattern,content,flag=0)
TypeError: 'str' object cannot be interpreted as an integer
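
For context, a likely cause (not confirmed against the full script): a compiled pattern's findall takes the string to search as its first argument, followed by optional integer pos and endpos, while the module-level re.findall takes the pattern first and then the string. Passing the pattern again to the compiled object's method pushes the string into the integer pos slot. The sketch below reproduces this with a made-up sample string and a simplified pattern; both are assumptions for illustration, not the real page or regex.

```python
import re

# Hypothetical sample text and simplified pattern, for illustration only.
sample = '<h1 class="title">Hello</h1><h1 class="title">World</h1>'
pattern = re.compile(r'<h1.*?>(.*?)</h1>', re.S)

# Compiled pattern method: findall(string[, pos[, endpos]])
items = pattern.findall(sample)

# Module-level function: re.findall(pattern, string, flags=0)
same_items = re.findall(pattern, sample)

# Passing the pattern again shifts `sample` into the integer `pos` slot,
# which raises a TypeError like the one in the traceback.
try:
    pattern.findall(pattern, sample)
except TypeError as exc:
    error_message = str(exc)

print(items)
print(error_message)
```

If that is what is happening here, either pattern.findall(content) or re.findall(pattern, content) should avoid the error.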