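I'm scraping a website so I can store the data in a database with 3 columns. The part of the site I'm scraping looks like any one of the following three examples: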
# Example 1:
text1
# Example 2:
text1
text2
# Example 3:
text1
text2
text3
I'm trying to assign
"text1" to var1,
either an empty string or "text2" to var2,
either an empty string or "text3" to var3.
What is the best way to do this?
Here are some things I've tried:
#### FIRST ATTEMPT
var1, var2, var3 = '', '', ''
# could also do var1, var2, var3 = ('',)*3
all = soup.find_all('a')
var1 = all[0].text
try:
    var2 = all[1].text
except IndexError:
    pass
try:
    var3 = all[2].text  # the third link is at index 2, not 3
except IndexError:
    pass
#### SECOND ATTEMPT
all = [s.text for s in soup.find_all('a')]
# This is where I get stuck. This could return a list of length 1, 2, or 3, and I need a list of length exactly 3 so I can use the following line to assign the variables
var1, var2, var3 = all
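One possible way past this sticking point is to pad the list with empty strings before unpacking it. The sketch below is only an illustration: `pad_to_three` is a hypothetical helper name (not part of BeautifulSoup), and the hard-coded list stands in for the scraped `[s.text for s in soup.find_all('a')]` result.

```python
def pad_to_three(values, fill=""):
    """Return a list of exactly three items.

    Extra items are dropped and missing ones are replaced with `fill`.
    Hypothetical helper for illustration only.
    """
    padded = list(values)[:3]
    return padded + [fill] * (3 - len(padded))


# Stand-in for the scraped link texts (assumed data, not real output).
texts = ["text1", "text2"]

var1, var2, var3 = pad_to_three(texts)
print(repr(var1), repr(var2), repr(var3))  # 'text1' 'text2' ''
```

Because the helper always returns three items, the tuple unpacking can no longer fail with a `ValueError`, whatever the number of links found.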
#### THIRD ATTEMPT
all = [s.text for s in soup.find_all('a')]
var1, var2, var3 = '','',''
n = len(all)
var1 = all[0]  # all already holds strings, so no .text here
if n == 2:
    var2 = all[1]
elif n == 3:
    var2 = all[1]