list index out of range (list index out of bounds)
Traceback (most recent call last):
  File "G:/workSpace/Python/TB_Crawler/Crawler_train.py", line 71, in <module>
    print(getCommentsList(ItemURL2))
  File "G:/workSpace/Python/TB_Crawler/Crawler_train.py", line 46, in getCommentsList
    comment = getComment(newURL, i)['content']
  File "G:/workSpace/Python/TB_Crawler/Crawler_train.py", line 27, in getComment
    comment = jd['comments'][num]
IndexError: list index out of range
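The traceback points to the line comment = jd['comments'][num] in the getComment function.
Taobao shows at most 20 comments per page, so the crawler loops over 20 comments on every page. When a page holds fewer than 20 comments, num still runs through all 20 index positions, so jd['comments'][num] eventually indexes past the end of the list and raises IndexError.
Fix: break out of the loop once the last comment on the page has been reached.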
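As a rough illustration of that fix, here is a minimal sketch. It assumes jd is the already-parsed JSON for one page and that each entry in jd['comments'] has a 'content' key, as the traceback suggests. The names PAGE_SIZE and get_page_comments are hypothetical and not from the original script, and the sample data is made up for the demo.

```python
PAGE_SIZE = 20  # per the text above: at most 20 comments per page


def get_page_comments(jd):
    """Collect the 'content' of every comment on one page.

    Hypothetical helper: loops over up to PAGE_SIZE positions and
    breaks as soon as the page runs out of comments.
    """
    comments = jd.get('comments', [])
    contents = []
    for num in range(PAGE_SIZE):
        if num >= len(comments):
            # Past the last comment on this page: stop instead of indexing.
            break
        contents.append(comments[num]['content'])
    return contents


# A made-up last page that holds only 3 comments.
short_page = {'comments': [{'content': 'good'},
                           {'content': 'fast shipping'},
                           {'content': 'ok'}]}
print(get_page_comments(short_page))
```

Checking num against len(comments) before indexing means a short or empty page simply ends the loop. Iterating directly over jd['comments'] would avoid the index altogether, if the surrounding code does not need num.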
# Get the item id def