How to get the Chinese characters for Unicode code points in Python 3: https://www.zhihu.com/question/26921730
Scraping Baidu Tieba produced garbled output like this:
['人ä¸\xadé¾\x99å\x87¤'] ['http://tb.himg.baidu.com/sys/portrait/item/tb.1.9c87a1ac.W-YVyJf4miCNGfpMTnkAJA']
['å\x9b\x9bæ\x96¹æ¸¸ä¾\xa0'] ['http://tb.himg.baidu.com/sys/portrait/item/tb.1.408ad0ef.Rzh-2gAsL3jy6dseTEvHbg']
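The escapes in these samples look like UTF-8 bytes that were decoded as Latin-1 (ISO-8859-1). Below is a minimal sketch of that diagnosis, assuming this is what happened here. `fix_mojibake` is a hypothetical helper name, not part of any library.

```python
def fix_mojibake(garbled: str) -> str:
    """Undo a UTF-8 -> Latin-1 mis-decoding by reversing the two steps."""
    # Latin-1 maps code points 0-255 one-to-one onto bytes, so encoding
    # with it recovers the original byte sequence exactly.
    raw = garbled.encode("latin-1")
    # Decode those bytes with the encoding they were actually written in.
    return raw.decode("utf-8")


# Reproduce the garbling: encode as UTF-8, then wrongly decode as Latin-1.
garbled = "四方游侠".encode("utf-8").decode("latin-1")
print(repr(garbled))
print(fix_mojibake(garbled))
```

This round trip only works when every character of the garbled string fits in Latin-1. If the string contains any character above U+00FF, `encode("latin-1")` raises `UnicodeEncodeError`, so it is better to fix the decoding at the source than to repair strings afterwards.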
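Solution: return the decoded text instead of the raw bytes. response.content is the undecoded body (bytes); response.text is a str that requests decodes using the response's encoding.
Before:
response = requests.get(url, headers=self.headers)
return response.content
After:
response = requests.get(url, headers=self.headers)
return response.text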
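response.text depends on the encoding that requests infers from the headers. When a text response declares no charset, requests falls back to ISO-8859-1, which produces the same kind of garbling. Below is a hedged sketch of an explicit decode, assuming the page is really UTF-8. `decode_page` and its `default` parameter are hypothetical names. `response` can be any object with `content` and `encoding` attributes, such as a requests.Response; a stand-in object is used here so the example runs offline.

```python
from types import SimpleNamespace


def decode_page(response, default="utf-8"):
    """Decode the body explicitly instead of trusting a guessed encoding."""
    enc = response.encoding
    # Treat a missing charset, or the ISO-8859-1 fallback, as "unknown"
    # and use the assumed default instead.
    if enc is None or enc.lower() == "iso-8859-1":
        enc = default
    # errors="replace" keeps a stray bad byte from aborting the whole page.
    return response.content.decode(enc, errors="replace")


# Stand-in for a response whose headers carried no charset.
fake = SimpleNamespace(content="四方游侠".encode("utf-8"), encoding="ISO-8859-1")
print(decode_page(fake))
```

With a real response, setting `response.encoding = "utf-8"` before reading `response.text` has a similar effect.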