import requests as re
from bs4 import BeautifulSoup as bs
web=re.get("http://www.xbiquge.la/13/13959/")
soup=bs(web.text,"lxml")
print(soup.title)
使用上述代码时,中文标题出现乱码
"D:\Program Files\Python\python.exe" D:/PC/test/python/mine.py
<title>å£å¢å°è¯´_è¾°ä¸_å£å¢ææ°ç« è_å£å¢æ å¼¹çª_æ°ç¬è¶£é</title>
Process finished with exit code 0
出现这种情况时,先查看网页的编码方式,上述网页的编码方式为utf-8,只需添加为web指定编码方式即可
import requests as re
from bs4 import BeautifulSoup as bs
web=re.get("http://www.xbiquge.la/13/13959/")
web.encoding="utf8"
soup=bs(web.text,"lxml")
print(soup.title)
"D:\Program Files\Python\python.exe" D:/PC/test/python/mine.py
<title>圣墟小说_辰东_圣墟最新章节_圣墟无弹窗_新笔趣阁</title>
Process finished with exit code 0