[Python] URL Basics: Web Crawling Techniques

Latest recommended article published 2022-07-22 10:48:25

CS正阳


Category: Development Tools: Python. Tags: python, crawler

Article link: https://blog.csdn.net/sunyaowu315/article/details/81840378


Requesting the content of a web page with urllib.request

The encoding is detected automatically with chardet. Install it with: conda install chardet

from urllib import request
import chardet


if __name__ == '__main__':
    # Target URL
    url = "http://ts.zhaopin.com/jump/index_new.html?utm_source=other&utm_medium=cnt&utm_term=&utm_campaign=121113803&utm_provider=zp&sid=121113803&site=pzzhubiaoti"
    # Send the request
    rsp = request.urlopen(url)
    # Read the response body; read() returns bytes, not str
    html = rsp.read()
    print(type(html))
    # Detect the encoding of the raw bytes with chardet
    cs = chardet.detect(html)
    # "encoding" may be None when detection fails, so fall back to utf-8
    html = html.decode(cs.get("encoding") or "utf-8")
    print(html)
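
The decoding step can be made a little more defensive. The sketch below is one possible approach, not part of the original script: it prefers the charset declared in the response's Content-Type header, falls back to a detector result shaped like the dict returned by chardet.detect, and finally to utf-8. The helper names choose_encoding and decode_body are hypothetical, and the demonstration uses a hand-built header object so it runs without network access.

```python
from email.message import Message


def choose_encoding(headers, detected, default="utf-8"):
    """Pick a codec name: declared charset first, then the detector's guess."""
    # None when the Content-Type header declares no charset
    declared = headers.get_content_charset()
    # A chardet-style result may hold None under "encoding"
    guessed = (detected or {}).get("encoding")
    return declared or guessed or default


def decode_body(raw, headers, detected=None):
    """Decode response bytes, replacing undecodable bytes instead of raising."""
    return raw.decode(choose_encoding(headers, detected), errors="replace")


if __name__ == "__main__":
    # Offline demonstration; with urllib, rsp.headers is a compatible object
    headers = Message()
    headers["Content-Type"] = "text/html; charset=gbk"
    print(decode_body("招聘".encode("gbk"), headers))
```

With a real response this would be called as decode_body(html, rsp.headers, chardet.detect(html)). A codec name that Python does not recognize would still raise LookupError, so a production crawler may want to catch that as well.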
