[Python3.x]网络爬虫（一）：利用urllib通过指定的URL抓取网页内容

最新推荐文章于 2024-07-30 17:21:38 发布

albert1828

最新推荐文章于 2024-07-30 17:21:38 发布

阅读量5.1k

点赞数

分类专栏： python 文章标签： python

本文链接：https://blog.csdn.net/zhangyaping123/article/details/72730896

版权

本文介绍了Python3.x进行网络爬虫的基本操作，包括如何爬取百度首页内容，使用方法1和方法2进行GET请求，讲解了发送data表单数据的POST请求，以及如何在HTTP请求中设置Headers。

摘要由CSDN通过智能技术生成

1.爬百度首页,
方法1:

#!/usr/bin/python
# -*- coding: UTF-8 -*-
import urllib.request
response = urllib.request.urlopen('http://www.lovejing.com/')
html = response.read();
print(html);

方法2:

#!/usr/bin/python
# -*- coding: UTF-8 -*-
import urllib.request
req = urllib.request.Request('http://www.lovejing.com/')
response = urllib.request.urlopen(req)
html = response.read();
print(html);

2.发送data表单数据(

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

albert1828

关注关注

0
点赞
踩
8

收藏

觉得还不错? 一键收藏
0
评论
[Python3.x]网络爬虫（一）：利用urllib通过指定的URL抓取网页内容

1.爬百度首页, 方法1:#!/usr/bin/python# -*- coding: UTF-8 -*-import urllib.requestresponse = urllib.request.urlopen('http://www.baidu.com/')html = response.read();print(html);方法2:#!/usr/bin/python# -*-
复制链接

扫一扫