Web crawler study(1)

最新推荐文章于 2024-09-29 14:13:49 发布

weixin_30853329

最新推荐文章于 2024-09-29 14:13:49 发布

阅读量52

点赞数

文章标签： python

原文链接：http://www.cnblogs.com/yongdaiblog-201409/p/6731056.html

版权

Web crawler study(1)

1. setup the python3 enviromemt via download the excuted files from the website https://www.python.org/downloads/

2.Atfer seting up ,confirm that whether the enviroment is successful or not .

open the CMD windows / Linux terminal to type "python" ,then press the enter key.

3.create a python file for coding.eg :demo.py

　　# coding=gbk #it can be avoid the syntaxerror：non-utf-8 code starting with \x3
　　

      import urllib.request                           # urllib.request is a package which usally used to get the infomation form the web pages
　　
　　url="http://www.baidu.com"                # the web site that we want to get the information from it

　　response=urllib.request.urlopen(url)    # get the reponse from the web server,the expected result is the information that we wanted.

　　html=response.read()                          # return the information the Binary string,so that the infromation can be displayed.

　　codeOfHtml=html.decode('utf-8')         #decoding the information

　　print(codeOfHtml)                                #print the information