Web crawler study (1)

1. Set up the Python 3 environment by downloading the installer from https://www.python.org/downloads/

 

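2. After setup, confirm that the environment was installed successfully.

   Open a CMD window (Windows) or a Linux terminal, type "python", then press the Enter key. If the interpreter starts and prints its version number followed by the ">>>" prompt, the installation works.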
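Once the interpreter is open, a few lines can confirm that it really is Python 3 and not an older Python 2 install. This is only a minimal sketch; the helper name is_python3 is made up for illustration and is not part of the original steps.

```python
import sys


def is_python3(version_info=sys.version_info):
    """Return True when the given version tuple is Python 3 or newer."""
    # is_python3 is a hypothetical helper name used only in this sketch.
    return version_info[0] >= 3


# Show the exact version of the interpreter that is running this code.
print("Python", sys.version.split()[0])
print("Python 3 available:", is_python3())
```

If the second line prints False, the "python" command most likely points to an older installation.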

 

3. Create a Python file for the code, e.g. demo.py

 

  # coding=gbk                                # declare the source encoding; avoids "SyntaxError: Non-UTF-8 code starting with \x3" when the file is saved in GBK rather than UTF-8

  import urllib.request                       # urllib.request is the standard-library module commonly used to fetch web pages

  url = "http://www.baidu.com"                # the website we want to get information from

  response = urllib.request.urlopen(url)      # send the request; the server's response carries the page we asked for

  html = response.read()                      # read the response body, which is returned as a byte string

  codeOfHtml = html.decode('utf-8')           # decode the bytes into text (this assumes the page is UTF-8 encoded)

  print(codeOfHtml)                           # print the page source

4. Run the demo.py script, e.g. by typing "python demo.py" in the terminal from the folder that contains the file.
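The demo.py script assumes that the request always succeeds and that the page is always UTF-8. A slightly more defensive variant might look like the sketch below. The function names fetch_page and decode_body, the 10-second timeout, and the UTF-8 fallback are all assumptions made for illustration, not part of the original script.

```python
import urllib.request


def decode_body(raw, charset=None, fallback="utf-8"):
    """Decode response bytes, preferring the charset the server declared."""
    # errors="replace" swaps undecodable bytes for U+FFFD instead of raising.
    try:
        return raw.decode(charset or fallback, errors="replace")
    except LookupError:
        # The server named an encoding Python does not know; use the fallback.
        return raw.decode(fallback, errors="replace")


def fetch_page(url, timeout=10):
    """Return the page at url as text, or None if the request fails."""
    try:
        # The with-block closes the connection once the body has been read.
        with urllib.request.urlopen(url, timeout=timeout) as response:
            raw = response.read()
            # Charset from the Content-Type header, or None if it is absent.
            charset = response.headers.get_content_charset()
    except OSError as err:
        # URLError, HTTPError and socket timeouts are all OSError subclasses.
        print("request failed:", err)
        return None
    return decode_body(raw, charset)
```

Calling fetch_page("http://www.baidu.com") and printing the result should behave like demo.py when the site is reachable, and print a short error message instead of a traceback when it is not.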

posted on 2017-04-18 22:57  Daimon.gu

Reposted from: https://www.cnblogs.com/yongdaiblog-201409/p/6731056.html
