爬虫小程序-01

爬虫小程序之01
1.py:python 3x运行测试后的正确代码

#!user/bin/env python
# -*- coding:gbk -*-
# import importlib
# importlib.reload(sys)
import re
import urllib.request,os
import pymysql
from bs4 import BeautifulSoup

url1="http://www.doyouhike.net/dest/hongkongtamendao-camping"

# 赋一个URL
fp = urllib.request.urlopen(url1) #打开此URL
s = fp.read() #把上面操作的结果读取出来赋值给S
soup = BeautifulSoup(s) # 用BeautifulSoup分析S
polist = soup.findAll('span') # 找到所有tag <span>的内容

print (polist[0].contents[0]) # 打印出第一个tag <span>中间的内容

运行结果:

D:\SparkCollection\NetWorkSpark>python 01.py
C:\Users\Administrator\AppData\Local\Programs\Python\Python35\lib\site-packages
bs4\__init__.py:181: UserWarning: No parser was explicitly specified, so I'm us
ng the best available HTML parser for this system ("lxml"). This usually isn't
problem, but if you run this code on another system, or in a different virtual
environment, it may use a different parser and behave differently.

The code that caused this warning is on line 15 of the file 01.py. To get rid o
this warning, change code that looks like this:

BeautifulSoup(YOUR_MARKUP})

to this:

BeautifulSoup(YOUR_MARKUP, "lxml")

markup_type=markup_type))
Traceback (most recent call last):
File "01.py", line 18, in <module>
print (polist[0]/contents[0]) # 打印出第一个tag <span>中间的内容
NameError: name 'contents' is not defined

D:\SparkCollection\NetWorkSpark>python 01.py
C:\Users\Administrator\AppData\Local\Programs\Python\Python35\lib\site-packages
bs4\__init__.py:181: UserWarning: No parser was explicitly specified, so I'm us
ng the best available HTML parser for this system ("lxml"). This usually isn't
problem, but if you run this code on another system, or in a different virtual
environment, it may use a different parser and behave differently.

The code that caused this warning is on line 15 of the file 01.py. To get rid o
this warning, change code that looks like this:

BeautifulSoup(YOUR_MARKUP})

to this:

BeautifulSoup(YOUR_MARKUP, "lxml")

markup_type=markup_type))
目的地


【附注】:该代码是参照书本,书本源代码的风格是python 3x版本以下的,但是我是python 3.5x版本,所以遇到一些问题,同时做了一些修改

详情可查看:
python 3x运行python 2x的代码可能遇到的问题 - 个人文章 - SegmentFault

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值