Scraping JD.com Product Names and Prices with Python and BeautifulSoup

scharging

Tags: python beautiful

Article link: https://blog.csdn.net/hi_hpi/article/details/46826089

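This article shows how to use Python's BeautifulSoup library to scrape product names and their prices from JD.com search results, automating the data collection. The script below prints the product names.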

Use BeautifulSoup to scrape the names and prices of JD.com products:

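# coding=utf-8
# Requires Python 3 and the beautifulsoup4 package.

from urllib.parse import quote
from urllib.request import urlopen

from bs4 import BeautifulSoup

product = "ThinkPad i5"

# URL-encode the keyword so the space in it does not break the request.
url = "http://search.jd.com/Search?keyword=" + quote(product) + "&enc=utf-8"
text = urlopen(url).read()
soup = BeautifulSoup(text, "html.parser")  # name the parser explicitly

# Each search result sits in a div with class "lh-wrap".
content = soup.find_all('div', attrs={'class': 'lh-wrap'})
for wrap in content:
    # The product title is the link inside the "p-name" div.
    name_tags = wrap.find_all('div', attrs={'class': 'p-name'})
    for name in name_tags:
        link = name.find('a')
        if link is not None:
            print('Product: ' + link.get_text().strip())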
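
The script above prints only product names. As a minimal sketch of how name and price could be collected together, the example below parses a small, made-up HTML fragment instead of the live site. The `p-price` class, the `SAMPLE_HTML` fragment, and the `extract_products` helper are assumptions for illustration, not something confirmed by the original post. If the site fills in prices with JavaScript after the page loads, the raw HTML will not contain them and the price will come back as `None`.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; the structure of the live page may differ.
SAMPLE_HTML = """
<div class="lh-wrap">
  <div class="p-name"><a href="#">  ThinkPad E450 i5  </a></div>
  <div class="p-price"><strong>4999.00</strong></div>
</div>
<div class="lh-wrap">
  <div class="p-name"><a href="#">ThinkPad X250 i5</a></div>
</div>
"""


def extract_products(html):
    """Return a list of (name, price) tuples; price is None when absent."""
    soup = BeautifulSoup(html, "html.parser")
    products = []
    for wrap in soup.find_all("div", attrs={"class": "lh-wrap"}):
        name_tag = wrap.find("div", attrs={"class": "p-name"})
        link = name_tag.find("a") if name_tag else None
        if link is None:
            continue  # skip entries without a product link
        # "p-price" is an assumed class name for the price container.
        price_tag = wrap.find("div", attrs={"class": "p-price"})
        price = price_tag.get_text(strip=True) if price_tag else None
        # Treat an empty price container the same as a missing one.
        products.append((link.get_text(strip=True), price or None))
    return products


if __name__ == "__main__":
    for name, price in extract_products(SAMPLE_HTML):
        print("Product: {} | Price: {}".format(name, price))
```

Keeping the parsing in a function that takes an HTML string makes it possible to check the selectors against a saved copy of the page before pointing the code at the network.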