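Approach:
Fetch each product's basic information (name, price, detail-page link)
Combine the information and write it to Excel with the pandas library
Tidy up the Excel file and export the final result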
Implementation:
Requesting the page:
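import requests

# Read the search keyword from the user
kw = input("Some things:")
url = 'https://search.jd.com/Search'
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) "
                  "Chrome/94.0.4606.61 Safari/537.36",
}  # Without this header, the request is redirected to the login page
# Pass the keyword via params so requests URL-encodes it; the timeout keeps the call from hanging
resp = requests.get(url=url, params={"keyword": kw}, headers=headers, timeout=10)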
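Because a missing header leads to a redirect to the login page, it can help to check where the request actually ended up before parsing anything. The following is a minimal sketch, not part of the original script: the helper names `build_search_url` and `is_login_redirect` are hypothetical, and the login host `passport.jd.com` is an assumption that should be confirmed against the redirect you observe. It uses only the standard library, so it can be tried without sending a request.

```python
from urllib.parse import urlencode, urlsplit

SEARCH_URL = "https://search.jd.com/Search"
# Assumption: JD serves its login pages from this host.
LOGIN_HOST = "passport.jd.com"


def build_search_url(keyword: str) -> str:
    """Return the search URL with the keyword percent-encoded."""
    query = urlencode({"keyword": keyword})
    return f"{SEARCH_URL}?{query}"


def is_login_redirect(final_url: str) -> bool:
    """Return True if the final URL (after redirects) is on the login host."""
    return urlsplit(final_url).hostname == LOGIN_HOST


# Usage with the response obtained above. requests follows redirects by
# default, so resp.url holds the final address:
#     if is_login_redirect(resp.url):
#         raise RuntimeError("Redirected to login; check the User-Agent header")
```

Checking the hostname rather than searching the page text keeps the test independent of the page's markup, which may change.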
Basic markup of a JD product item:
<li data-sku="100026667884" data-spu="100026667884" ware-type="10" class="gl-item">
<div class="gl-i-wrap">
<div class="p-img">
<a target="_blank" title="性能全面升级,先进双摄系统,续航大提升!选购[1年期官方AppleCare+版]限时抢服务5折券,尽享超值原厂服务,放心用机,无惧意外!查看" href="//item.jd.com/100026667884.html" onclick="searchlog(1, '100026667884','1','2','','flagsClk=230477530');">
<img width="220" height="220" data-img="1" data-lazy-img="done" source-data-lazy-img="" src="//img14.360buyimg.com/n7/jfs/t1/109384/3/17626/212571/614be585Eb8f44b8c/be8ecd190a639396.jpg">
</a>
<div data-lease="" data-catid="655" data-venid="1000000127" data-presale="0" data-done="1"></div>
</div>
<div class="p-price">
<strong class="J_100026667884" data-presale="0" data-done="1" stock-done="1">
<em>¥</em><i data-price="100026667884">6799.00</i>
</strong>
</div>
<div class="p-name p-name-type-2">
<a target="_blank" title="性能全面升级,先进双摄系统,续航大提升!选购[1年期官方AppleCare+版]限时抢服务5折券,尽享超值原厂服务,放心用机,无惧意外!查看" href="//item.jd.com/100026667884.html" onclick="searchlog(1, '100026667884','1','1','','flagsClk=2