selenium

最新推荐文章于 2024-07-14 20:43:20 发布

GoldenFong

最新推荐文章于 2024-07-14 20:43:20 发布

阅读量99

点赞数

分类专栏：爬虫文章标签：爬虫

本文链接：https://blog.csdn.net/weixin_50248555/article/details/121125853

版权

爬虫专栏收录该内容

5 篇文章 1 订阅

订阅专栏

#导入模块
from selenium import webdriver
#制定网址
url = 'https://www.taobao.com'
#打开浏览器，指定为chrome浏览器，chromedrive是
drive = webdriver.Chrome(r"C:\Program Files\Google\Chrome\Application\chromedriver.exe")
#加载网页
drive.get(url)
#目标获取手机名称、价格、月销量、评论数
#获取所有商品的链接,单数的element是获取第一个，复数是获取所有
pros = drive.find_elements_by_xpath('//div[@class="row row-2 title"]/a')
len(pros)
pros[0].click()
#操作对象切换到打开的页面
drive.switch_to.window(drive.window_handles[1])
#商品名称
title = drive.find_element_by_xpath('//h1[@data-spm="1000983"]').text
#价格
price = drive.find_element_by_xpath('//div[@class="tm-promo-price"]').text
#销量
mcount = drive.find_element_by_xpath('//span[@class="tm-count"]').text
#人气
renqi = drive.find_element_by_xpath('//span[@id="J_CollectCount"]').text
#关闭页面
drive.close()
#页面切换
drive.switch_to.window(drive.window_handles[0])
#先爬取三个商品
#存储
titles = []
prices = []
mcounts = []
renqis = []
for i in pros[:3]:
    i.click()
    drive.switch_to.window(drive.window_handles[1])
    # 商品名称
    title = drive.find_element_by_xpath('//h1[@data-spm="1000983"]').text
    print(title)
    # 价格
    price = drive.find_element_by_xpath('//div[@class="tm-promo-price"]').text
    print(price)
    # 销量
    mcount = drive.find_element_by_xpath('//span[@class="tm-count"]').text
    print(mcount)
    # 人气
    renqi = drive.find_element_by_xpath('//span[@id="J_CollectCount"]').text
    print(renqi)
    print('===============================')
    #存储
    titles.append(title)
    prices.append(price)
    mcounts.append(mcount)
    renqis.append(renqi)
    drive.close()
    drive.switch_to.window(drive.window_handles[0])

import pandas as pd
data = pd.DataFrame()
data['名称'] = titles
data['价格'] = price
data
data.to_excel('淘宝商品数据.xlsx')

GoldenFong

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
selenium

#导入模块from selenium import webdriver#制定网址url = 'https://www.taobao.com'#打开浏览器，指定为chrome浏览器，chromedrive是drive = webdriver.Chrome(r"C:\Program Files\Google\Chrome\Application\chromedriver.exe")#加载网页drive.get(url)#目标获取手机名称、价格、月销量、评论数#获取所有商品的链接,单数的ele.
复制链接

扫一扫