


by Hiren Patel

An introduction to web scraping using R

With the e-commerce boom, businesses have gone online, and customers look for products online too. Unlike in an offline marketplace, an online customer can compare the price of a product across different sellers in real time.

Competitive pricing has therefore become a crucial part of business strategy.


To keep the prices of your products competitive and attractive, you need to monitor the prices set by your competitors. If you know your competitors’ pricing strategy, you can align your own to get an edge over them.

Hence, price monitoring has become a vital part of the process of running an e-commerce business.


You might wonder how to get hold of the data to compare prices.


The top 3 ways of getting the data you need for price comparison

1. Feeds from Merchants

As you might be aware, there are several price comparison sites on the internet. These sites have agreements with businesses under which they receive product data directly from them and use it for price comparison.

These businesses set up an API or use FTP to provide the data. A referral commission is generally what makes a price comparison site financially viable.

2. Product feeds from third-party APIs

On the other hand, there are services which offer e-commerce data through an API. When such a service is used, the third party pays for the volume of data.

3. Web Scraping

Web scraping is one of the most robust and reliable ways of getting data from the web. It is increasingly used in price intelligence because it is an efficient way of getting product data from e-commerce sites.

You may not have access to the first and second options. In that case, web scraping can come to your rescue. You can use it to leverage the power of data and arrive at competitive pricing for your business.

Web scraping can be used to get current prices, both for a specific market and for e-commerce more generally. We will use web scraping to get the data from an e-commerce site. In this blog, you will learn how to scrape the names and prices of products from Amazon in all categories, under a particular brand.

Extracting data from Amazon periodically can help you keep track of the market trends of pricing and enable you to set your prices accordingly.


Table of contents

  1. Web scraping for price comparison
  2. Web scraping in R
  3. Implementation
  4. End note


1. Web scraping for price comparison

As market wisdom has it, price is everything. Customers make their purchase decisions based on price, and they also judge the quality of a product by its price. In short, price is what drives customers and, hence, the market.

Price comparison sites are therefore in great demand. Customers can easily survey the whole market by looking at the prices of the same product across brands. These websites extract the price of the same product from different sites.
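
To make the comparison step concrete, here is a minimal sketch in base R of what happens once prices for the same product have been collected from several sites. The site names, the prices, and the helper `parse_price` are all hypothetical and made up for illustration; a real scraper would fill the `listings` data frame from the pages it visits.

```r
# Hypothetical listings for one product, as a scraper might return them.
# Site names and prices are invented for illustration only.
listings <- data.frame(
  site  = c("shop-a.example", "shop-b.example", "shop-c.example"),
  price = c("$1,299.00", "$1,249.50", "$1,310.00"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: drop currency symbols and thousands separators,
# then convert the remaining digits and decimal point to a number.
parse_price <- function(x) as.numeric(gsub("[^0-9.]", "", x))

listings$price_num <- parse_price(listings$price)

# Sort by numeric price so the cheapest offer comes first.
ranked   <- listings[order(listings$price_num), ]
cheapest <- ranked[1, ]

print(ranked)
```

Scraped prices usually arrive as text, so converting them to numbers before sorting matters: sorting the raw strings would order them alphabetically rather than by value.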

Along with price, price comparison websites also scrape data such as the product description, techn







