2020 MCM Problem C “A Wealth of Data” 题目及翻译 美国大学生数学建模 美赛

2020 MCM Problem C “A Wealth of Data” 题目及翻译 美国大学生数学建模 美赛

题目官网

https://www.comap.com/undergraduate/contests/mcm/contests/2020/problems/2020_MCM_Problem_C.pdf

题目

Problem C: “A Wealth of Data”

In the online marketplace it created, Amazon provides customers with an opportunity to rate and review purchases. Individual ratings -called “star ratings” –allow purchasers to express their level of satisfaction with a product using a scale of 1 (low rated, low satisfaction) to 5 (highly rated, high satisfaction). Additionally, customers can submit text-based messages –called “reviews” –that express further opinions and information about the product. Other customers can submit ratings on these reviews as being helpful or not –called a “helpfulness rating” –towards assisting their own product purchasing decision. Companies use these data to gain insights into the markets in which they participate, the timing of that participation, and the potential success of product design feature choices.

Sunshine Company is planning to introduce and sell three new products in the online marketplace: a microwave oven, a baby pacifier, and a hair dryer. They have hired your team as consultants to identify key patterns, relationships, measures, and parameters in past customer-supplied ratings and reviews associated with other competing products to 1) inform their online sales strategy and 2) identify potentially important design features that would enhance product desirability. Sunshine Company has used data to inform sales strategies in the past, but they have not previously used this particular combination and type of data. Of particular interest to Sunshine Company are time-based patterns in these data, and whether they interact in ways that will help the company craft successful products.

To assist you, Sunshine’s data center has provided you with three data files for this project: hair_dryer.tsv, microwave.tsv, and pacifier.tsv. These data represent customer-supplied ratings and reviews for microwave ovens, baby pacifiers, and hair dryers sold in the Amazon marketplace over the time period(s) indicated in the data. A glossary of data label definitions is provided as well. THE DATA FILES PROVIDED CONTAIN THE ONLY DATA YOU SHOULD USE FOR THIS PROBLEM.

Requirements
  1. Analyze the three product data sets provided to identify, describe, and support with mathematical evidence, meaningful quantitative and/or qualitative patterns, relationships, measures, and parameters within and between star ratings, reviews, and helpfulness ratings that will help Sunshine Company succeed in their three new online marketplace product offerings.

  2. Use your analysis to address the following specific questions and requests from the Sunshine Company Marketing Director:
    a. Identify data measures based on ratings and reviews that are most informative for Sunshine Company to track, once their three products are placed on sale in the online marketplace.
    b. Identify and discuss time-based measures and patterns within each data set that might suggest that a product’s reputation is increasing or decreasing in the online marketplace.
    c. Determine combinations of text-based measure(s) and ratings-based measures that best indicate a potentially successful or failing product.
    d. Do specific star ratings incite more reviews? For example, are customers more likely to write some type of review after seeing a series of low star ratings?
    e. Are specific quality descriptors of text-based reviews such as ‘enthusiastic’, ‘disappointed’, and others, strongly associated with rating levels?

  3. Write a one-to two-page letter to the Marketing Director of Sunshine Company summarizing your team’s analysis and results. Include specific justification(s) for the result that your team most confidently recommends to the Marketing Director.

Your submission should consist of:

  • One-page Summary Sheet
  • Table of Contents
  • One-to Two-page Letter
  • Your solution of no more than 20 pages, for a maximum of 24 pages with your summary sheet, table of contents, and two-page letter.

Note: Reference List and any appendices do not count toward the page limit and should appear after your completed solution. You should not make use of unauthorized images and materials whose use is restricted by copyright laws. Ensure you cite the sources for your ideas and the materials used in your report.

Glossary

Helpfulness Rating: an indication of how valuable a particular product review is when making a decision whether or not to purchase that product.
Pacifier: a rubber or plastic soothing device, often nipple shaped, given to a baby to suck or bite on.
Review: a written evaluation of a product.
Star Rating: a score given in a system that allows people to rate a product with a number of stars.

Attachments: The Problem Datasets:

The Problem DatasetsProblem_C_Data.zipThe three data sets provided contain product user ratings and reviews extracted from the Amazon Customer Reviews Dataset the Amazon Simple Storage Service (Amazon S3).

Data Set Definitions: Each row represents data partitioned into the following columns.

  • marketplace (string): 2 letter country code of the marketplace where the review was written.
  • customer_id (string): Random identifier that can be used to aggregate reviews written by a single author.
  • review_id (string): The unique ID of the review.
  • product_id (string): The unique Product ID the review pertains to.
  • product_parent (string): Random identifier that can be used to aggregate reviews for the same product.
  • product_title (string): Title of the product.
  • product_category(string): The major consumer category for the product.
  • star_rating (int): The 1-5 star rating of the review.
  • helpful_votes (int): Number of helpful votes.
  • total_votes (int): Number of total votes the review received.
  • vine (string): Customers are invited to become Amazon Vine Voices based on the trust that they have earned in the Amazon community for writing accurate and insightful reviews. Amazon provides Amazon Vine members with free copies of products that have been submitted to the program by vendors. Amazon doesn’t influence the opinions of Amazon Vine members, nor do they modify or edit reviews.
  • verified_purchase (string): A “Y” indicates Amazon verified that the person writing the review purchased the product at Amazon and didn’t receive the product at a deep discount.
  • review_headline (string): The title of the review.
  • review_body (string): The review text.
  • review_date (bigint): The date the review was written.

翻译

问题C:“数据中的财富”

在其创建的在线市场中,亚马逊为客户提供了对购买进行评分和评价的机会。个体评分-称为“星级评分” –购买者可以使用1(低评分,低满意度)到5(高评分,高满意度)的等级来表示他们对产品的满意度。此外,客户可以提交基于文本的消息(称为“评论”),以表达有关产品的更多意见和信息。其他客户可以针对这些评论提交有帮助或无帮助的评分(称为“帮助评分”),以协助他们自己的产品购买决策。公司使用这些数据来深入了解其参与的市场,参与的时间以及产品设计功能选择的潜在成功。

Sunshine 公司计划在在线市场上推出和销售三种新产品:微波炉,婴儿奶嘴和吹风机。他们已聘请您的团队作为顾问,以在与其他竞争产品相关的过去客户提供的评分和评论中识别关键模式,关系,度量和参数,以:
1)告知其在线销售策略;
2)识别潜在的重要设计功能,以增强产品的合意性。
Sunshine 公司过去曾使用数据为销售策略提供信息,但他们以前从未使用过这种特殊的组合和数据类型。 Sunshine 公司特别感兴趣的是这些数据中的基于时间的模式,以及它们是否以有助于该公司制造成功产品的方式进行交互。

为了给您提供帮助,Sunshine的数据中心为您提供了该项目的三个数据文件:hair_dryer.tsv,microwave.tsv 和 pacifier.tsv。这些数据代表在数据指示的时间段内,在亚马逊市场上出售的微波炉,婴儿奶嘴和吹风机的客户提供的评分和评论。还提供了数据标签定义的词汇表。提供的数据文件包含您应用于此问题的唯一数据。

要求

1.分析提供的三个产品数据集,以使用数学证据来识别,描述和支持有意义的定量和/或定性模式,关系,量度和参数,这些数据将在有助于评估阳光公司的星级,评论和帮助等级之内和之间在三个新的在线市场产品中都取得了成功。

2.使用您的分析来解决阳光公司市场总监的以下特定问题和要求:
\quad a. 通过评分及评论识别出一些有用的产品衡量标准供公司参考,来帮助公司以后商品上线后衡量产品情况。
\quad b. 对这三个数据分别识别及讨论与时间相关的能够表明某个产品的名声变化的衡量标准和模式。
\quad c. 结合基于文字(评论、标题等)的衡量标准和基于评分的衡量标准,该组合揭示一个产品是否趋向成功或失败。
\quad d. 判断是否存在某些人因为某些特定的评分而写出一些特定类型的评论?例如某些人看过一些低分的评价后作出某些类型的评论。
\quad e. 判断是否存在一些词组的描述与评分等级强相关?

3.写一两页的信给阳光公司的市场总监,总结您团队的分析和结果。包括针对您的团队最有信心地推荐给市场总监的结果的具体理由。

您提交的内容应包括

  • 一页摘要表
  • 目录
  • 一页到两页的信函
  • 您的解决方案不超过20页,最多包含摘要页,目录和两页信函的24页。

注意:参考列表和任何附录不计入页数限制,应在完成解决方案后出现。您不应使用未经版权法限制使用的未经授权的图像和材料。确保您引用了想法的来源和报告中使用的材料。

词汇表

Helpfulness Rating:表示在决定是否购买该产品时特定产品评论的价值。
Pacifier:一种橡胶或塑料的舒缓装置,通常为乳头状,提供给婴儿吸吮或咬。
Review:对产品的评估。
Star Rating:在系统中给出的分数,该分数使人们可以对具有多个星级的产品进行评分。

附件:问题数据集
Problem_C_Data.zip提供的三个数据集包含产品用户评分和从Amazon Simple Storage Service(Amazon S3)的Amazon客户评论数据集中提取的评论。

数据集定义:每行代表划分为以下几列的数据。

  • marketplace(字符串):撰写评论的市场的2个字母的国家/地区代码。
  • customer_id(字符串):随机标识符,可用于汇总单个作者撰写的评论。
  • review_id(字符串):评论的唯一ID。
  • product_id(字符串):评论所属的唯一产品ID。
  • product_parent(字符串):随机标识符,可用于汇总同一产品的评论。
  • product_title(字符串):产品的标题。
  • product_category(字符串):产品的主要消费者类别。
  • star_rating(int):评论的1-5星级。
  • helpful_votes(int):有用的投票数。
  • total_votes(int):评论获得的总票数。
  • vine(字符串):根据客户在Amazon社区中撰写准确而有见地的评论所获得的信任,邀请他们成为Amazon Vine Voices。亚马逊为Amazon Vine成员提供了供应商已提交给该程序的产品的免费副本。 Amazon不会影响Amazon Vine成员的意见,也不会修改或编辑评论。
  • verified_purchase(字符串):“ Y”表示亚马逊已验证撰写评论的人在亚马逊上购买了该产品,并且没有以大幅折扣收到该产品。
  • review_headline(字符串):评论的标题。
  • review_body(字符串):评论文本。
  • review_date(bigint):撰写评论的日期。

大佬对该题的见解:

如何评价2020 数学建模美赛 C 题?
https://www.zhihu.com/question/377089279/answer/1059132874

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值