MCM Problem C Overview

Source: http://www.comap.com/undergraduate/contests/index.html

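COMAP, the organizer of the Mathematical Contest in Modeling (MCM/ICM), has published a document titled “MCM Problem C Overview” that explains Problem C of the MCM. The main points of that document are summarized below, in the hope that they help those preparing for the 2017 MCM/ICM.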

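Problem C was newly added to the MCM in 2016. It is described as a Data Insights problem and focuses on mathematical models associated with data. Compared with earlier MCM problems, models from statistics, pattern classification, and related fields are therefore likely to play a larger role.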

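Problem C is a real-world problem involving data, so modeling may run into difficulties such as a fairly large data set (though not at big-data scale), mixed data types, and missing data. It is not, however, a big data problem: teams do not need to develop specialized computer-science-based data handling algorithms and analysis techniques, nor do they need access to high-performance computing platforms.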

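  • The data for the problem are publicly accessible.
  • Although this is not a big data problem, the compressed data files may exceed 100 MB, which is larger than the data in previous MCM problems. Before choosing this problem, consider whether your team can handle a data set of this size. As an aside: people often ask what to do when they cannot find data during the contest, yet when last year's Problem C actually supplied the data, some complained that they could not process it.
  • Besides the database files, the compressed archive may also contain a data dictionary, a data mapping file, and program code for creating value labels.
  • The data files will be provided in multiple formats, such as SAS, SPSS, STATA, and CSV.
  • Software such as Statistica, JMP, SAS, SPSS, Excel, R, and Matlab may be used, but no particular software is required. If specialized software or custom code is used in the contest, the team should understand the mathematics behind it.
  • Only the paper needs to be submitted; the database files do not.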
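
The difficulties mentioned above (a file too large to open comfortably, mixed data types, missing values) can be checked quickly before a team commits to Problem C. The following is a minimal sketch in Python, assuming the CSV version of the data is used. The function name `profile_csv`, the set of missing-value tokens, and the sample rows are illustrative assumptions and do not come from the COMAP document; the real tokens should be taken from the data dictionary shipped with the data.

```python
import csv
import io
from collections import defaultdict

# Tokens treated as "missing" (compared in upper case). This set is an
# assumption; the data dictionary defines the real codes.
MISSING_TOKENS = {"", "NA", "N/A", "NAN", "NULL", "."}


def _is_number(text):
    """Return True if text parses as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def profile_csv(stream):
    """Read a CSV stream row by row and summarize every column.

    The file is never loaded into memory as a whole, so the same code
    works for files of 100 MB or more.
    Returns {column: {"rows", "missing", "numeric", "text"}}.
    """
    reader = csv.DictReader(stream)
    stats = defaultdict(lambda: {"rows": 0, "missing": 0, "numeric": 0, "text": 0})
    for row in reader:
        for column, value in row.items():
            if column is None:  # extra cells beyond the header are ignored
                continue
            entry = stats[column]
            entry["rows"] += 1
            cell = (value or "").strip()
            if cell.upper() in MISSING_TOKENS:
                entry["missing"] += 1
            elif _is_number(cell):
                entry["numeric"] += 1
            else:
                entry["text"] += 1
    return dict(stats)


# Tiny made-up sample standing in for a contest data file.
sample = io.StringIO(
    "id,age,city\n"
    "1,34,Boston\n"
    "2,NA,Denver\n"
    "3,29,\n"
)
summary = profile_csv(sample)
for name, entry in summary.items():
    print(name, entry)
```

For a real file, the same function can be called on an open file handle, for example `with open(path, newline="", encoding="utf-8") as f: profile_csv(f)`, where `path` is whatever file the team downloaded. A column that reports both numeric and text cells is a hint that the data type is mixed or that an unlisted missing-value code is in use.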

Readers interested in Problem C can consult the original document.

Appendix: text of the original document:

The 2016 MCM introduces a new modeling challenge – Problem C - that is best described as Data Insights. Problem C is intended to focus on and amplify specific elements of mathematical modeling challenges associated with data. In this sense, techniques stemming from statistics and pattern classification will play a larger role in creating a mathematical model on this problem than in previous contests.
While not a ‘big data’ challenge in the sense of teams needing to develop specialized computer science-based data handling algorithms and analysis techniques or have access to high performance computing platforms, the problem will provide teams with an opportunity to encounter real-world, challenging data that have interesting characteristics. Naturally occurring complicating factors such as data set size (but not big data), blend of data types, breadth of representation in data elements, cross-discipline sources, time series dependencies, censored or missing data, and others could present themselves depending on the specifics of the modeling problem.
MCM Problem C: Data Insights
  • Teams will be given access to database files that will be made available from a public website.
  • The database files will be compressed for size but the file size could still be 100mbs or more and teams should take this into consideration prior to choosing Problem C.
  • Each zipped file may include the database files along with the data dictionary, data mapping file, and program code to create value labels.
  • The database will be made available in multiple formats SAS, SPSS, STATA and CSV.
  • Software such as Statistica, JMP, SAS, SPSS, Excel, R, Matlab or other applications may be used to aid in your solution but no one particular piece of software is endorsed or required. If specialized software or custom code is used to support the contest effort, teams should take care to clearly communicate an understanding of the mathematics and assumptions applied via tools and algorithms in the software.
  • When submitting your final electronic solution you are NOT required to submit back the database file or any data for that matter. The only thing that should be submitted is your electronic (word or PDF) solution.
