Literature Review: MCM Modeling

Literature Review: MCM Modeling, Part 1

Trajectory Tracking of Asian Giant Hornets Based on SVM and BA-SVM Algorithm

Based on 2021 MCM Problem C: Confirming the Buzz about Hornets (determining whether a reported sighting is the pest or not)

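Type: big-data analysis problem

Big-data analysis problems come in three types: character data, text data, and image/video data.

Problem Restatement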

In September 2019, a colony of Vespa mandarinia (also known as the Asian giant hornet) was discovered on Vancouver Island in British Columbia, Canada. The nest was quickly destroyed, but the news of the event spread rapidly throughout the area. Since that time, several confirmed sightings of the pest have occurred in neighboring Washington State, as well as a multitude of mistaken sightings. See Figure 1 below for a map of detections, hornet watches, and public sightings.

[Figure 1: map of detections, hornet watches, and public sightings]

Vespa mandarinia is the largest species of hornet in the world, and the occurrence of the nest was alarming. Additionally, the giant hornet is a predator of European honeybees, invading and destroying their nests. A small number of the hornets are capable of destroying a whole colony of European honeybees in a short time. At the same time, they are voracious predators of other insects that are considered agricultural pests.

The life cycle of this hornet is similar to many other wasps. Fertilized queens emerge in the spring and begin a new colony. In the fall, new queens leave the nest and will spend the winter in the soil waiting for the spring. A new queen has a range estimated at 30 km for establishing her nest. More detailed information on Asian hornets is included in the problem attachments and can also be found online.

Due to the potential severe impact on local honeybee populations, the presence of Vespa mandarinia can cause a good deal of anxiety. The State of Washington has created helplines and a website for people to report sightings of these hornets. Based on these reports from the public, the state must decide how to prioritize its limited resources to follow up with additional investigation. While some reports have been determined to be Vespa mandarinia, many other sightings have turned out to be other types of insects.

The primary questions for this problem are "How can we interpret the data provided by the public reports?" and "What strategies can we use to prioritize these public reports for additional investigation given the limited resources of government agencies?"

Problems:

Your paper should explore and address the following aspects:

1. Address and discuss whether or not the spread of this pest over time can be predicted, and with what level of precision.

Prediction model (discuss how hornet occurrences change over time; spatial features can also be brought into the analysis)

2. Most reported sightings mistake other hornets for the Vespa mandarinia. Use only the data set file provided, and (possibly) the image files provided, to create, analyze, and discuss a model that predicts the likelihood of a mistaken classification.

Classification model (the paper I am reviewing uses SVM for classification; put real effort into the model analysis, and analyze metrics such as recall in more depth)

3. Use your model to discuss how your classification analyses leads to prioritizing investigation of the reports most likely to be positive sightings.

Evaluation (give the results for the unlabeled samples and evaluate metrics such as model accuracy, to support further analysis)

4. Address how you could update your model given additional new reports over time, and how often the updates should occur.

Model optimization: update rules for parameters such as the learning rate, and how often the dataset should be refreshed

5. Using your model, what would constitute evidence that the pest has been eradicated in Washington State?

My own criterion for deciding that the hornet population has dropped to within the range that counts as eradicated
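
The note under question 2 asks for a closer look at metrics such as recall. As a minimal sketch, assuming binary labels where 1 means a confirmed Vespa mandarinia sighting and 0 means a mistaken one (the label encoding, function name, and sample vectors are illustrative and not taken from the paper), precision and recall for the positive class can be computed from the confusion counts:

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall) for the positive class (label 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Made-up labels for illustration only.
y_true = [1, 0, 0, 1, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 0, 0, 0, 1]
precision, recall = precision_recall(y_true, y_pred)
```

Since most reports turn out to be mistaken sightings, recall on the positive class is likely to say more about the model than overall accuracy does.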

Finally, your report should include a two-page memorandum that summarizes your results for the Washington State Department of Agriculture.

Your PDF solution of no more than 25 total pages should include:
· One-page Summary Sheet.
· Table of Contents.
· Your complete solution.
· Two-page Article.
· References list.

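Problem Analysis

1. Find the indicators for identifying the Asian giant hornet (indicator selection).

2. Turn the indicators and images into data and build the model (theoretical modeling, supervised training; for images, use a convolutional neural network or the YOLOv5 algorithm).

3. Feed in the unlabeled test set and predict results.

4. As more result data comes in, that is, as the training set grows, how often should the model be updated, and how (model improvement)?

5. State the model's classification accuracy. Based on my model, if every hornet that is found is destroyed, when can the hornets be eradicated?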
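
Step 3 ties back to question 3 of the problem: once the unlabeled reports have scores, the most likely positives get investigated first. A minimal sketch, assuming each report already carries a model score between 0 and 1 (the report IDs, scores, and the `budget` parameter are made-up placeholders, not real data or anything from the paper):

```python
def prioritize(reports, budget):
    """Return the IDs of the `budget` highest-scoring reports, best first."""
    ranked = sorted(reports, key=lambda r: r["score"], reverse=True)
    return [r["id"] for r in ranked[:budget]]


# Hypothetical scored reports; a higher score means more likely a true sighting.
reports = [
    {"id": "A", "score": 0.12},
    {"id": "B", "score": 0.87},
    {"id": "C", "score": 0.45},
    {"id": "D", "score": 0.91},
]
top = prioritize(reports, budget=2)
```

Here `budget` stands in for the limited number of follow-up investigations the agency can afford.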

Notes from an Experience-Sharing Talk: 2021 Problem C, F Award (Finalist)

Source: a talk shared on Bilibili from Southwest Jiaotong University (西南交通大学钱学院辅)

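Data Collection (getting a CSV of the Washington map)

Take a map in JPG format, threshold-segment it by color, read out the result, and save it as a CSV file.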
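
The color thresholding step can be sketched roughly as follows. This is only an illustration under stated assumptions: the image is taken to be already decoded into rows of (R, G, B) tuples (decoding the JPG with an image library is left out), and the target color and tolerance are placeholder values. A tolerance is used rather than an exact match because JPG compression smears colors slightly.

```python
import csv
import io


def color_mask(pixels, target, tol):
    """Return a 0/1 grid: 1 where every RGB channel is within `tol` of `target`."""
    return [
        [int(all(abs(c - t) <= tol for c, t in zip(px, target))) for px in row]
        for row in pixels
    ]


def mask_to_csv(mask):
    """Serialize the 0/1 grid as CSV text, one image row per line."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(mask)
    return buf.getvalue()


# Tiny 2x3 stand-in for a decoded map image (placeholder pixel values).
pixels = [
    [(250, 10, 12), (0, 0, 255), (245, 5, 0)],
    [(10, 200, 10), (255, 0, 0), (128, 128, 128)],
]
mask = color_mask(pixels, target=(255, 0, 0), tol=20)
csv_text = mask_to_csv(mask)
```

Writing `csv_text` to a file gives a grid that can be loaded back as a table, with 1 marking the pixels that matched the chosen map color.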


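Data Analysis (the table provided with the problem)

Analysis pipeline: data collection -> data cleaning -> data analysis -> data interpretation -> data visualization

Dataset characteristics: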

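detection date: sighting report, submission time
notes: sighting report, free-text description
lab status: class label
lab comments: lab description, free text
submission date: lab test time
latitude, longitude: recorded coordinates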

Check the data for anomalies or missing values and fill them in (by interpolation or regression).
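
As one way to do the interpolation option, here is a minimal sketch that fills gaps in a numeric series by linear interpolation between the nearest known neighbors. The function name and the sample values are placeholders, and missing entries are assumed to be represented as `None`:

```python
def interpolate_missing(values):
    """Fill None entries by linear interpolation between known neighbors.

    Gaps at the start or end are filled with the nearest known value.
    """
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("no known values to interpolate from")
    filled = list(values)
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            w = (i - left) / (right - left)  # position between the neighbors
            filled[i] = values[left] + w * (values[right] - values[left])
    return filled


# Placeholder series with gaps.
series = [48.0, None, None, 49.5, None]
filled = interpolate_missing(series)
```

Linear interpolation only makes sense when the rows have a meaningful order, for example when sorted by date; otherwise a regression on other fields may be the better fit.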

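Drop the notes field (it is very noisy and would cause trouble for the model).

Preprocess the semantic data (one-hot encoding, discretization, feature extraction with an NLP network).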
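
One-hot encoding of a categorical field such as lab status can be sketched like this. The category strings below are assumed example values, not copied from the dataset, and the function name is illustrative:

```python
def one_hot(labels):
    """Return (categories, rows); each row is the one-hot vector of one label."""
    categories = sorted(set(labels))
    index = {c: i for i, c in enumerate(categories)}
    rows = []
    for label in labels:
        row = [0] * len(categories)
        row[index[label]] = 1
        rows.append(row)
    return categories, rows


# Assumed example values for the lab status column.
lab_status = ["Negative ID", "Positive ID", "Unverified", "Negative ID"]
categories, encoded = one_hot(lab_status)
```

Each row has exactly one 1, in the column of its category, so no artificial ordering is imposed on the classes.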

A nicely laid-out block diagram of the structure:

[Figure: block diagram]

I also stumbled on a good path for getting started with pandas (see the first answer):

https://www.zhihu.com/question/439115857

More to come tomorrow, after I read the outstanding papers.
