"How hard can it be? Estimating the difficulty of visual search in an image" and PPT

"How hard can it be? Estimating the difficulty of visual search in an image" was published at CVPR 2016. My advisor recommended this paper. Note that its model is relatively simple; what is genuinely novel is the idea behind it.


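The authors' main contribution is a method for measuring how difficult visual search is in a given image. When people search visually, they first look over the image, pick out the salient subjects with distinctive features, and then look for images similar to them. In other words, they abstract the image first and search second. Most current research focuses on improving search accuracy, but the authors instead ask which images are easy to search and which are hard. As an analogy: given a rope, one person says it is 1 meter long and another says 2 meters. To settle it, we need a ruler. The authors' method is that ruler. It addresses a measurement problem, much like Qin Shi Huang standardizing weights and measures, so that everyone can measure against the same standard from now on. That is why this counts as pioneering work.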

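Pioneering work usually means collecting your own data, so in the first part the authors put a great deal of effort into annotation and propose using human response time as the label. They then show that an image's search difficulty increases with human response time, which justifies using time as the label. Next, from the perspective of human cognition, they analyze seven factors that affect how hard an image is to search and assess the influence of each. After that they present their model, CNN + ν-SVR, and the way the image difficulty score is computed: the geometric mean of all response times recorded for the image (the n-th root of the product of the n times). Compared with the baselines, the model achieves better results. Finally, the authors argue that the model is also useful for weakly supervised object localization and semi-supervised object classification, and support this with experiments. For a detailed walkthrough, see my other post: http://blog.csdn.net/helloeveryon/article/details/51893251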
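To make the difficulty score concrete, here is a minimal sketch, assuming the score is the geometric mean of the response times, i.e. the n-th root of the product of n times. The function name `difficulty_score` and the sample times are hypothetical and only for illustration. Any preprocessing or normalization of the raw times that the paper may apply is left out.

```python
import math


def difficulty_score(response_times):
    """Geometric mean of positive response times (in seconds)."""
    if not response_times:
        raise ValueError("need at least one response time")
    if any(t <= 0 for t in response_times):
        raise ValueError("response times must be positive")
    # Averaging the logs and exponentiating equals the n-th root of the
    # product, but avoids overflow when there are many annotations.
    log_sum = sum(math.log(t) for t in response_times)
    return math.exp(log_sum / len(response_times))


# Hypothetical response times (seconds) collected for two images.
easy_image = [1.0, 2.0, 4.0]
hard_image = [4.0, 8.0, 16.0]

print(difficulty_score(easy_image))
print(difficulty_score(hard_image))
```

An image that takes annotators longer to answer gets a higher score. Compared with the arithmetic mean, the geometric mean is less sensitive to a single very slow response.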

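The model is simple, but the line of thinking is worth learning from. Since the blog does not allow PPT uploads, I have pasted the slides as images instead. If you need the PPT, leave a comment.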

