Reading notes: The three R’s of computer vision: Recognition, reconstruction and reorganization

The three R’s of CV: recognition, reconstruction, reorganization

Recognition: finding objects in an image

Reconstruction: building 3D structure from images

Reorganization:

instead of the classical separation of vision into low level, mid level and high level vision, it is more fruitful to think of vision as resulting from the interaction of three processes: recognition, reconstruction and reorganization which operate in tandem, and where each provides input to the others and fruitfully exploits their output.

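Compared with the traditional split of vision into low, mid and high levels, a more productive view is to treat vision as the interaction of three processes: recognition, reconstruction and reorganization. The three work together, each providing input to the others and making use of their output.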

Note that the emphasis of this paper is on the relationship between the 3R’s of vision, which is somewhat independent of the (very important) choice of features needed to implement particular algorithms.

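The paper focuses on the relationship between the 3R’s, which is to some extent independent of the choice of features needed to implement particular algorithms (?).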

Reorganization → recognition: reorganization helps recognition

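Object detection: the traditional approach is the sliding window, which requires all objects to share the same aspect ratio (this can be addressed with mixture models). An alternative, as in R-CNN, first generates a set of candidate image regions and then filters those regions to find the objects.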
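The contrast between the two detection styles can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the image size, the hand-written proposal list, and `toy_score` are hypothetical stand-ins, not R-CNN's actual proposal generator or classifier. The point is that a sliding window only ever emits boxes of one shape, while a proposal-based pipeline accepts boxes of any shape and filters them with a scorer.

```python
from typing import Callable, Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


def sliding_windows(img_w: int, img_h: int, win_w: int, win_h: int,
                    stride: int) -> Iterator[Box]:
    """Yield every window of one fixed size, i.e. one fixed aspect ratio."""
    for y in range(0, img_h - win_h + 1, stride):
        for x in range(0, img_w - win_w + 1, stride):
            yield (x, y, win_w, win_h)


def filter_proposals(proposals: List[Box],
                     score_fn: Callable[[Box], float],
                     threshold: float) -> List[Box]:
    """Keep the candidate regions whose score reaches the threshold."""
    return [box for box in proposals if score_fn(box) >= threshold]


def toy_score(box: Box) -> float:
    """Hypothetical stand-in for a classifier: prefers tall boxes."""
    _, _, w, h = box
    return h / (w + h)


# Sliding window: every box has the same 4x2 shape.
windows = list(sliding_windows(img_w=8, img_h=6, win_w=4, win_h=2, stride=2))

# Proposal-based: boxes may have any shape; the scorer decides which to keep.
proposals = [(0, 0, 4, 2), (1, 1, 2, 6), (3, 0, 3, 3)]
kept = filter_proposals(proposals, toy_score, threshold=0.5)
```

Covering objects of other shapes with the sliding window would mean rerunning `sliding_windows` once per aspect ratio, which is the limitation the note mentions; the proposal list has no such restriction.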

R-CNNs scale very well with the number of
