Image Classification, Object Detection, Semantic Segmentation, and Instance Segmentation: Connections and Differences


鸟恋旧林XD


Article link: https://blog.csdn.net/niaolianjiulin/article/details/52948274


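Since mid-October my research has shifted to "Object Segment", that is, object segmentation. This belongs to the field of image understanding, which covers a number of specific problems such as image classification, object detection, object segmentation, and instance segmentation. What is the scope of each problem? Put differently, what does each problem produce as its result for a given image? My notes are below.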

Image Classification

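The task of object classification requires binary labels indicating whether objects are present in an image.[1] Image classification: this task asks us to label which objects appear in an image. For example, with 1000 object classes in total, each class is either present in the image or absent. In practice: given a test image, output the candidate set of object classes it contains.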
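To make the "present or absent" output concrete, here is a minimal sketch. It assumes some classifier has already produced a per-class score; the scores, the 0.5 threshold, and the function name `present_classes` are all hypothetical and only illustrate the shape of the result.

```python
def present_classes(scores, threshold=0.5):
    """Turn per-class scores into binary presence labels and a candidate set.

    `scores` maps a class name to a confidence in [0, 1] (hypothetical values).
    """
    # One binary label per class: 1 means "present", 0 means "absent".
    labels = {name: int(score >= threshold) for name, score in scores.items()}
    # The candidate set is simply the classes whose label is 1.
    candidates = sorted(name for name, flag in labels.items() if flag)
    return labels, candidates


# Made-up scores for a single test image.
scores = {"person": 0.92, "sheep": 0.81, "dog": 0.64, "car": 0.03}
labels, candidates = present_classes(scores)
print(labels)
print(candidates)
```

Note that the result says nothing about where an object is or how many of them there are; it only answers whether each class appears.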

Object detection

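Detecting an object entails both stating that an object belonging to a specified class is present, and localizing it in the image. The location of an object is typically represented by a bounding box. Object detection involves two questions: first, deciding whether an object of a given class appears in the image; second, localizing that object, where the usual representation of location is the object's bounding box. In practice: given a test image, output the classes and locations of the detected objects.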
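One possible way to represent a detection result is a list of (class, bounding box) records. The sketch below is an assumption for illustration: the `Detection` type, the `(x_min, y_min, x_max, y_max)` pixel convention, the `score` field, and all the numbers are hypothetical and not taken from any particular detector.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    label: str                       # which class was found
    box: Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max) in pixels, assumed convention
    score: float                     # detector confidence, hypothetical


def detections_of(detections: List[Detection], label: str) -> List[Detection]:
    """Return every detected object of the given class."""
    return [d for d in detections if d.label == label]


# Made-up output for one test image: one person and two sheep.
detections = [
    Detection("person", (12, 30, 88, 210), 0.97),
    Detection("sheep", (100, 120, 180, 200), 0.91),
    Detection("sheep", (190, 125, 260, 205), 0.88),
]

for d in detections_of(detections, "sheep"):
    print(d.label, d.box)
```

Compared with classification, each object now gets its own entry with a location, so two sheep produce two boxes.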

Semantic scene labeling

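The task of labeling semantic objects in a scene requires that each pixel of an image be labeled as belonging to a category, such as sky, chair, floor, street, etc. In contrast to the detection task, individual instances of objects do not need to be segmented. Semantic labeling/segmentation: this task requires labeling every pixel in the image with an object category. Different instances of the same category do not need to be segmented separately. For the figure below, the labels are person, sheep, dog, and grass; there is no need to distinguish sheep 1, sheep 2, sheep 3, sheep 4, and sheep 5.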
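The difference between a per-pixel semantic label and a per-instance label can be sketched with a tiny grid. Everything here is a toy assumption: the 3x4 "image", the instance ids, and the helper `to_semantic` are hypothetical and exist only to show that semantic labeling drops the distinction between instances.

```python
def to_semantic(instance_map, instance_to_class):
    """Replace each pixel's instance id with its class name.

    Instances of the same class collapse into one label, which is exactly
    what semantic labeling produces.
    """
    return [[instance_to_class[i] for i in row] for row in instance_map]


# Toy 3x4 image: id 0 is the grass background, ids 1 and 2 are two separate sheep.
instance_map = [
    [0, 0, 0, 0],
    [1, 1, 0, 2],
    [1, 1, 0, 2],
]
instance_to_class = {0: "grass", 1: "sheep", 2: "sheep"}

semantic_map = to_semantic(instance_map, instance_to_class)
labels = sorted({c for row in semantic_map for c in row})

for row in semantic_map:
    print(row)
print(labels)
```

In the semantic map both animals carry the same "sheep" label, while the instance map keeps them apart as ids 1 and 2.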