deepfashion2

最新推荐文章于 2024-08-22 08:29:44 发布

沙雅云

最新推荐文章于 2024-08-22 08:29:44 发布

阅读量3.7k

点赞数 1

分类专栏：目标检测姿态估计

本文链接：https://blog.csdn.net/yychentracy/article/details/96480925

版权

目标检测同时被 2 个专栏收录

48 篇文章 5 订阅

订阅专栏

姿态估计

4 篇文章 0 订阅

订阅专栏

main idea
这篇文章提出了一个新的数据集，是在原有的数据集上进行扩充的，包含491k的images，每张图片都包含丰富的语义标注，包括 style,scale,occlusion,zooming,viewpoint,bounding box,dense landmarks and pose.pixel-level masks.pair of image.
这篇章提出了一个benchmark，包含多个任务的，必去fashion understanding，including clothing detection，landmark，pose estimation。clothes segmentation，consumer-to-shop，verifacation and retrieval。
abstract

deepfashion的缺陷 1 每张图片只有一个衣服2 稀疏的landmark 3，没有逐像素的标记
deepfashion2的四个任务 cloth detection、pose estimation、 segmentation、 retrieval
rich annotation 这个数据集比AIfashion还要大，有801k items，style，scale，viewpoint，occlushion，bounding box，dence landmarks，mask。
strong baseline match R-CNN，是在mask R-CNN的基础上建立的网络，使用一种端到端的方法解决了以上的四个任务introduction
the chanllenge of image understanding：large deformation、occlusion、discrepancies of cloth
three main contribution
1 建立一个large scale fashion benchmark with comprehensive task and annocation
2 a full spectrum of task is carefully define on the dataset
3 extensively evalute mask r-cnn，aggregate all the lead feature from clothes category，pose，and mask；
related work
clothes dataset：WIBI，DARN，CCP，deepfashion，包含801k的instance of landmark，mask，bounding box，491k的images，873pair
fashion image understanding
需要一个统一的基准和框架来考虑这些任务，deepfashion2和match R-CNN考虑服装的variations，scale，occlusion，zoom-in，viewpoint来instance-level retrieval将人体姿态估计运用到服装估计
deep fashion2 dataset and benchmark
four unique characteristic compared to exiting fashion dataset
1 large sample
2 versality (fashion clothes understanding)
3 expresivety (多个items展示再一张图上，13种defination of landmarks and pose for
13 category
4 diverty control their variation in term of four properties including scale，occlusion，zoom in
data collection and cleaning
1 deep fashion主要来自deepfashion1和一些shopping websites或者crawl a large set of image on Internet
2 variation：包括scale，occlusion、zoom-in，（一些item outside the image）就是啊，百分之七的没有人，78%正面穿，其他的从侧面或者背后穿。
3 data labeling
4 category and bounding box
5 mask generate mask from contours and human annotators
6style
benchmark
包括四个
clothes detection
landmark estimation
segmentation
commercial -consumer clothes retrieval
match r-cnn
如下如所示
通过使用不同的流，并且在这些流上叠加一个Siamese 模块来聚合所学习的特征。

实验结果
clothes detection

landmark estimate

clothes segmentation