Image Annotation Result

Experimental results for automatic image annotation, as reported in the referenced papers.

Result

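                    Corel-5K                    ESP Game                    IAPR TC-12
Method        Year  P      R      F1     N+     P      R      F1     N+     P      R      F1     N+
MBRM          2004  24     25     24.5   122    18     19     18.5   209    24     23     23.5   223
JEC           2008  27     32     29.3   139    22     25     23.4   224    28     29     28.5   250
CNN-R         2015  32     41.3   36.1   166    44.5   28.5   34.7   248    49     31     38.0   272
FastTag       2013  32     43     36.7   166    46     22     29.8   247    47     26     33.5   280
TagProp(σML)  2009  33     42     37.0   160    39     27     31.9   239    46     35     39.8   266
2PKNN         2012  39     40     39.5   177    51     23     31.7   245    49     32     38.7   274
GLKNN         2015  36     47     40.8   184    41     36     38.3   282    34     31     32.4   255
SVM-DMBRM     2014  36     48     41.1   197    55     25     34.4   259    56     29     38.2   283
SKL-CRM       2014  39     46     42.2   184    41     26     31.8   248    47     32     38.1   274
KCCA          2015  39     53     44.9   184    30     36     32.7   252    38     39     38.5   273
2PKNN+ML      2012  44     46     45.0   191    53     27     35.8   252    54     37     43.9   278
CCA-KNN       2015  42     52     46.5   201    46     36     40.4   260    45     38     41.2   278
NSIDML        2016  44.12  51.76  47.76  194    49.8   29.5   37.05  253    56.9   36.5   45.21  282

Methods with results on only two of the three datasets. The dataset for each group is not identified here; consult the reference papers. ("-" means not reported.)

Method          Year  P      R      F1     N+     P      R      F1     N+
Group Sparsity  2010  30     33     31.4   146    32     29     30.4   252
KCCA-2PKNN      2014  42     46     43.9   179    59     30     39.8   259
NMF-KNN         2014  38     56     45.3   150    33     26     29.1   238
context-RM-B    2015  61     24     34.4   242    61     20     30.1   234
SLED            2015  35     51     41.5   -      49.82  47.36  48.6   -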
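
The page does not define the column headers. A reasonable reading, stated here as an assumption, is that P and R are precision and recall in percent, F1 is their harmonic mean, and N+ is the number of labels with non-zero recall. The F1 reading can be checked against the MBRM row on Corel-5K (P = 24, R = 25):

```latex
F_1 = \frac{2PR}{P + R},
\qquad
F_1^{\text{MBRM}} = \frac{2 \cdot 24 \cdot 25}{24 + 25} = \frac{1200}{49} \approx 24.5
```

The same check reproduces the JEC entry: 2 · 27 · 32 / (27 + 32) = 1728 / 59 ≈ 29.3.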

Dataset

  • Corel 5K
    Paper: P. Duygulu, K. Barnard, J. F. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.

  • ESP Game
    Paper: L. Von Ahn and L. Dabbish. Labeling images with a computer game. In SIGCHI Conference on Human Factors in Computing Systems, 2004.
    http://hunch.net/~learning/ESP-ImageSet.tar.gz

  • IAPR TC-12
    http://www.imageclef.org/photodata

Large Scale Dataset

  • NUS-WIDE
    Paper: T.-S. Chua, J. Tang, R. Hong, et al. NUS-WIDE: A real-world web image database from National University of Singapore. In ACM International Conference on Image and Video Retrieval, 2009.
    http://lms.comp.nus.edu.sg/research/NUS-WIDE.htm

Dataset                     Corel 5K          ESP Game          IAPR TC-12        NUS-WIDE
No. of images               5000              20770             19627             269648 (209347 annotated)
No. of labels               260               268               291               81
Train images                4500              18689             17665             110K (not fixed)
Test images                 500               2081              1962              4K (not fixed)
Labels per image            3.4, 4, 5         4.7, 5, 15        5.7, 5, 23        2.4, 2
Images per label            58.6, 22, 1004    326.7, 172, 4553  347.7, 153, 4999  5701.3, 1682
No. of labels < mean-freq   195 (75.0%)       201 (75.0%)       217 (74.6%)       -

(entry format: mean, median, maximum)
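
The statistics above can be recomputed from a binary image-by-label annotation matrix. The snippet below is a minimal sketch on a small made-up matrix, not the script used to produce the table. It assumes that "No. of labels < mean-freq" counts the labels whose image count falls below the mean image count per label; the variable names are illustrative.

```matlab
% Toy binary annotation matrix (hypothetical data):
% rows are images, columns are labels.
annot = logical([1 1 0 0;
                 1 0 1 0;
                 1 1 0 1;
                 1 0 0 0;
                 1 1 0 0]);

labels_per_image = sum(annot, 2);   % row sums: labels attached to each image
images_per_label = sum(annot, 1);   % column sums: images carrying each label

% Same entry format as the table: mean, median, maximum.
fprintf('labels per image: %.1f, %g, %d\n', mean(labels_per_image), ...
        median(labels_per_image), max(labels_per_image));
fprintf('images per label: %.1f, %g, %d\n', mean(images_per_label), ...
        median(images_per_label), max(images_per_label));

% Labels that occur less often than the mean label frequency (assumed definition).
n_rare = sum(images_per_label < mean(images_per_label));
fprintf('labels < mean-freq: %d (%.1f%%)\n', n_rare, ...
        100 * n_rare / numel(images_per_label));
```

For a real dataset, `annot` would be replaced by the matrix read from the annotation files described in the next section.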

Features and annotations from INRIA

http://lear.inrialpes.fr/people/guillaumin/data.php

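  • gen_annotation.m
    input: the files provided on the website above
    output: train_annot.txt and test_annot.txt in each dataset folder

    % Convert the binary annotation files (*.hvecs) into tab-separated 0/1
    % text files: one row per image, one column per label.
    % vec_read is assumed to be the reader supplied alongside the data.
    datasets = { 'corel5k', 'iaprtc12', 'espgame' };
    sets = { 'test', 'train' };
    for db = 1:length(datasets)
        ds = datasets{db};
        for s = 1:length(sets)
            str = sets{s};
            % Image list of this split; its length is the number of rows to write.
            list = textread([ds '/' ds '_' str '_list.txt'], '%s');
            annot = logical(vec_read([ds '/' ds '_' str '_annot.hvecs']));
            % Write to <dataset>/<split>_annot.txt, as stated under "output".
            fid = fopen([ds '/' str '_annot.txt'], 'w');
            for i = 1:length(list)
                annotation = annot(i,:);   % label vector of image i
                fprintf(fid, '%d', annotation(1));
                for j = 2:length(annotation)
                    fprintf(fid, '\t%d', annotation(j));
                end
                fprintf(fid, '\n');
            end
            fclose(fid);
        end
    end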

Reference

Method          Year  Conference  Reference Paper
MBRM            2004  CVPR        Multiple Bernoulli relevance models for image and video annotation
JEC             2008  ECCV        A new baseline for image annotation
TagProp(σML)    2009  ICCV        TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation
Group Sparsity  2010  CVPR        Automatic image annotation using group sparsity
2PKNN           2012  ECCV        Image annotation using metric learning in semantic neighbourhoods
2PKNN+ML        2012  ECCV        Image annotation using metric learning in semantic neighbourhoods
FastTag         2013  ICML        Fast image tagging
NMF-KNN         2014  CVPR        NMF-KNN: image annotation using weighted multi-view non-negative matrix factorization
SVM-DMBRM       2014  ICMR        A Hybrid Model for Automatic Image Annotation
KCCA-2PKNN      2014  ICMR        A cross-media model for automatic image annotation
SKL-CRM         2014  MIR         A sparse kernel relevance model for automatic image annotation
context-RM-B    2015  CVPR        Feature-Independent Context Estimation for Automatic Image Annotation
CNN-R           2015  ICMR        Automatic Image Annotation using Deep Learning Representations
KCCA            2015  ICMR        Automatic Image Annotation using Deep Learning Representations
CCA-KNN         2015  ICMR        Automatic Image Annotation using Deep Learning Representations
GLKNN           2015  ICMR        Graph Learning on K Nearest Neighbours for Automatic Image Annotation
SLED            2015  J. TIP      SLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation
NSIDML          2016  J. VCIR     Image distance metric learning based on neighborhood sets for automatic image annotation

Large Scale Dataset

Year  Conference  Reference Paper
2014  ICLR        Deep Convolutional Ranking for Multilabel Image Annotation
2015  ICCV        Love Thy Neighbors: Image Annotation by Exploiting Image Metadata
2015  ICMR        Large Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning

Method          Link
TagProp(σML)    http://lear.inrialpes.fr/people/guillaumin/code.php#tagprop
Group Sparsity  http://ranger.uta.edu/~huang/codes/annotation_corel.zip
2PKNN(+ML)      http://cvit.iiit.ac.in/projects/imageAnnotation/
FastTag         http://www.cse.wustl.edu/~mchen/
NMF-KNN         http://crcv.ucf.edu/people/phd_students/mahdi/
SKL-CRM         https://github.com/sjmoran/sklcrm