Image Annotation Result 图像标注结果

Some experiment results of image annotation according to the papers.

Result

      Corel-5KESP GameIAPRTC-12
MethodYearPRF1N+PRF1N+PRF1N+
MBRM2004242524.5122181918.5209242323.5223
JEC2008273229.3139222523.4224282928.5250
Group Sparsity2010303331.4146322930.4252
CNN-R20153241.336.116644.528.534.7248493138.0272
FastTag2013324336.7166462229.8247472633.5280
TagProp(σML)2009334237.0160392731.9239463539.8266
2PKNN2012394039.5177512331.7245493238.7274
GLKNN2015364740.8184413638.3282343132.4255
SVM-DMBRM2014364841.1197552534.4259562938.2283
SKL-CRM2014394642.2184412631.8248473238.1274
KCCA-2PKNN2014424643.9179593039.8259
KCCA2015395344.9184303632.7252383938.5273
2PKNN+ML2012444645.0191532735.8252543743.9278
NMF-KNN2014385645.3150332629.1238
CCA-KNN2015425246.5201463640.4260453841.2278
context-RM-B2015612434.4242612030.1234
SLED2015355141.549.8247.3648.6
NSIDML201644.1251.7647.7619449.829.537.0525356.936.545.21282

Dataset

  • Corel 5K
    Paper: P. Duygulu, K. Barnard, J. F. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.

  • ESP Game
    Paper: L. Von Ahn and L. Dabbish. Labeling images with a computer game. In SIGCHI Conference on Human Factors in Computing Systems, 2004.
    http://hunch.net/~learning/ESP-ImageSet.tar.gz

  • IAPRTC-12
    http://www.imageclef.org/photodata

Large Scale Dataset

  • NUS-WIDE
    Paper: Chua T S, Tang J, Hong R, et al. NUS-WIDE: a real-world web image database from National University of Singapore[C]//Proceedings of the ACM international conference on image and video retrieval. ACM, 2009: 48.
    http://lms.comp.nus.edu.sg/research/NUS-WIDE.htm
DatasetCorel 5KESP GameIAPR TC-12NUS-WIDE
No. of images50002077019627269648 (209347 annotated)
No. of labels26026829181
Train images45001868917665110K (not fixed)
Test images500208119624K (not fixed)
labels per image3.4, 4, 54.7, 5, 155.7, 5, 232.4, 2
images per label58.6, 22, 1004326.7, 172, 4553347.7, 153, 49995701.3, 1682
No. of labels < mean-freq195 (75.0%)201 (75.0%)217 (74.6%)

(entry format: mean, median, maximum)

Features and annotations from INRIA

http://lear.inrialpes.fr/people/guillaumin/data.php

  • gen_annotation.m
    input: files provided in the website
    output: train_annot.txt, test_annot.txt foreach dataset folder

    datasets = { 'corel5k', 'iaprtc12', 'espgame' };
    sets = { 'test' , 'train' };
    for db=1:length(datasets),
        ds = datasets{db};
        for s=1:length(sets),
            str = sets{s};
            list = textread([ds '/' ds '_' str '_list.txt'],'%s');
            annot = logical(vec_read([ds '/' ds '_' str '_annot.hvecs']));
            fid = fopen([ds '/' str '_list.txt'], 'w');
            for i=1:length(list),
                annotation = annot(i,:);
                fprintf(fid, '%d', annot(1));
                for j=2:length(annotation),
                    fprintf(fid, '\t%d', annot(j));
                end
                fprintf(fid, '\n');
            end
            fclose(fid);
        end
    end

Reference

MethodYearConferenceReference Paper
MBRM2004CVPRMultiple bernoulli relevance models for image and video annotation
JEC2008ECCVA new baseline for image annotation
TagProp(σML)2009ICCVTagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation
Group Sparsity2010CVPRAutomatic image annotation using group sparsity
2PKNN2012ECCVImage annotation using metric learning in semantic neighbourhoods
2PKNN+ML2012ECCVImage annotation using metric learning in semantic neighbourhoods
FastTag2013ICMLFast image tagging
NMF-KNN2014CVPRNMF-KNN: image annotation using weighted multi-view non-negative matrix factorization
SVM-DMBRM2014ICMRA Hybrid Model for Automatic Image Annotation
KCCA-2PKNN2014ICMRA cross-media model for automatic image annotation
SKL-CRM2014MIRA sparse kernel relevance model for automatic image annotation
context-RM-B2015CVPRFeature-Independent Context Estimation for Automatic Image Annotation
CNN-R2015ICMRAutomatic Image Annotation using Deep Learning Representations
KCCA2015ICMRAutomatic Image Annotation using Deep Learning Representations
CCA-KNN2015ICMRAutomatic Image Annotation using Deep Learning Representations
GLKNN2015ICMRGraph Learning on K Nearest Neighbours for Automatic Image Annotation
SLED2015J. TIPSLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation
NSIDML2016J. VCIRImage distance metric learning based on neighborhood sets for automatic image annotation

Large Scale Dataset

YearConferenceReference Paper
2014ICLRDeep Convolutional Ranking for Multilabel Image Annotation
2015ICCVLove Thy Neighbors Image Annotation by Exploiting Image Metadata
2015ICMRLarge Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning
MethodLink
TagProp(σML)http://lear.inrialpes.fr/people/guillaumin/code.php#tagprop
Group Sparsityhttp://ranger.uta.edu/~huang/codes/annotation_corel.zip
2PKNN(+ML)http://cvit.iiit.ac.in/projects/imageAnnotation/
FastTaghttp://www.cse.wustl.edu/~mchen/
NMF-KNNhttp://crcv.ucf.edu/people/phd_students/mahdi/
SKL-CRMhttps://github.com/sjmoran/sklcrm
评论 7
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值