Visual Geometry Group 资源(vgg)

1.Phog 代码:http://www.robots.ox.ac.uk/~vgg/research/caltech/phog.html


其它链接:http://www.robots.ox.ac.uk/~vgg/software/


Homogeneous kernel map for fast non-linear classification with additive kernels

The homogeneous kernel map allows large scale training of non-linear SVMs based on additive kernels such as Chi2, intersection, and Jensen-Shannon. The map transforms the data into a compact linear representation which reproduces the desired kernel to a very good level of approximation. This representation enables then to use very fast linear SVM solvers.

User interface for interactive image segmentation

matlab implementation of a user interface for interactive segmentation. Implements the star-convexity algorithms described in Gulshan et al. in CVPR2010 and other commonly used interactive segmentation methods.

FASTANN and FASTCLUSTER for approximate k-means (AKM)

A distributed implementation of the approximate k-means (AKM) algorithm presented in Philbin et al. at CVPR 2007. The software consists of two libraries: i) FASTANN, a library for fast, approximate nearest neighbours. ii) FASTCLUSTER, an MPI-distributed library for doing exact and approximate k-means.

Descriptor Learning Using Convex Optimisation

Code and learnt models for feature descriptor computation and evaluation, as described in Simonyan et al., ECCV 2012.

2D articulated human pose estimation

The software for articulated human pose estimation in still images is designed to operate in uncontrolled images with difficult illumination conditions and cluttered backgrounds. The only assumption the algorithm makes is that people are upright (i.e. their head is above their torso) and they are seen approximately from a frontal viewpoint.

Hand detection using multiple proposals

The code for hand detection in static images implementing the method described in Mittal et al. at BMVC 2011.

Efficient structured output SVM ranking

The reference implementation of the structured output ranking algorithm with linear time constraint generation method proposed in Mittal et al. at ECCV 2012.

Encoding Methods Evaluation Toolkit

MATLAB code for evaluation of different bag of visual words encoding schemes over standard image classification test datasets.

Multiple Kernel Learning for Image Classification

The VGG MKL classifier is an implementation of multiple kernel learning for image classification. It bundles powerful image descriptors (spatial pyramids of geometric blur, fast dense SIFT, and self-similarity features) and achieves state-of-the-art performance on Caltech-101. In MATLAB/C, compatible with Linux 32/64, Windows 32/64, and Mac OS X.

Upper-body Detector

The upper-body detector software pages provide download links for software designed to detect the region between the top of the head and the upper half of the torso. Example results (images and video) and performance evaluations are included.

Self-Similarity Descriptor

Implementation of the Self-Similarity Descriptor by Varun Gulshan, based on the paper Matching Local Self-Similarities across Images and Videos, by Eli Shechtman and Michal Irani at CVPR '07.

Multi-frame Image Super-resolution

The Super-resolution code page provides a basic suite of Matlab/C-Mex functions for computing ML and MAP super-resolution image estimates, including documentation and a demo m-file.

Pyramid Histogram of Oriented Gradients

Code for computing the Pyramid Histogram of Oriented Gradients (PHOG) descriptor over a Region Of Interest (ROI) is provided on the phog page, which is part of theImage Classification section of the research pages.

Affine Covariant Features

The following items link into the Affine Covariant Features section of the VGG'sresearch pages.

 

Affine Normalized Regions

input
dino
output
dino
  • Linux binary to compute affine normalized regions around interest points.
  • With thanks to Frederik Schaffalitzky, VGG, Univ of Oxford.

 

Probabilistic Latent Semantic Analysis (pLSA)


  • 1
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
# 工程内容 这个程序是基于tensorflow的tflearn库实现部分RCNN功能。 # 开发环境 windows10 + python3.5 + tensorflow1.2 + tflearn + cv2 + scikit-learn # 数据集 采用17flowers据集, 官网下载:http://www.robots.ox.ac.uk/~vgg/data/flowers/17/ # 程序说明 1、setup.py---初始化路径 2、config.py---配置 3、tools.py---进度条和显示带框图像工具 4、train_alexnet.py---大数据集预训练Alexnet网络,140个epoch左右,bitch_size为64 5、preprocessing_RCNN.py---图像的处理(选择性搜索、数据存取等) 6、selectivesearch.py---选择性搜索源码 7、fine_tune_RCNN.py---小数据集微调Alexnet 8、RCNN_output.py---训练SVM并测试RCNN(测试的时候测试图片选择第7、16类中没有参与训练的,单朵的花效果好,因为训练用的都是单朵的) # 文件说明 1、train_list.txt---预训练数据,数据在17flowers文件夹中 2、fine_tune_list.txt---微调数据2flowers文件夹中 3、1.png---直接用选择性搜索的区域划分 4、2.png---通过RCNN后的区域划分 # 程序问题 1、由于数据集小的原因,在微调时候并没有像论文一样按一个bitch32个正样本,128个负样本输入,感觉正样本过少; 2、还没有懂最后是怎么给区域打分的,所有非极大值抑制集合canny算子没有进行,待续; 3、对选择的区域是直接进行缩放的; 4、由于数据集合论文采用不一样,但是微调和训练SVM时采用的IOU阈值一样,有待调参。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值