AI小作坊's blog

The greatest truths are the simplest; heaven and humanity are one

  • Blog (39)
  • Resources (91)
  • Favorites
  • Following

Original: Vehicle analysis in satellite imagery -- A Large Contextual Dataset for Classification, Detection and Counting of Cars

A Large Contextual Dataset for Classification, Detection and Counting of Cars with Deep Learning. ECCV2016. https://gdo-datasci.ucllnl.org/cowc/ This paper builds a new dataset for vehicle analysis in satellite imagery: Cars Overhead with Context (COWC...

2017-09-30 15:03:43 1813 1

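Original: People and vehicle density estimation -- Towards perspective-free object counting with deep learning

Towards perspective-free object counting with deep learning. ECCV2016. https://github.com/gramuah/ccnn This paper addresses people and vehicle density estimation and makes two main contributions: 1) it proposes a novel convolutional neural network, the Counting CNN (CCNN), which regresses image patches to density maps; 2) ...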

2017-09-30 10:26:47 3125

Original: Fast shadow removal -- Fast Shadow Detection from a Single Image Using a Patched Convolutional Neural Network

Fast Shadow Detection from a Single Image Using a Patched Convolutional Neural Network. https://arxiv.org/abs/1709.09283 This paper tackles fast shadow removal; the strategy used is SVM+CNN. A. Computing Shadow Prior: first, the mean shift algorithm is applied to the input image ...

2017-09-30 08:56:09 1783 1

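Original: Fast crowd density estimation -- Multi-scale Convolutional Neural Networks for Crowd Counting

Multi-scale Convolutional Neural Networks for Crowd Counting. https://arxiv.org/abs/1702.02359 In crowd density estimation, the scale variation problem in images has led to the use of multiple CNNs (multi-column/multi-network designs). Using multiple CNNs increases the number of network parameters and the amount of computation, which is unfavorable for ...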

2017-09-29 16:15:19 3789 1

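Original: Crowd counting -- Mixture of Counting CNNs

Mixture of Counting CNNs: Adaptive Integration of CNNs Specialized to Specific Appearance for Crowd Counting. https://arxiv.org/abs/1703.09393 This paper does crowd counting, not crowd density estimation, so the network structure is fairly simple. The main idea here concerns the scale and c... of different scenes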

2017-09-29 14:15:54 2730

Original: Crowd density estimation -- Fully Convolutional Crowd Counting On Highly Congested Scenes

Fully Convolutional Crowd Counting On Highly Congested Scenes. The 12th International Conference on Computer Vision Theory and Applications (VISAPP 2017). This paper uses an FCN for crowd density estimation, mainly drawing on Single-image cro...

2017-09-29 09:25:26 2317

Original: Crowd density estimation -- CrowdNet: A Deep Convolutional Network for Dense Crowd Counting

CrowdNet: A Deep Convolutional Network for Dense Crowd Counting published in the proceedings of ACM Conference on Multimedia (ACMMM) - 2016 http://val.serc.iisc.ernet.in/CrowdNet/ Caffe: https://g

2017-09-28 14:26:20 5028 2

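Original: Crowd density estimation -- Learning to Count with CNN Boosting

Learning to Count with CNN Boosting. ECCV2016. This paper uses a CNN for crowd density estimation, with two main improvements: layered boosting and selective sampling. Boosting deep networks: boosting is a well-known greedy technique in ensemble learning. The basic idea is to train a new classifier on the errors of the previous classifier in order to correct them. More broadly, when multiple ...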

2017-09-28 10:02:32 3006

Original: Crowd analysis -- ResnetCrowd: A Residual Deep Learning Architecture

ResnetCrowd: A Residual Deep Learning Architecture for Crowd Counting, Violent Behaviour Detection and Crowd Density Level Classification (AVSS 2017) 2017 14th IEEE International Conference on Advance

2017-09-27 16:25:39 1800

Original: Crowd density estimation -- CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd

CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2017 Torch: http

2017-09-27 11:10:32 2815 1

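Original: Video object segmentation -- One-Shot Video Object Segmentation

One-Shot Video Object Segmentation. CVPR2017. http://www.vision.ee.ethz.ch/~cvlsegmentation/osvos/ One-Shot Video Object Segmentation is video object segmentation from a single annotated frame: for a given object in a video, we provide only one training sample. How do we then segment that object throughout the video? The first image in the figure above ...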

2017-09-26 16:17:28 9616

Original: Detecting drones with a moving camera -- Detecting Flying Objects using a Single Moving Camera

Detecting Flying Objects using a Single Moving Camera. PAMI 2017. http://cvlab.epfl.ch/research/unmanned/detection https://drive.switch.ch/index.php/s/3b3bdbd6f8fb61e05d8b0560667ea992 Flying Objects De...

2017-09-21 09:40:54 1827 1

Original: Video action recognition -- Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. ECCV2016. https://github.com/yjxiong/temporal-segment-networks This paper focuses on extracting long-range temporal structure from longer videos, because some actions take a relatively long time, ...

2017-09-20 15:45:33 1749

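Original: Embedded object detection -- Fast YOLO: A Fast You Only Look Once System for Real-time Embedded Object Detection

Fast YOLO: A Fast You Only Look Once System for Real-time Embedded Object Detection in Video. Targeting CNN-based object detection on embedded devices, this paper modifies YOLOv2 to reduce the number of model parameters and increase speed at a slight cost in accuracy. In video processing it achieves an average speedup of ∼3.3X over YOLOv2, run an average o...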

2017-09-20 14:06:22 7176 4

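Original: Object detection for autonomous driving -- MODNet: Moving Object Detection Network for Autonomous Driving

MODNet: Moving Object Detection Network with Motion and Appearance for Autonomous Driving. This work applies the two-stream network framework from video action recognition to object detection for autonomous driving, using the motion and appearance cues of video analysis. The paper's contributions are mainly the following three: 1...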

2017-09-20 09:28:20 6172

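Original: Fast small-object detection -- Feature-Fused SSD: Fast Detection for Small Objects

Feature-Fused SSD: Fast Detection for Small Objects. For small-object detection, this paper makes a small modification to the SSD model, introducing contextual information into SSD to help it detect small objects. The importance of contextual information for small-object detection is self-evident. Small objects in an image have limited resolution and ...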

2017-09-19 16:22:07 16594 27

Original: Crowd density estimation -- Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs

Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs. ICCV2017. For crowd density estimation, this paper reduces estimation error mainly by incorporating global and local contextual information. It uses multiple CNNs to estimate context at different scales to help crowd density ...

2017-09-19 13:51:18 4683 2

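Original: Crowd density estimation -- Spatiotemporal Modeling for Crowd Counting in Videos

Spatiotemporal Modeling for Crowd Counting in Videos. ICCV2017. For crowd density estimation in video, this work focuses on the temporal information in video, using a variant of the convolutional LSTM (ConvLSTM), a bidirectional ConvLSTM model, to extract information from the frames before and after the current frame and improve crowd density estimation. Currently, based on ...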

2017-09-19 09:23:09 2887 2

Original: Crowd behavior classification dataset -- Novel Dataset for Fine-grained Abnormal Behavior Understanding in Crowd

Novel Dataset for Fine-grained Abnormal Behavior Understanding in Crowd. Dataset: https://github.com/hosseinm/med This paper builds a dataset for crowd behavior classification with five classes: Panic, Fight, Congestion, Obstacle, Neutral. Status of existing datasets: 2 Proposed Data...

2017-09-18 13:59:57 1752

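Original: Crowd behavior classification dataset -- Crowd-11: A Dataset for Fine Grained Crowd Behaviour Analysis

Crowd-11: A Dataset for Fine Grained Crowd Behaviour Analysis. CVPRW2017. This dataset does not appear to be public yet; presumably it will be released later. For research on crowd behavior analysis, this paper makes three main contributions: 1) for fine-grained crowd behavior, we built a fairly large dataset, Crowd-11, 11 crowd motion patterns and it is compose...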

2017-09-18 11:15:12 3765 1

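Original: Crowd segmentation -- Fully Convolutional Neural Networks for Crowd Segmentation

Fully Convolutional Neural Networks for Crowd Segmentation. https://arxiv.org/abs/1411.4464 A fully convolutional network is designed for crowd segmentation in video, considering three kinds of information: Appearance, Motion, and Structure. The idea is still quite basic. The main difficulties are that we also want to segment stationary crowds, and that when the crowd's texture is similar to the background ...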

2017-09-18 10:08:54 2186

Original: Video action recognition -- Convolutional Two-Stream Network Fusion for Video Action Recognition

Convolutional Two-Stream Network Fusion for Video Action Recognition. CVPR2016. http://www.robots.ox.ac.uk/~vgg/software/two_stream_action/ https://github.com/feichtenhofer/twostreamfusion For video action recognition, it adopts two...

2017-09-15 16:27:30 4684 5

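Original: Video action recognition -- Towards Good Practices for Very Deep Two-Stream ConvNets

Towards Good Practices for Very Deep Two-stream ConvNets. http://yjxiong.me/others/action_recog/ https://github.com/yjxiong/caffe/tree/action_recog This paper first points out that CNNs have made great progress on static image classification but do not perform very well on video action classification. Here ...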

2017-09-15 15:48:26 1650

Original: Video action recognition -- Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos. NIPS2014. http://www.robots.ox.ac.uk/~vgg/software/two_stream_action/ For action classification in video, this paper uses two separate CNNs to process the spatial and temporal information in the video separately: spatial and temporal ...

2017-09-15 14:11:58 7181

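Original: Spatiotemporal features -- Learning Spatiotemporal Features with 3D Convolutional Networks

Learning Spatiotemporal Features with 3D Convolutional Networks. ICCV 2015. http://vlg.cs.dartmouth.edu/c3d/ https://github.com/facebook/C3D This paper uses a 3D CNN to analyze video sequences; the learned spatiotemporal features are called C3D. It mainly searches for the optimal 3D filter structure in a 3D CNN ...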

2017-09-15 10:05:23 6600

Original: Crowd scene analysis -- Slicing Convolutional Neural Network for Crowd Video Understanding

Slicing Convolutional Neural Network for Crowd Video Understanding. CVPR2016. http://www.ee.cuhk.edu.hk/~jshao/SCNN.html Caffe code: https://github.com/amandajshao/Slicing-CNN This paper also uses a CNN to analyze a video clip; the network outputs ...

2017-09-14 16:36:11 2047

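Original: Crowd behavior analysis -- Understanding Pedestrian Behaviors from Stationary Crowd Groups

Understanding Pedestrian Behaviors from Stationary Crowd Groups. CVPR2015. This paper mainly examines how stationary crowds affect pedestrian behavior. Previous models of crowd behavior mainly considered these factors: scene layout (e.g. entrances, exits, walls, and obstacles), pedestrian beliefs (the choic...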

2017-09-14 11:16:59 2312 2

Original: Attributes of crowded scenes -- Deeply Learned Attributes for Crowded Scene Understandin

Deeply Learned Attributes for Crowded Scene Understanding. CVPR2015. http://www.ee.cuhk.edu.hk/~jshao/WWWCrowdDataset.html https://github.com/amandajshao/www_deep_crowd What problem does this paper solve? Given a video of a crowded scene, can an algorithm give ...

2017-09-13 15:21:20 995

Original: Crowd motion -- Scene-Independent Group Profiling in Crowd

Scene-Independent Group Profiling in Crowd. CVPR2014. http://www.ee.cuhk.edu.hk/~jshao/CUHKcrowd.html https://github.com/amiltonwong/crowd_group_profile A crowd is made up of groups; here we analyze the properties of groups and propose several quantitatively analyzable ...

2017-09-13 13:58:30 1218

Original: Measuring collective motion -- Measuring Crowd Collectiveness

Measuring Crowd Collectiveness. CVPR2013. http://mmlab.ie.cuhk.edu.hk/projects/collectiveness/ https://github.com/metalbubble/collectiveness To describe the collective motions of crowds, this paper defines a descriptor, collectiveness descr...

2017-09-13 09:36:41 1852

Original: Crowd counting -- Cross-scene Crowd Counting via Deep Convolutional Neural Networks

Cross-scene Crowd Counting via Deep Convolutional Neural Networks. CVPR2015. This paper uses deep learning for cross-scene crowd counting: put simply, training on multiple scenes and testing on scenes not seen during training. We built a new crowd counting dataset http://www.ee.cuhk.edu.hk/~xg

2017-09-12 14:21:20 6438 7

Original: Motion features in video -- Learning Motion Patterns in Videos

Learning Motion Patterns in Videos. CVPR2017. Torch code: http://thoth.inrialpes.fr/research/mpnet The problem addressed is determining whether an object is in motion, irrespective of camera motion; note that the camera here can move ...

2017-09-11 15:36:10 2841

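Original: Weakly supervised semantic segmentation -- Object Region Mining with Adversarial Erasing

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach. CVPR2017. How can training images that carry only image-level labels be used to train semantic segmentation? Here we propose using a classification network to segment the objects in the training images, obtaining pixel-labeled training images, and also propose an online prohibi...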

2017-09-08 15:17:33 7941 5

Original: Feature matching -- GMS: Grid-based Motion Statistics for Fast, Ultra-robust Feature Correspondence

GMS: Grid-based Motion Statistics for Fast, Ultra-robust Feature Correspondence. CVPR2017. c++ code: https://github.com/JiawangBian/GMS-Feature-Matcher For the feature matching problem, this paper proposes a simple statistics-based solution that can quickly separate correct matches from false ...

2017-09-08 10:59:33 12649 2

Original: Network model -- Squeeze-and-Excitation Networks

Squeeze-and-Excitation Networks. https://arxiv.org/abs/1709.01507 ILSVRC 2017 image classification winner. https://github.com/hujie-frank/SENet This paper proposes a new network module, the Squeeze-and-Excitation block, whose role is, for different channe...

2017-09-07 15:41:58 4963

Original: Satellite image segmentation -- Effective Use of Dilated Convolutions for Segmenting Small Object Instances

Effective Use of Dilated Convolutions for Segmenting Small Object Instances in Remote Sensing Imagery. https://arxiv.org/abs/1709.00179 For small-object segmentation in satellite imagery, this paper offers a solution based on the effective use of dilated convolution, mainly first increa...

2017-09-07 11:12:57 3966 5

Original: Crowd counting -- Switching Convolutional Neural Network for Crowd Counting

Switching Convolutional Neural Network for Crowd Counting. CVPR2017. Code for SCNN is based on Lasagne\Theano: https://github.com/val-iisc/crowd-counting-scnn A Switch-CNN network is proposed for crowd density estimation; the broad idea...

2017-09-06 11:40:54 8799 1

Original: Crowd counting -- Single-Image Crowd Counting via Multi-Column Convolutional Neural Network

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network CVPR2016 https://github.com/svishwa/crowdcount-mcnn https://github.com/leeyeehoo/Reduplication-of-Single-Image-Crowd-Countin

2017-09-05 15:57:35 4462 3

Original: Object detection -- DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling

DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling. ICCV2017. An easily extended Theano based code: https://github.com/lachlants/denet This paper speeds up the region proposal step: corner detection is used to filter out most candidate regions up front. Our proposed ...

2017-09-05 10:42:33 5331 2

Accuracy of Laplacian Edge Detectors

The sources of error for the edge finding technique proposed by Marr and Hildreth (D. Marr and T. Poggio, Proc. R. Soc. London Ser. B 204, 1979, 301–328; D. Marr and E. Hildreth, Proc. R. Soc. London Ser. B 207, 1980, 187–217) are identified, and the magnitudes of the errors are estimated, based on idealized models of the most common error producing situations. Errors are shown to be small for linear illuminations, as well as for nonlinear illuminations with a second derivative less than a critical value. Nonlinear illuminations are shown to lead to spurious contours under some conditions, and some fast techniques for discarding such contours are suggested.

2011-10-12

The Canny Edge Detector Revisited

Canny (1986) suggested that an optimal edge detector should maximize both signal-to-noise ratio and localization, and he derived mathematical expressions for these criteria. Based on these criteria, he claimed that the optimal step edge detector was similar to a derivative of a Gaussian. However, Canny’s work suffers from two problems. First, his derivation of the localization criterion is incorrect. Here we provide a more accurate localization criterion and derive the optimal detector from it. Second, and more seriously, the Canny criteria yield an infinitely wide optimal edge detector. The width of the optimal detector can however be limited by considering the effect of the neighbouring edges in the image. If we do so, we find that the optimal step edge detector, according to the Canny criteria, is the derivative of an ISEF filter, proposed by Shen and Castan (1992). In addition, if we also consider detecting blurred (or non-sharp) Gaussian edges of different widths, we find that the optimal blurred-edge detector is the above optimal step edge detector convolved with a Gaussian. This implies that edge detection must be performed at multiple scales to cover all the blur widths in the image. We derive a simple scale selection procedure for edge detection, and demonstrate it in one and two dimensions.

2011-08-11

OpenCV 2 Computer Vision Application Programming Cookbook

Overview of OpenCV 2 Computer Vision Application Programming Cookbook. Teaches you how to program computer vision applications in C++ using the different features of the OpenCV library. Demonstrates the important structures and functions of OpenCV in detail with complete working examples. Describes fundamental concepts in computer vision and image processing. Gives you advice and tips to create more effective object-oriented computer vision programs. Contains examples with source code and shows results obtained on real images with detailed explanations and the required screenshots.

2011-06-24

Learning based Symmetric Features Selection for Vehicle Detection

This paper describes a symmetric feature selection strategy based on statistical learning for detecting vehicles with a single moving camera for autonomous driving. Symmetry is a good class of feature for vehicle detection, but the areas with high symmetry and the threshold for segmentation are hard to decide. Usually, an additional assumption is added artificially, which decreases the robustness of the algorithms. In this paper, we focus on the problem of symmetric feature selection using a learning method for the autonomous driving environment. Global symmetry and local symmetry are defined and used to construct a cascaded structure with a one-class classifier followed by a two-class classifier.

2011-04-11

Intensity and Edge-Based Symmetry Detection Applied to Car-Following

We present two methods for detecting symmetry in images, one based directly on the intensity values and another one based on a discrete representation of local orientation. A symmetry finder has been developed which uses the intensity-based method to search an image for compact regions which display some degree of mirror symmetry due to intensity similarities across a straight axis. In a different approach, we look at symmetry as a bilateral relationship between local orientations. A symmetry-enhancing edge detector is presented which indicates edges dependent on the orientations at two different image positions. SEED, as we call it, is a detector element implemented by a feedforward network that holds the symmetry conditions. We use SEED to find the contours of symmetric objects of which we know the axis of symmetry from the intensity-based symmetry finder. The methods presented have been applied to the problem of visually guided car-following. Real-time experiments with a system for automatic headway control on motorways have been successful.

2011-04-11

Accurate Robust Symmetry Estimation

Stephen Smith and Mark Jenkinson. There are various applications, both in medical and non-medical image analysis, which require the automatic detection of the line (2D images) or plane (3D) of reflective symmetry of objects. There exist relatively simple methods of finding reflective symmetry when object images are complete (i.e., completely symmetric and perfectly segmented from image “background”). A much harder problem is finding the line or plane of symmetry when the object of interest contains asymmetries, and may not have well defined edges.

2011-04-11

Approach of vehicle segmentation based on texture character

2011-04-01

Method of removing moving shadow based on texture

2011-04-01

Environmentally Robust Motion Detection for Video Surveillance

Most video surveillance systems require to manually set a motion detection sensitivity level to generate motion alarms. The performance of motion detection algorithms, embedded in closed circuit television (CCTV) camera and digital video recorder (DVR), usually depends upon the preselected motion sensitivity level, which is expected to work in all environmental conditions. Due to the preselected sensitivity level, false alarms and detection failures usually exist in video surveillance systems. The proposed motion detection model based upon variational energy provides a robust detection method at various illumination changes and noise levels of image sequences without tuning any parameter manually. We analyze the structure mathematically and demonstrate the effectiveness of the proposed model with numerous experiments in various environmental conditions. Due to the compact structure and efficiency of the proposed model, it could be implemented in a small embedded system.

2011-03-17

Optimal multi-level thresholding using a two-stage Otsu optimization approach

Otsu’s method of image segmentation selects an optimum threshold by maximizing the between-class variance in a gray image. However, this method becomes very time-consuming when extended to a multi-level threshold problem due to the fact that a large number of iterations are required for computing the cumulative probability and the mean of a class. To greatly improve the efficiency of Otsu’s method, a new fast algorithm called the TSMO method (Two-Stage Multithreshold Otsu method) is presented. The TSMO method outperforms Otsu’s method by greatly reducing the iterations required for computing the between-class variance in an image. The experimental results show that the computational time increases exponentially for the conventional Otsu method with an average ratio of about 76. For TSMO-32, the maximum computational time is only 0.463 s when the class number M increases from two to six with relative errors of less than 1% when compared to Otsu’s method. The ratio of computational time of Otsu’s method to TSMO-32 is rather high, up to 109,708, when six classes (M = 6) in an image are used. This result indicates that the proposed method is far more efficient with an accuracy equivalent to Otsu’s method. It also has the advantage of having a small variance in runtimes for different test images.

2011-03-17
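For context, the between-class variance criterion that the TSMO entry above builds on can be sketched for the single-threshold case. This is a minimal illustration of plain Otsu thresholding in pure Python, not the TSMO algorithm itself; the function name `otsu_threshold` and the toy histogram are assumptions made for the example.

```python
def otsu_threshold(hist):
    """Return the gray level t that maximizes between-class variance.

    hist[i] is the number of pixels with gray level i.
    Class 0 holds levels <= t, class 1 holds levels > t.
    """
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))
    w0 = 0        # pixel count in class 0
    sum0 = 0.0    # sum of gray levels in class 0
    best_t, best_var = 0, -1.0
    for t, h in enumerate(hist):
        w0 += h
        sum0 += t * h
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, variance undefined
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance up to the constant factor 1 / total**2.
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


# Toy bimodal histogram over 8 gray levels (hypothetical data).
print(otsu_threshold([5, 5, 0, 0, 0, 0, 5, 5]))
```

Because the class count and class sum are updated incrementally, one threshold costs a single pass over the histogram. Extending the exhaustive search to several thresholds multiplies the number of candidate combinations, which is the cost the abstract says TSMO is designed to reduce.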

A Background Reconstruction Method Based on Double-background

In this paper, we show a new method to reconstruct and update the background. This approach is based on double-background. We use the statistical information of the pixel intensity to construct a background that represents the status during a long time, and construct another background with feedback information in motion detection that represents the recent changes at a short time. This pair of background images is fused to construct and update the background image used for motion detection. The background reconstruction algorithm can perform well on the tests that we have applied it to.

2011-03-17

Statistical Change Detection by the Pool Adjacent Violators Algorithm

In this paper we present a statistical change detection approach aimed at being robust with respect to the main disturbance factors acting in real-world applications, such as illumination changes, camera gain and exposure variations, noise. We rely on modeling the effects of disturbance factors on images as locally order-preserving transformations of pixel intensities plus additive noise. This allows us to identify within the space of all the possible image change patterns the subspace corresponding to disturbance factors effects. Hence, scene changes can be detected by a-contrario testing the hypothesis that the measured pattern is due to disturbance factors, that is by computing a distance between the pattern and the subspace. By assuming additive gaussian noise, the distance can be computed within a maximum likelihood non-parametric isotonic regression framework. In particular, the projection of the pattern onto the subspace is computed by an O(N) iterative procedure known as Pool Adjacent Violators algorithm.

2011-03-17
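The isotonic regression step named in the abstract above can be sketched as follows. This is a minimal Pool Adjacent Violators implementation in pure Python, assuming a non-decreasing fit under squared error; the names `pava` and `squared_distance` are illustrative, and how the paper orders pixel intensities or thresholds the resulting distance is not reproduced here.

```python
def pava(y):
    """Least-squares non-decreasing fit to the sequence y."""
    blocks = []  # each block is [sum of values, number of values]
    for v in y:
        blocks.append([float(v), 1])
        # Pool the last two blocks while their means violate the ordering.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            s, n = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += n
    fit = []
    for s, n in blocks:
        fit.extend([s / n] * n)  # every member of a block gets the block mean
    return fit


def squared_distance(y, fit):
    """Squared Euclidean distance between a pattern and its isotonic fit."""
    return sum((a - b) ** 2 for a, b in zip(y, fit))


pattern = [1, 3, 2, 4]          # hypothetical intensities
projection = pava(pattern)
print(projection, squared_distance(pattern, projection))
```

A pattern that is already order-preserving projects onto itself and has distance zero, so a large distance suggests a change that the disturbance model cannot explain.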

Cooperative Fusion of Stereo and Motion

This paper presents a new matching algorithm based on cooperative fusion of stereo and motion cues. In this algorithm, stereo disparity and image flow values are recovered from two successive pairs of stereo images by solving the stereo and motion corresponde...

2011-03-09

A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749)

Love, A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749).djvu Part 3 of 3

2011-02-27

A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749)

Love, A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749).djvu Part 2 of 3

2011-02-27

Love, A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749)

Love, A Treatise on Mathematical Theory of Elasticity (1944)(ISBN 0486601749) Part 1 of 3

2011-02-27

Computation of Real-Time Optical Flow Based on Corner Features

This paper describes an approach to real-time optical flow computation that combines corner features with pyramid Lucas-Kanade. Only corners, rather than all the points in the image, are used in the optical flow computation, which reduces the amount of calculation to a large extent. The experiment has shown that using this optical flow algorithm to track targets is effective and could meet the requirements of real-time applications.

2011-02-24

II-LK – A Real-Time Implementation for Sparse Optical Flow

In this paper we present an approach to speed up the computation of sparse optical flow fields by means of integral images and provide implementation details. Proposing a modification of the Lucas-Kanade energy functional allows us to use integral images and thus to speed up the method notably while affecting only slightly the quality of the computed optical flow. The approach is combined with an efficient scanline algorithm to reduce the computation of integral images to those areas where there are features to be tracked. The proposed method can speed up current surveillance algorithms used for scene description and crowd analysis.

2011-02-24
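The integral images mentioned in the II-LK abstract above let any rectangular sum be read off in four lookups, regardless of window size. A minimal sketch in pure Python follows; the function names are illustrative, and the modified Lucas-Kanade energy functional and the scanline algorithm from the paper are not reproduced here.

```python
def integral_image(img):
    """Summed-area table with one extra row and column of zeros.

    ii[y][x] holds the sum of img over rows 0..y-1 and columns 0..x-1.
    """
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def box_sum(ii, top, left, bottom, right):
    """Sum of img over rows top..bottom-1 and columns left..right-1."""
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


img = [[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]]               # hypothetical 3x3 image
ii = integral_image(img)
print(box_sum(ii, 1, 1, 3, 3))  # sum of the lower-right 2x2 window
```

Window sums of this kind are what a Lucas-Kanade style tracker accumulates around each feature, so a constant-time box sum is presumably where a speedup of the sort the abstract describes would come from.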

Medical Image Reconstruction A Conceptual Tutorial --pdf

"Medical Image Reconstruction: A Conceptual Tutorial" introduces the classical and modern image reconstruction technologies, such as two-dimensional (2D) parallel-beam and fan-beam imaging, three-dimensional (3D) parallel ray, parallel plane, and cone-beam imaging. This book presents both analytical and iterative methods of these technologies and their applications in X-ray CT (computed tomography), SPECT (single photon emission computed tomography), PET (positron emission tomography), and MRI (magnetic resonance imaging). Contemporary research results in exact region-of-interest (ROI) reconstruction with truncated projections, Katsevich's cone-beam filtered backprojection algorithm, and reconstruction with highly undersampled data with l0-minimization are also included.

2011-02-24

Extraction and recognition of license plates of motorcycles and vehicles on highways

2011-02-22

High Performance Implementation of License Plate Recognition in Image Sequences

2011-02-22

Vs-star-- A visual interpretation system for visual surveillance

2011-02-22

Robust fragments-based tracking with adaptive feature selection

2011-02-22

Robust and automated unimodal histogram thresholding and potential applications

2011-02-22

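A Study of Corner Detection Methods (角点检测方法研究) -- 毛雁明, 兰美辉

Corner detection methods are divided by implementation into two broad classes: edge-based methods and methods based on gray-level variation. Existing corner detection methods are analyzed and compared in some detail, and directions for research and development in corner detection are pointed out.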

2011-02-22

Research on Corner Detection Techniques in Image Fusion (图像融合中角点检测技术研究)

2011-02-22

Fast image region growing

2011-02-22

Simple Low Level Features for Image Analysis

2011-02-22

Direct methods for sparse matrices

second edition 2017, Oxford University Press

2024-04-07

百面机器学习.pdf

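Collects more than 100 interview questions and answers for machine learning algorithm engineers. Starting from classic machine learning areas such as feature engineering, model evaluation, and dimensionality reduction, the book builds the body of knowledge an algorithm engineer needs. Most of the questions come from real scenarios in algorithm research positions at Hulu.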

2019-06-01

CLIP-Q CVPR2018 code

CLIP-Q: Deep Network Compression Learning by In-Parallel Pruning-Quantization, CVPR2018 code

2018-10-30

Vehicle model recognition from frontal view image measurements

This paper deals with a novel vehicle manufacturer and model recognition scheme, which is enhanced by color recognition for more robust results. A probabilistic neural network is assessed as a classifier and it is demonstrated that relatively simple image processing measurements can be used to obtain high performance vehicle authentication. The proposed system is assisted by a previously developed license plate recognition, a symmetry axis detector and an image phase congruency calculation modules. The reported results indicate a high recognition rate and a fast processing time, making the system suitable for real-time applications.

2011-10-15

Vehicle Detection and Tracking in Car Video Based on Motion Model

This work aims at real-time in-car video analysis to detect and track vehicles ahead for safety, auto-driving, and target tracing. This paper describes a comprehensive approach to localize target vehicles in video under various environmental conditions. The extracted geometry features from the video are projected onto a 1D profile continuously and are tracked constantly. We rely on temporal information of features and their motion behaviors for vehicle identification, which compensates for the complexity in recognizing vehicle shapes, colors, and types. We model the motion in the field of view probabilistically according to the scene characteristic and vehicle motion model. The Hidden Markov Model is used for separating target vehicles from background, and tracking them probabilistically. We have investigated videos of day and night on different types of roads, showing that our approach is robust and effective in dealing with changes in environment and illumination, and that real time processing becomes possible for vehicle borne cameras.

2011-10-15

Projection and Least Square Fitting

Projection and Least Square Fitting with Perpendicular Offsets based Vehicle License Plate Tilt Correction

2011-10-15

An Algorithm for License Plate Recognition Applied to ITS

An algorithm for license plate recognition (LPR) applied to the intelligent transportation system is proposed on the basis of a novel shadow removal technique and character recognition algorithms. This paper has two major contributions. One contribution is a new binary method, i.e., the shadow removal method, which is based on the improved Bernsen algorithm combined with the Gaussian filter. Our second contribution is a character recognition algorithm known as support vector machine (SVM) integration. In SVM integration, character features are extracted from the elastic mesh, and the entire address character string is taken as the object of study, as opposed to a single character. This paper also presents improved techniques for image tilt correction and image gray enhancement. Our algorithm is robust to the variance of illumination, view angle, position, size, and color of the license plates when working in a complex environment. The algorithm was tested with 9026 images, such as natural-scene vehicle images using different backgrounds and ambient illumination particularly for low-resolution images. The license plates were properly located and segmented as 97.16% and 98.34%, respectively. The optical character recognition system is the SVM integration with different character features, whose performance for numerals, Kana, and address recognition reached 99.5%, 98.6%, and 97.8%, respectively. Combining the preceding tests, the overall performance of success for the license plate achieves 93.54% when the system is used for LPR in various complex conditions.

2011-10-15

A Review of Computer Vision Techniques for the Analysis of Urban Traffic

Automatic video analysis from urban surveillance cameras is a fast-emerging field based on computer vision techniques. We present here a comprehensive review of the state-of-the-art computer vision for traffic video with a critical analysis and an outlook to future research directions. This field is of increasing relevance for intelligent transport systems (ITSs). The decreasing hardware cost and, therefore, the increasing deployment of cameras have opened a wide application field for video analytics. Several monitoring objectives such as congestion, traffic rule violation, and vehicle interaction can be targeted using cameras that were typically originally installed for human operators. Systems for the detection and classification of vehicles on highways have successfully been using classical visual surveillance techniques such as background estimation and motion tracking for some time. The urban domain is more challenging with respect to traffic density, lower camera angles that lead to a high degree of occlusion, and the variety of road users. Methods from object categorization and 3-D modeling have inspired more advanced techniques to tackle these challenges. There is no commonly used data set or benchmark challenge, which makes the direct comparison of the proposed algorithms difficult. In addition, evaluation under challenging weather conditions (e.g., rain, fog, and darkness) would be desirable but is rarely performed. Future work should be directed toward robust combined detectors and classifiers for all road users, with a focus on realistic conditions during evaluation.

2011-10-15

On Improving the Efficiency of Tensor Voting

This paper proposes two alternative formulations to reduce the high computational complexity of tensor voting, a robust perceptual grouping technique used to extract salient information from noisy data. The first scheme consists of numerical approximations of the votes, which have been derived from an in-depth analysis of the plate and ball voting processes. The second scheme simplifies the formulation while keeping the same perceptual meaning of the original tensor voting: The stick tensor voting and the stick component of the plate tensor voting must reinforce surfaceness, the plate components of both the plate and ball tensor voting must boost curveness, whereas junctionness must be strengthened by the ball component of the ball tensor voting. Two new parameters have been proposed for the second formulation in order to control the potentially conflictive influence of the stick component of the plate vote and the ball component of the ball vote. Results show that the proposed formulations can be used in applications where efficiency is an issue since they have a complexity of order O(1). Moreover, the second proposed formulation has been shown to be more appropriate than the original tensor voting for estimating saliencies by appropriately setting the two new parameters.

2011-10-11

Selecting Critical Patterns Based on Local Geometrical

Pattern selection methods have been traditionally developed with a dependency on a specific classifier. In contrast, this paper presents a method that selects critical patterns deemed to carry essential information applicable to train those types of classifiers which require spatial information of the training data set. Critical patterns include those edge patterns that define the boundary and those border patterns that separate classes. The proposed method selects patterns from a new perspective, primarily based on their location in input space. It determines class edge patterns with the assistance of the approximated tangent hyperplane of a class surface. It also identifies border patterns between classes using local probability. The proposed method is evaluated on benchmark problems using popular classifiers, including multilayer perceptrons, radial basis functions, support vector machines, and nearest neighbors. The proposed approach is also compared with four state-of-the-art approaches and it is shown to provide similar but more consistent accuracy from a reduced data set. Experimental results demonstrate that it selects patterns sufficient to represent class boundary and to preserve the decision surface.

2011-10-11

Fast LOG Filtering Using Recursive Filters

Marr and Hildreth's theory of LoG filtering with multiple scales has been extensively elaborated. One problem with LoG filtering is that it is very time-consuming, especially with large filters. This paper presents a recursive convolution scheme for LoG filtering and a fast algorithm to extract zero-crossings. It has a constant computational complexity per pixel and is independent of the size of the filter. A line buffer is used to determine the locations of zero-crossings during filtering, thereby avoiding the need for an additional convolution and extra memory units. Various images have been tested
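
To make the cost problem concrete, here is a minimal sketch of plain LoG filtering followed by zero-crossing extraction. It is not the recursive scheme from the paper: it uses direct convolution, whose per-pixel cost grows with the kernel area, which is exactly what the recursive formulation is meant to avoid. The function names (`log_kernel`, `filter2d`, `zero_crossings`) and the `min_jump` threshold are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def log_kernel(sigma, radius=None):
    """Sampled Laplacian-of-Gaussian kernel, shifted so that it sums to zero."""
    if radius is None:
        radius = int(np.ceil(4 * sigma))  # assumed truncation at 4 sigma
    ax = np.arange(-radius, radius + 1)
    x, y = np.meshgrid(ax, ax)
    r2 = x ** 2 + y ** 2
    k = (r2 - 2 * sigma ** 2) / sigma ** 4 * np.exp(-r2 / (2 * sigma ** 2))
    return k - k.mean()

def filter2d(image, kernel):
    """Direct 2-D filtering with edge padding; cost per pixel ~ kernel area."""
    r = kernel.shape[0] // 2
    h, w = image.shape
    padded = np.pad(image.astype(float), r, mode="edge")
    out = np.zeros((h, w), dtype=float)
    for dy in range(kernel.shape[0]):
        for dx in range(kernel.shape[1]):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out

def zero_crossings(response, min_jump):
    """Mark pixels whose right or lower neighbour has the opposite sign.

    min_jump (hypothetical parameter) rejects sign flips caused by
    near-zero noise in flat regions.
    """
    zc = np.zeros(response.shape, dtype=bool)
    left, right = response[:, :-1], response[:, 1:]
    up, down = response[:-1, :], response[1:, :]
    zc[:, :-1] |= (left * right < 0) & (np.abs(left - right) > min_jump)
    zc[:-1, :] |= (up * down < 0) & (np.abs(up - down) > min_jump)
    return zc

# Synthetic vertical step edge between columns 15 and 16.
image = np.zeros((32, 32))
image[:, 16:] = 1.0
edges = zero_crossings(filter2d(image, log_kernel(2.0)), min_jump=0.1)
```

With `sigma = 2.0` the kernel is 17 x 17, so each output pixel costs 289 multiply-adds here; doubling `sigma` roughly quadruples that, whereas a recursive filter of the kind the abstract describes keeps the per-pixel cost fixed.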

2011-10-11

A discrete expression of Canny's criteria for step

Optimal filters for edge detection are usually developed in the continuous domain and then transposed by sampling to the discrete domain. Simpler filters are directly defined in the discrete domain. We define criteria to compare filter performances in the discrete domain. Canny has defined (1983, 1986) three criteria to derive the equation of an optimal filter for step edge detection: good detection, good localization, and low multiplicity of responses. These criteria seem to be good candidates for filter comparison. Unfortunately, they have been developed in the continuous domain, and their analytical expressions cannot be used in the discrete domain. We establish three criteria with the same meaning as Canny's.
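
For reference, the continuous-domain criteria are usually written as follows for a filter f with support [-W, W], a step edge of amplitude A, and white noise of standard deviation n_0. This is the commonly cited continuous form from Canny (1986), given here as background only; the discrete criteria established in the paper are not reproduced, and the symbols Σ, Λ, and x_zc are the conventional ones rather than names taken from this abstract.

```latex
% Detection: signal-to-noise ratio of the filter response at the edge
\Sigma(f) = \frac{A \left| \int_{-W}^{0} f(x)\,dx \right|}
                 {n_0 \sqrt{\int_{-W}^{W} f^{2}(x)\,dx}}

% Localization: reciprocal of the RMS displacement of the detected edge
\Lambda(f) = \frac{A \left| f'(0) \right|}
                  {n_0 \sqrt{\int_{-W}^{W} f'^{2}(x)\,dx}}

% Multiple responses: mean distance between zero-crossings of the
% derivative of the filter's response to noise
x_{zc}(f) = \pi \left( \frac{\int_{-\infty}^{\infty} f'^{2}(x)\,dx}
                            {\int_{-\infty}^{\infty} f''^{2}(x)\,dx} \right)^{1/2}
```

All three expressions rely on integrals and derivatives of f, which is why they cannot be evaluated directly on a filter that is only defined at sample points.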

2011-10-11
