matlab intergral,matlab學習:人臉識別之HOG(Histograms of Oriented Gradients)

HOG descriptors 是應用在計算機視覺和圖像處理領域,用於目標檢測的特征描述器。這項技術是用來計算局部圖像梯度的方向信息的統計值。這種方法跟邊緣方向直方圖(edge orientation histograms)、尺度不變特征變換(scale-invariant feature transform descriptors) 以及形狀上下文方法( shape contexts)有很多相似之處,但與它們的不同點是:HOG描述器是在一個網格密集的大小統一的細胞單元(dense grid of uniformly spaced cells)上計算,而且為了提高性能,還采用了重疊的局部對比度歸一化(overlapping local contrast normalization)技術。

這篇文章的作者Navneet Dalal和Bill Triggs是法國國家計算機技術和控制研究所French National Institute for Research in Computer Science and Control (INRIA)的研究員。他們在這篇文章中首次提出了HOG方法。這篇文章被發表在2005年的CVPR上。他們主要是將這種方法應用在靜態圖像中的行人 檢測上,但在后來,他們也將其應用在電影和視頻中的行人檢測,以及靜態圖像中的車輛和常見動物的檢測。

HOG描述器最重要的思想是:在一副 圖像中,局部目標的表象和形狀(appearance and shape)能夠被梯度或邊緣的方向密度分布很好地描述。具體的實現方法是:首先將圖像分成小的連通區域,我們把它叫細胞單元。然后采集細胞單元中各像素 點的梯度的或邊緣的方向直方圖。最后把這些直方圖組合起來就可以構成特征描述器。為了提高性能,我們還可以把這些局部直方圖在圖像的更大的范圍內(我們把 它叫區間或block)進行對比度歸一化(contrast-normalized),所采用的方法是:先計算各直方圖在這個區間(block)中的密 度,然后根據這個密度對區間中的各個細胞單元做歸一化。通過這個歸一化后,能對光照變化和陰影獲得更好的效果。

與其他的特征描述方法相 比,HOG描述器后很多優點。首先,由於HOG方法是在圖像的局部細胞單元上操作,所以它對圖像幾何的(geometric)和光學的 (photometric)形變都能保持很好的不變性,這兩種形變只會出現在更大的空間領域上。其次,作者通過實驗發現,在粗的空域抽樣(coarse spatial sampling)、精細的方向抽樣(fine orientation sampling)以及較強的局部光學歸一化(strong local photometric normalization)等條件下,只要行人大體上能夠保持直立的姿勢,就容許行人有一些細微的肢體動作,這些細微的動作可以被忽略而不影響檢測效 果。綜上所述,HOG方法是特別適合於做圖像中的行人檢測的。

上圖是作者做的行人檢測試驗,其中(a)表示所有訓練圖像集 的平均梯度(average gradient across their training images);(b)和(c)分別表示:圖像中每一個區間(block)上的最大最大正、負SVM權值;(d)表示一副測試圖像;(e)計算完R- HOG后的測試圖像;(f)和(g)分別表示被正、負SVM權值加權后的R-HOG圖像。

算法的實現:

色彩和伽馬歸一化 (color and gamma normalization)

作者分別在灰度空間、RGB色彩空間和LAB色彩空間上對圖像進行色彩和 伽馬歸一化,但實驗結果顯示,這個歸一化的預處理工作對最后的結果沒有影響,原因可能是:在后續步驟中也有歸一化的過程,那些過程可以取代這個預處理的歸 一化。所以,在實際應用中,這一步可以省略。

梯度的計算(Gradient computation)

最常用的方法是:簡單 地使用一個一維的離散微分模板(1-D centered point discrete derivative mask)在一個方向上或者同時在水平和垂直兩個方向上對圖像進行處理,更確切地說,這個方法需要使用下面的濾波器核濾除圖像中的色彩或變化劇烈的數據 (color or intensity data)

作者也嘗試了其他一些更復雜的模板,如3×3 Sobel 模板,或對角線模板(diagonal masks),但是在這個行人檢測的實驗中,這些復雜模板的表現都較差,所以作者的結論是:模板越簡單,效果反而越好。作者也嘗試了在使用微分模板前加入 一個高斯平滑濾波,但是這個高斯平滑濾波的加入使得檢測效果更差,原因是:許多有用的圖像信息是來自變化劇烈的邊緣,而在計算梯度之前加入高斯濾波會把這 些邊緣濾除掉。

構建方向的直方圖(creating the orientation histograms)

第三步就是為 圖像的每個細胞單元構建梯度方向直方圖。細胞單元中的每一個像素點都為某個基於方向的直方圖通道(orientation-based histogram channel)投票。投票是采取加權投票(weighted voting)的方式,即每一票都是帶權值的,這個權值是根據該像素點的梯度幅度計算出來。可以采用幅值本身或者它的函數來表示這個權值,實際測試表明: 使用幅值來表示權值能獲得最佳的效果,當然,也可以選擇幅值的函數來表示,比如幅值的平方根(square root)、幅值的平方(square of the gradient magnitude)、幅值的截斷形式(clipped version of the magnitude)等。細胞單元可以是矩形的(rectangular),也可以是星形的(radial)。直方圖通道是平均分布在0-1800(無 向)或0-3600(有向)范圍內。作者發現,采用無向的梯度和9個直方圖通道,能在行人檢測試驗中取得最佳的效果。

把細胞單元組 合成大的區間(grouping the cells together into larger blocks)

由於局部光照的變化 (variations of illumination)以及前景-背景對比度(foreground-background contrast)的變化,使得梯度強度(gradient strengths)的變化范圍非常大。這就需要對梯度強度做歸一化,作者采取的辦法是:把各個細胞單元組合成大的、空間上連通的區間(blocks)。 這樣以來,HOG描述器就變成了由各區間所有細胞單元的直方圖成分所組成的一個向量。這些區間是互有重疊的,這就意味着:每一個細胞單元的輸出都多次作用 於最終的描述器。區間有兩個主要的幾何形狀——矩形區間(R-HOG)和環形區間(C-HOG)。R-HOG區間大體上是一些方形的格子,它可以有三個參 數來表征:每個區間中細胞單元的數目、每個細胞單元中像素點的數目、每個細胞的直方圖通道數目。作者通過實驗表明,行人檢測的最佳參數設置是:3×3細胞 /區間、6×6像素/細胞、9個直方圖通道。作者還發現,在對直方圖做處理之前,給每個區間(block)加一個高斯空域窗口(Gaussian spatial window)是非常必要的,因為這樣可以降低邊緣的周圍像素點(pixels around the edge)的權重。

R- HOG跟SIFT描述器看起來很相似,但他們的不同之處是:R-HOG是在單一尺度下、密集的網格內、沒有對方向排序的情況下被計算出來(are computed in dense grids at some single scale without orientation alignment);而SIFT描述器是在多尺度下、稀疏的圖像關鍵點上、對方向排序的情況下被計算出來(are computed at sparse scale-invariant key image points and are rotated to align orientation)。補充一點,R-HOG是各區間被組合起來用於對空域信息進行編碼(are used in conjunction to encode spatial form information),而SIFT的各描述器是單獨使用的(are used singly)。

C- HOG區間(blocks)有兩種不同的形式,它們的區別在於:一個的中心細胞是完整的,一個的中心細胞是被分割的。如右圖所示:

作者發現 C-HOG的這兩種形式都能取得相同的效果。C-HOG區間(blocks)可以用四個參數來表征:角度盒子的個數(number of angular bins)、半徑盒子個數(number of radial bins)、中心盒子的半徑(radius of the center bin)、半徑的伸展因子(expansion factor for the radius)。通過實驗,對於行人檢測,最佳的參數設置為:4個角度盒子、2個半徑盒子、中心盒子半徑為4個像素、伸展因子為2。前面提到過,對於R- HOG,中間加一個高斯空域窗口是非常有必要的,但對於C-HOG,這顯得沒有必要。C-HOG看起來很像基於形狀上下文(Shape Contexts)的方法,但不同之處是:C-HOG的區間中包含的細胞單元有多個方向通道(orientation channels),而基於形狀上下文的方法僅僅只用到了一個單一的邊緣存在數(edge presence count)。

區間歸一化 (Block normalization)

作者采用了四中不同的方法對區間進行歸一化,並對結果進行了比較。引入v表示一個還沒有被歸一 化的向量,它包含了給定區間(block)的所有直方圖信息。| | vk | |表示v的k階范數,這里的k去1、2。用e表示一個很小的常數。這時,歸一化因子可以表示如下:

L2-norm:

L1-norm:

L1-sqrt:

還 有第四種歸一化方式:L2-Hys,它可以通過先進行L2-norm,對結果進行截短(clipping),然后再重新歸一化得到。作者發現:采用L2- Hys L2-norm 和 L1-sqrt方式所取得的效果是一樣的,L1-norm稍微表現出一點點不可靠性。但是對於沒有被歸一化的數據來說,這四種方法都表現出來顯着的改進。

SVM 分類器(SVM classifier)

最后一步就是把提取的HOG特征輸入到SVM分類器中,尋找一個最優超平面作為決策函數。作者采用 的方法是:使用免費的SVMLight軟件包加上HOG分類器來尋找測試圖像中的行人。

I=imread('05598.jpg');

c=imresize(I,[64 64],'bilinear');

f=hogcalculator(c,8,8,2,2,9,0.5,'localinterpolate','unsigned','l2hys');I=imread('05598.jpg');

c=imresize(I,[64 64],'bilinear');

f=hogcalculator(c,8,8,2,2,9,0.5,'localinterpolate','unsigned','l2hys');

函數舉例,將讀取圖像變成64*64,然后block=(8*2)*(8*2)=16*16 ,cell=8*8  bins=9

function F = hogcalculator(img, cellpw, cellph, nblockw, nblockh,...

nthet, overlap, isglobalinterpolate, issigned, normmethod)

% HOGCALCULATOR calculate R-HOG feature vector of an input image using the

% procedure presented in Dalal and Triggs's paper in CVPR 2005.

%

% Author:   timeHandle

% Time:     March 24, 2010

%           May 12,2010 update.

%

%       this copy of code is written for my personal interest, which is an

%       original and inornate realization of [Dalal CVPR2005]'s algorithm

%       without any optimization. I just want to check whether I understand

%       the algorithm really or not, and also do some practices for knowing

%       matlab programming more well because I could be called as 'novice'.

%       OpenCV 2.0 has realized Dalal's HOG algorithm which runs faster

%       than mine without any doubt, ╮(╯▽╰)╭ . Ronan pointed a error in

%       the code,thanks for his correction. Note that at the end of this

%       code, there are some demonstration code,please remove in your work.

%

% F = hogcalculator(img, cellpw, cellph, nblockw, nblockh,

%    nthet, overlap, isglobalinterpolate, issigned, normmethod)

%

% IMG:

%       IMG is the input image.

%

% CELLPW, CELLPH:

%       CELLPW and CELLPH are cell's pixel width and height respectively.

%

% NBLOCKW, NBLCOKH:

%       NBLOCKW and NBLCOKH are block size counted by cells number in x and

%       y directions respectively.

%

% NTHET, ISSIGNED:

%       NTHET is the number of the bins of the histogram of oriented

%       gradient. The histogram of oriented gradient ranges from 0 to pi in

%       'unsigned' condition while to 2*pi in 'signed' condition, which can

%       be specified through setting the value of the variable ISSIGNED by

%       the string 'unsigned' or 'signed'.

%

% OVERLAP:

%       OVERLAP is the overlap proportion of two neighboring block.

%

% ISGLOBALINTERPOLATE:

%       ISGLOBALINTERPOLATE specifies whether the trilinear interpolation(三線性插值)

%       is done in a single global 3d histogram of the whole detecting

%       window by the string 'globalinterpolate' or in each local 3d

%       histogram corresponding to respective blocks by the string

%       'localinterpolate' which is in strict accordance with the procedure

%       proposed in Dalal's paper. Interpolating in the whole detecting

%       window requires the block's sliding step to be an integral multiple

%       of cell's width and height because the histogram is fixing before

%       interpolate. In fact here the so called 'global interpolation' is

%       a notation given by myself. at first the spatial interpolation is

%       done without any relevant to block's slide position, but when I was

%       doing calculation while OVERLAP is 0.75, something occurred and

%       confused me o__O"… . This let me find that the operation I firstly

%       did is different from which mentioned in Dalal's paper. But this

%       does not mean it is incorrect ^◎^, so I reserve this. As for name,

%       besides 'global interpolate', any others would be all ok, like 'Lady GaGa'

%       or what else, :-).

%

% NORMMETHOD:

%       NORMMETHOD is the block histogram normalized method which can be

%       set as one of the following strings:

%               'none', which means non-normalization;

%               'l1', which means L1-norm normalization;

%               'l2', which means L2-norm normalization;

%               'l1sqrt', which means L1-sqrt-norm normalization;

%               'l2hys', which means L2-hys-norm normalization.

% F:

%       F is a row vector storing the final histogram of all of the blocks

%       one by one in a top-left to bottom-right image scan manner, the

%       cells histogram are stored in the same manner in each block's

%       section of F.

%

% Note that CELLPW*NBLOCKW and CELLPH*NBLOCKH should be equal to IMG's

% width and height respectively.

%

% Here is a demonstration, which all of parameters are set as the

% best value mentioned in Dalal's paper when the window detected is 128*64

% size(128 rows, 64 columns):

%       F = hogcalculator(window, 8, 8, 2, 2, 9, 0.5,

%                               'localinterpolate', 'unsigned', 'l2hys');

% Also the function can be called like:

%       F = hogcalculator(window);

% the other parameters are all set by using the above-mentioned "dalal's

% best value" as default.

%

if nargin 

% set default parameters value.

cellpw = 2;

cellph = 2;

nblockw = 2;

nblockh = 2;

nthet = 9;

overlap = 0.5;

isglobalinterpolate = 'localinterpolate';

issigned = 'unsigned';

normmethod = 'l2hys';

else

if nargin 

error('Input parameters are not enough.');

end

end

% check parameters's validity.

[M, N, K] = size(img);

if mod(M,cellph*nblockh) ~= 0

error('IMG''s height should be an integral multiple of CELLPH*NBLOCKH.');

end

if mod(N,cellpw*nblockw) ~= 0

error('IMG''s width should be an integral multiple of CELLPW*NBLOCKW.');

end

if mod((1-overlap)*cellpw*nblockw, cellpw) ~= 0 ||...

mod((1-overlap)*cellph*nblockh, cellph) ~= 0

str1 = 'Incorrect OVERLAP or ISGLOBALINTERPOLATE parameter';

str2 = ', slide step should be an intergral multiple of cell size';

error([str1, str2]);

end

% set the standard deviation of gaussian spatial weight window.

delta = cellpw*nblockw * 0.5;

% calculate gradient scale matrix.

hx = [-1,0,1];

hy = -hx';

gradscalx = imfilter(double(img),hx);

gradscaly = imfilter(double(img),hy);

if K > 1

gradscalx = max(max(gradscalx(:,:,1),gradscalx(:,:,2)), gradscalx(:,:,3));

gradscaly = max(max(gradscaly(:,:,1),gradscaly(:,:,2)), gradscaly(:,:,3));

end

gradscal = sqrt(double(gradscalx.*gradscalx + gradscaly.*gradscaly));

% calculate gradient orientation matrix.

% plus small number for avoiding dividing zero.

gradscalxplus = gradscalx+ones(size(gradscalx))*0.0001;

gradorient = zeros(M,N);

% unsigned situation: orientation region is 0 to pi.

if strcmp(issigned, 'unsigned') == 1

gradorient =...

atan(gradscaly./gradscalxplus) + pi/2;

or = 1;

else

% signed situation: orientation region is 0 to 2*pi.

if strcmp(issigned, 'signed') == 1

idx = find(gradscalx >= 0 & gradscaly >= 0);

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx));

idx = find(gradscalx 

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx)) + pi;

idx = find(gradscalx >= 0 & gradscaly 

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx)) + 2*pi;

or = 2;

else

error('Incorrect ISSIGNED parameter.');

end

end

% calculate block slide step.

xbstride = cellpw*nblockw*(1-overlap);

ybstride = cellph*nblockh*(1-overlap);

xbstridend = N - cellpw*nblockw + 1;

ybstridend = M - cellph*nblockh + 1;

% calculate the total blocks number in the window detected, which is

% ntotalbh*ntotalbw.

ntotalbh = ((M-cellph*nblockh)/ybstride)+1;

ntotalbw = ((N-cellpw*nblockw)/xbstride)+1;

% generate the matrix hist3dbig for storing the 3-dimensions histogram. the

% matrix covers the whole image in the 'globalinterpolate' condition or

% covers the local block in the 'localinterpolate' condition. The matrix is

% bigger than the area where it covers by adding additional elements

% (corresponding to the cells) to the surround for calculation convenience.

if strcmp(isglobalinterpolate, 'globalinterpolate') == 1

ncellx = N / cellpw;

ncelly = M / cellph;

hist3dbig = zeros(ncelly+2, ncellx+2, nthet+2);

F = zeros(1, (M/cellph-1)*(N/cellpw-1)*nblockw*nblockh*nthet);

glbalinter = 1;

else

if strcmp(isglobalinterpolate, 'localinterpolate') == 1

hist3dbig = zeros(nblockh+2, nblockw+2, nthet+2);

F = zeros(1, ntotalbh*ntotalbw*nblockw*nblockh*nthet);

glbalinter = 0;

else

error('Incorrect ISGLOBALINTERPOLATE parameter.')

end

end

% generate the matrix for storing histogram of one block;

sF = zeros(1, nblockw*nblockh*nthet);

% generate gaussian spatial weight.

[gaussx, gaussy] = meshgrid(0:(cellpw*nblockw-1), 0:(cellph*nblockh-1));

weight = exp(-((gaussx-(cellpw*nblockw-1)/2)...

.*(gaussx-(cellpw*nblockw-1)/2)+(gaussy-(cellph*nblockh-1)/2)...

.*(gaussy-(cellph*nblockh-1)/2))/(delta*delta));

% vote for histogram. there are two situations according to the interpolate

% condition('global' interpolate or local interpolate). The hist3d which is

% generated from the 'bigger' matrix hist3dbig is the final histogram.

if glbalinter == 1

xbstep = nblockw*cellpw;

ybstep = nblockh*cellph;

else

xbstep = xbstride;

ybstep = ybstride;

end

% block slide loop

for btly = 1:ybstep:ybstridend

for btlx = 1:xbstep:xbstridend

for bi = 1:(cellph*nblockh)

for bj = 1:(cellpw*nblockw)

i = btly + bi - 1;

j = btlx + bj - 1;

gaussweight = weight(bi,bj);

gs = gradscal(i,j);

go = gradorient(i,j);

if glbalinter == 1

jorbj = j;

iorbi = i;

else

jorbj = bj;

iorbi = bi;

end

% calculate bin index of hist3dbig

binx1 = floor((jorbj-1+cellpw/2)/cellpw) + 1;

biny1 = floor((iorbi-1+cellph/2)/cellph) + 1;

binz1 = floor((go+(or*pi/nthet)/2)/(or*pi/nthet)) + 1;

if gs == 0

continue;

end

binx2 = binx1 + 1;

biny2 = biny1 + 1;

binz2 = binz1 + 1;

x1 = (binx1-1.5)*cellpw + 0.5;

y1 = (biny1-1.5)*cellph + 0.5;

z1 = (binz1-1.5)*(or*pi/nthet);

% trilinear interpolation.

hist3dbig(biny1,binx1,binz1) =...

hist3dbig(biny1,binx1,binz1) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx1,binz2) =...

hist3dbig(biny1,binx1,binz2) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx1,binz1) =...

hist3dbig(biny2,binx1,binz1) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx1,binz2) =...

hist3dbig(biny2,binx1,binz2) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx2,binz1) =...

hist3dbig(biny1,binx2,binz1) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx2,binz2) =...

hist3dbig(biny1,binx2,binz2) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx2,binz1) =...

hist3dbig(biny2,binx2,binz1) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx2,binz2) =...

hist3dbig(biny2,binx2,binz2) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

end

end

% In the local interpolate condition. F is generated in this block

% slide loop. hist3dbig should be cleared in each loop.

if glbalinter == 0

if or == 2

hist3dbig(:,:,2) = hist3dbig(:,:,2)...

+ hist3dbig(:,:,nthet+2);

hist3dbig(:,:,(nthet+1)) =...

hist3dbig(:,:,(nthet+1)) + hist3dbig(:,:,1);

end

hist3d = hist3dbig(2:(nblockh+1), 2:(nblockw+1), 2:(nthet+1));

for ibin = 1:nblockh

for jbin = 1:nblockw

idsF = nthet*((ibin-1)*nblockw+jbin-1)+1;

idsF = idsF:(idsF+nthet-1);

sF(idsF) = hist3d(ibin,jbin,:);

end

end

iblock = ((btly-1)/ybstride)*ntotalbw +...

((btlx-1)/xbstride) + 1;

idF = (iblock-1)*nblockw*nblockh*nthet+1;

idF = idF:(idF+nblockw*nblockh*nthet-1);

F(idF) = sF;

hist3dbig(:,:,:) = 0;

end

end

end

% In the global interpolate condition. F is generated here outside the

% block slide loop

if glbalinter == 1

ncellx = N / cellpw;

ncelly = M / cellph;

if or == 2

hist3dbig(:,:,2) = hist3dbig(:,:,2) + hist3dbig(:,:,nthet+2);

hist3dbig(:,:,(nthet+1)) = hist3dbig(:,:,(nthet+1)) + hist3dbig(:,:,1);

end

hist3d = hist3dbig(2:(ncelly+1), 2:(ncellx+1), 2:(nthet+1));

iblock = 1;

for btly = 1:ybstride:ybstridend

for btlx = 1:xbstride:xbstridend

binidx = floor((btlx-1)/cellpw)+1;

binidy = floor((btly-1)/cellph)+1;

idF = (iblock-1)*nblockw*nblockh*nthet+1;

idF = idF:(idF+nblockw*nblockh*nthet-1);

for ibin = 1:nblockh

for jbin = 1:nblockw

idsF = nthet*((ibin-1)*nblockw+jbin-1)+1;

idsF = idsF:(idsF+nthet-1);

sF(idsF) = hist3d(binidy+ibin-1, binidx+jbin-1, :);

end

end

F(idF) = sF;

iblock = iblock + 1;

end

end

end

% adjust the negative value caused by accuracy of floating-point

% operations.these value's scale is very small, usually at E-03 magnitude

% while others will be E+02 or E+03 before normalization.

F(F<0) = 0;

% block normalization.

e = 0.001;

l2hysthreshold = 0.2;

fslidestep = nblockw*nblockh*nthet;

switch normmethod

case 'none'

case 'l1'

for fi = 1:fslidestep:size(F,2)

div = sum(F(fi:(fi+fslidestep-1)));

F(fi:(fi+fslidestep-1)) = F(fi:(fi+fslidestep-1))/(div+e);

end

case 'l1sqrt'

for fi = 1:fslidestep:size(F,2)

div = sum(F(fi:(fi+fslidestep-1)));

F(fi:(fi+fslidestep-1)) = sqrt(F(fi:(fi+fslidestep-1))/(div+e));

end

case 'l2'

for fi = 1:fslidestep:size(F,2)

sF = F(fi:(fi+fslidestep-1)).*F(fi:(fi+fslidestep-1));

div = sqrt(sum(sF)+e*e);

F(fi:(fi+fslidestep-1)) = F(fi:(fi+fslidestep-1))/div;

end

case 'l2hys'

for fi = 1:fslidestep:size(F,2)

sF = F(fi:(fi+fslidestep-1)).*F(fi:(fi+fslidestep-1));

div = sqrt(sum(sF)+e*e);

sF = F(fi:(fi+fslidestep-1))/div;

sF(sF>l2hysthreshold) = l2hysthreshold;

div = sqrt(sum(sF.*sF)+e*e);

F(fi:(fi+fslidestep-1)) = sF/div;

end

otherwise

error('Incorrect NORMMETHOD parameter.');

end

% the following code, which can be removed because of having no

% contributions to HOG feature calculation, are just for result

% demonstration when the trilinear interpolation is 'global' for this

% condition is easier to give a simple and intuitive illustration. so in

% 'local' condition it will produce error.

% figure;

% hold on;

% axis equal;

% xlim([0, N]);

% ylim([0, M]);

% for u = 1:(M/cellph)

%     for v = 1:(N/cellpw)

%         cx = (v-1)*cellpw + cellpw/2 + 0.5;

%         cy = (u-1)*cellph + cellph/2 + 0.5;

%         hist3d(u,v,:)=0.9*min(cellpw,cellph)*hist3d(u,v,:)/max(hist3d(u,v,:));

%         for t = 1:nthet

%             s = hist3d(u,v,t);

%             thet = (t-1)*pi/nthet + pi*0.5/nthet;

%             x1 = cx - s*0.5*cos(thet);

%             x2 = cx + s*0.5*cos(thet);

%             y1 = cy - s*0.5*sin(thet);

%             y2 = cy + s*0.5*sin(thet);

%             plot([x1,x2],[M-y1+1,M-y2+1]);

%         end

%     end

% endfunction F = hogcalculator(img, cellpw, cellph, nblockw, nblockh,...

nthet, overlap, isglobalinterpolate, issigned, normmethod)

% HOGCALCULATOR calculate R-HOG feature vector of an input image using the

% procedure presented in Dalal and Triggs's paper in CVPR 2005.

%

% Author: timeHandle

% Time: March 24, 2010

% May 12,2010 update.

%

% this copy of code is written for my personal interest, which is an

% original and inornate realization of [Dalal CVPR2005]'s algorithm

% without any optimization. I just want to check whether I understand

% the algorithm really or not, and also do some practices for knowing

% matlab programming more well because I could be called as 'novice'.

% OpenCV 2.0 has realized Dalal's HOG algorithm which runs faster

% than mine without any doubt, ╮(╯▽╰)╭ . Ronan pointed a error in

% the code,thanks for his correction. Note that at the end of this

% code, there are some demonstration code,please remove in your work.

%

% F = hogcalculator(img, cellpw, cellph, nblockw, nblockh,

% nthet, overlap, isglobalinterpolate, issigned, normmethod)

%

% IMG:

% IMG is the input image.

%

% CELLPW, CELLPH:

% CELLPW and CELLPH are cell's pixel width and height respectively.

%

% NBLOCKW, NBLCOKH:

% NBLOCKW and NBLCOKH are block size counted by cells number in x and

% y directions respectively.

%

% NTHET, ISSIGNED:

% NTHET is the number of the bins of the histogram of oriented

% gradient. The histogram of oriented gradient ranges from 0 to pi in

% 'unsigned' condition while to 2*pi in 'signed' condition, which can

% be specified through setting the value of the variable ISSIGNED by

% the string 'unsigned' or 'signed'.

%

% OVERLAP:

% OVERLAP is the overlap proportion of two neighboring block.

%

% ISGLOBALINTERPOLATE:

% ISGLOBALINTERPOLATE specifies whether the trilinear interpolation(三線性插值)

% is done in a single global 3d histogram of the whole detecting

% window by the string 'globalinterpolate' or in each local 3d

% histogram corresponding to respective blocks by the string

% 'localinterpolate' which is in strict accordance with the procedure

% proposed in Dalal's paper. Interpolating in the whole detecting

% window requires the block's sliding step to be an integral multiple

% of cell's width and height because the histogram is fixing before

% interpolate. In fact here the so called 'global interpolation' is

% a notation given by myself. at first the spatial interpolation is

% done without any relevant to block's slide position, but when I was

% doing calculation while OVERLAP is 0.75, something occurred and

% confused me o__O"… . This let me find that the operation I firstly

% did is different from which mentioned in Dalal's paper. But this

% does not mean it is incorrect ^◎^, so I reserve this. As for name,

% besides 'global interpolate', any others would be all ok, like 'Lady GaGa'

% or what else, :-).

%

% NORMMETHOD:

% NORMMETHOD is the block histogram normalized method which can be

% set as one of the following strings:

% 'none', which means non-normalization;

% 'l1', which means L1-norm normalization;

% 'l2', which means L2-norm normalization;

% 'l1sqrt', which means L1-sqrt-norm normalization;

% 'l2hys', which means L2-hys-norm normalization.

% F:

% F is a row vector storing the final histogram of all of the blocks

% one by one in a top-left to bottom-right image scan manner, the

% cells histogram are stored in the same manner in each block's

% section of F.

%

% Note that CELLPW*NBLOCKW and CELLPH*NBLOCKH should be equal to IMG's

% width and height respectively.

%

% Here is a demonstration, which all of parameters are set as the

% best value mentioned in Dalal's paper when the window detected is 128*64

% size(128 rows, 64 columns):

% F = hogcalculator(window, 8, 8, 2, 2, 9, 0.5,

% 'localinterpolate', 'unsigned', 'l2hys');

% Also the function can be called like:

% F = hogcalculator(window);

% the other parameters are all set by using the above-mentioned "dalal's

% best value" as default.

%

if nargin < 2

% set default parameters value.

cellpw = 2;

cellph = 2;

nblockw = 2;

nblockh = 2;

nthet = 9;

overlap = 0.5;

isglobalinterpolate = 'localinterpolate';

issigned = 'unsigned';

normmethod = 'l2hys';

else

if nargin < 10

error('Input parameters are not enough.');

end

end

% check parameters's validity.

[M, N, K] = size(img);

if mod(M,cellph*nblockh) ~= 0

error('IMG''s height should be an integral multiple of CELLPH*NBLOCKH.');

end

if mod(N,cellpw*nblockw) ~= 0

error('IMG''s width should be an integral multiple of CELLPW*NBLOCKW.');

end

if mod((1-overlap)*cellpw*nblockw, cellpw) ~= 0 ||...

mod((1-overlap)*cellph*nblockh, cellph) ~= 0

str1 = 'Incorrect OVERLAP or ISGLOBALINTERPOLATE parameter';

str2 = ', slide step should be an intergral multiple of cell size';

error([str1, str2]);

end

% set the standard deviation of gaussian spatial weight window.

delta = cellpw*nblockw * 0.5;

% calculate gradient scale matrix.

hx = [-1,0,1];

hy = -hx';

gradscalx = imfilter(double(img),hx);

gradscaly = imfilter(double(img),hy);

if K > 1

gradscalx = max(max(gradscalx(:,:,1),gradscalx(:,:,2)), gradscalx(:,:,3));

gradscaly = max(max(gradscaly(:,:,1),gradscaly(:,:,2)), gradscaly(:,:,3));

end

gradscal = sqrt(double(gradscalx.*gradscalx + gradscaly.*gradscaly));

% calculate gradient orientation matrix.

% plus small number for avoiding dividing zero.

gradscalxplus = gradscalx+ones(size(gradscalx))*0.0001;

gradorient = zeros(M,N);

% unsigned situation: orientation region is 0 to pi.

if strcmp(issigned, 'unsigned') == 1

gradorient =...

atan(gradscaly./gradscalxplus) + pi/2;

or = 1;

else

% signed situation: orientation region is 0 to 2*pi.

if strcmp(issigned, 'signed') == 1

idx = find(gradscalx >= 0 & gradscaly >= 0);

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx));

idx = find(gradscalx < 0);

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx)) + pi;

idx = find(gradscalx >= 0 & gradscaly < 0);

gradorient(idx) = atan(gradscaly(idx)./gradscalxplus(idx)) + 2*pi;

or = 2;

else

error('Incorrect ISSIGNED parameter.');

end

end

% calculate block slide step.

xbstride = cellpw*nblockw*(1-overlap);

ybstride = cellph*nblockh*(1-overlap);

xbstridend = N - cellpw*nblockw + 1;

ybstridend = M - cellph*nblockh + 1;

% calculate the total blocks number in the window detected, which is

% ntotalbh*ntotalbw.

ntotalbh = ((M-cellph*nblockh)/ybstride)+1;

ntotalbw = ((N-cellpw*nblockw)/xbstride)+1;

% generate the matrix hist3dbig for storing the 3-dimensions histogram. the

% matrix covers the whole image in the 'globalinterpolate' condition or

% covers the local block in the 'localinterpolate' condition. The matrix is

% bigger than the area where it covers by adding additional elements

% (corresponding to the cells) to the surround for calculation convenience.

if strcmp(isglobalinterpolate, 'globalinterpolate') == 1

ncellx = N / cellpw;

ncelly = M / cellph;

hist3dbig = zeros(ncelly+2, ncellx+2, nthet+2);

F = zeros(1, (M/cellph-1)*(N/cellpw-1)*nblockw*nblockh*nthet);

glbalinter = 1;

else

if strcmp(isglobalinterpolate, 'localinterpolate') == 1

hist3dbig = zeros(nblockh+2, nblockw+2, nthet+2);

F = zeros(1, ntotalbh*ntotalbw*nblockw*nblockh*nthet);

glbalinter = 0;

else

error('Incorrect ISGLOBALINTERPOLATE parameter.')

end

end

% generate the matrix for storing histogram of one block;

sF = zeros(1, nblockw*nblockh*nthet);

% generate gaussian spatial weight.

[gaussx, gaussy] = meshgrid(0:(cellpw*nblockw-1), 0:(cellph*nblockh-1));

weight = exp(-((gaussx-(cellpw*nblockw-1)/2)...

.*(gaussx-(cellpw*nblockw-1)/2)+(gaussy-(cellph*nblockh-1)/2)...

.*(gaussy-(cellph*nblockh-1)/2))/(delta*delta));

% vote for histogram. there are two situations according to the interpolate

% condition('global' interpolate or local interpolate). The hist3d which is

% generated from the 'bigger' matrix hist3dbig is the final histogram.

if glbalinter == 1

xbstep = nblockw*cellpw;

ybstep = nblockh*cellph;

else

xbstep = xbstride;

ybstep = ybstride;

end

% block slide loop

for btly = 1:ybstep:ybstridend

for btlx = 1:xbstep:xbstridend

for bi = 1:(cellph*nblockh)

for bj = 1:(cellpw*nblockw)

i = btly + bi - 1;

j = btlx + bj - 1;

gaussweight = weight(bi,bj);

gs = gradscal(i,j);

go = gradorient(i,j);

if glbalinter == 1

jorbj = j;

iorbi = i;

else

jorbj = bj;

iorbi = bi;

end

% calculate bin index of hist3dbig

binx1 = floor((jorbj-1+cellpw/2)/cellpw) + 1;

biny1 = floor((iorbi-1+cellph/2)/cellph) + 1;

binz1 = floor((go+(or*pi/nthet)/2)/(or*pi/nthet)) + 1;

if gs == 0

continue;

end

binx2 = binx1 + 1;

biny2 = biny1 + 1;

binz2 = binz1 + 1;

x1 = (binx1-1.5)*cellpw + 0.5;

y1 = (biny1-1.5)*cellph + 0.5;

z1 = (binz1-1.5)*(or*pi/nthet);

% trilinear interpolation.

hist3dbig(biny1,binx1,binz1) =...

hist3dbig(biny1,binx1,binz1) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx1,binz2) =...

hist3dbig(biny1,binx1,binz2) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx1,binz1) =...

hist3dbig(biny2,binx1,binz1) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx1,binz2) =...

hist3dbig(biny2,binx1,binz2) + gs*gaussweight...

* (1-(jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx2,binz1) =...

hist3dbig(biny1,binx2,binz1) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny1,binx2,binz2) =...

hist3dbig(biny1,binx2,binz2) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*(1-(iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx2,binz1) =...

hist3dbig(biny2,binx2,binz1) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*(1-(go-z1)/(or*pi/nthet));

hist3dbig(biny2,binx2,binz2) =...

hist3dbig(biny2,binx2,binz2) + gs*gaussweight...

* ((jorbj-x1)/cellpw)*((iorbi-y1)/cellph)...

*((go-z1)/(or*pi/nthet));

end

end

% In the local interpolate condition. F is generated in this block

% slide loop. hist3dbig should be cleared in each loop.

if glbalinter == 0

if or == 2

hist3dbig(:,:,2) = hist3dbig(:,:,2)...

+ hist3dbig(:,:,nthet+2);

hist3dbig(:,:,(nthet+1)) =...

hist3dbig(:,:,(nthet+1)) + hist3dbig(:,:,1);

end

hist3d = hist3dbig(2:(nblockh+1), 2:(nblockw+1), 2:(nthet+1));

for ibin = 1:nblockh

for jbin = 1:nblockw

idsF = nthet*((ibin-1)*nblockw+jbin-1)+1;

idsF = idsF:(idsF+nthet-1);

sF(idsF) = hist3d(ibin,jbin,:);

end

end

iblock = ((btly-1)/ybstride)*ntotalbw +...

((btlx-1)/xbstride) + 1;

idF = (iblock-1)*nblockw*nblockh*nthet+1;

idF = idF:(idF+nblockw*nblockh*nthet-1);

F(idF) = sF;

hist3dbig(:,:,:) = 0;

end

end

end

% In the global interpolate condition. F is generated here outside the

% block slide loop

if glbalinter == 1

ncellx = N / cellpw;

ncelly = M / cellph;

if or == 2

hist3dbig(:,:,2) = hist3dbig(:,:,2) + hist3dbig(:,:,nthet+2);

hist3dbig(:,:,(nthet+1)) = hist3dbig(:,:,(nthet+1)) + hist3dbig(:,:,1);

end

hist3d = hist3dbig(2:(ncelly+1), 2:(ncellx+1), 2:(nthet+1));

iblock = 1;

for btly = 1:ybstride:ybstridend

for btlx = 1:xbstride:xbstridend

binidx = floor((btlx-1)/cellpw)+1;

binidy = floor((btly-1)/cellph)+1;

idF = (iblock-1)*nblockw*nblockh*nthet+1;

idF = idF:(idF+nblockw*nblockh*nthet-1);

for ibin = 1:nblockh

for jbin = 1:nblockw

idsF = nthet*((ibin-1)*nblockw+jbin-1)+1;

idsF = idsF:(idsF+nthet-1);

sF(idsF) = hist3d(binidy+ibin-1, binidx+jbin-1, :);

end

end

F(idF) = sF;

iblock = iblock + 1;

end

end

end

% adjust the negative value caused by accuracy of floating-point

% operations.these value's scale is very small, usually at E-03 magnitude

% while others will be E+02 or E+03 before normalization.

F(F<0) = 0;

% block normalization.

e = 0.001;

l2hysthreshold = 0.2;

fslidestep = nblockw*nblockh*nthet;

switch normmethod

case 'none'

case 'l1'

for fi = 1:fslidestep:size(F,2)

div = sum(F(fi:(fi+fslidestep-1)));

F(fi:(fi+fslidestep-1)) = F(fi:(fi+fslidestep-1))/(div+e);

end

case 'l1sqrt'

for fi = 1:fslidestep:size(F,2)

div = sum(F(fi:(fi+fslidestep-1)));

F(fi:(fi+fslidestep-1)) = sqrt(F(fi:(fi+fslidestep-1))/(div+e));

end

case 'l2'

for fi = 1:fslidestep:size(F,2)

sF = F(fi:(fi+fslidestep-1)).*F(fi:(fi+fslidestep-1));

div = sqrt(sum(sF)+e*e);

F(fi:(fi+fslidestep-1)) = F(fi:(fi+fslidestep-1))/div;

end

case 'l2hys'

for fi = 1:fslidestep:size(F,2)

sF = F(fi:(fi+fslidestep-1)).*F(fi:(fi+fslidestep-1));

div = sqrt(sum(sF)+e*e);

sF = F(fi:(fi+fslidestep-1))/div;

sF(sF>l2hysthreshold) = l2hysthreshold;

div = sqrt(sum(sF.*sF)+e*e);

F(fi:(fi+fslidestep-1)) = sF/div;

end

otherwise

error('Incorrect NORMMETHOD parameter.');

end

% the following code, which can be removed because of having no

% contributions to HOG feature calculation, are just for result

% demonstration when the trilinear interpolation is 'global' for this

% condition is easier to give a simple and intuitive illustration. so in

% 'local' condition it will produce error.

% figure;

% hold on;

% axis equal;

% xlim([0, N]);

% ylim([0, M]);

% for u = 1:(M/cellph)

% for v = 1:(N/cellpw)

% cx = (v-1)*cellpw + cellpw/2 + 0.5;

% cy = (u-1)*cellph + cellph/2 + 0.5;

% hist3d(u,v,:)=0.9*min(cellpw,cellph)*hist3d(u,v,:)/max(hist3d(u,v,:));

% for t = 1:nthet

% s = hist3d(u,v,t);

% thet = (t-1)*pi/nthet + pi*0.5/nthet;

% x1 = cx - s*0.5*cos(thet);

% x2 = cx + s*0.5*cos(thet);

% y1 = cy - s*0.5*sin(thet);

% y2 = cy + s*0.5*sin(thet);

% plot([x1,x2],[M-y1+1,M-y2+1]);

% end

% end

% end

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
【优质项目推荐】 1、项目代码均经过严格本地测试,运行OK,确保功能稳定后才上传平台。可放心下载并立即投入使用,若遇到任何使用问题,随时欢迎私信反馈与沟通,博主会第一时间回复。 2、项目适用于计算机相关专业(如计科、信息安全、数据科学、人工智能、通信、物联网、自动化、电子信息等)的在校学生、专业教师,或企业员工,小白入门等都适用。 3、该项目不仅具有很高的学习借鉴价值,对于初学者来说,也是入门进阶的绝佳选择;当然也可以直接用于 毕设、课设、期末大作业或项目初期立项演示等。 3、开放创新:如果您有一定基础,且热爱探索钻研,可以在此代码基础上二次开发,进行修改、扩展,创造出属于自己的独特应用。 欢迎下载使用优质资源!欢迎借鉴使用,并欢迎学习交流,共同探索编程的无穷魅力! 基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip 基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip 基于业务逻辑生成特征变量python实现源码+数据集+超详细注释.zip
提供的源码资源涵盖了安卓应用、小程序、Python应用和Java应用等多个领域,每个领域都包含了丰富的实例和项目。这些源码都是基于各自平台的最新技术和标准编写,确保了在对应环境下能够无缝运行。同时,源码中配备了详细的注释和文档,帮助用户快速理解代码结构和实现逻辑。 适用人群: 这些源码资源特别适合大学生群体。无论你是计算机相关专业的学生,还是对其他领域编程感兴趣的学生,这些资源都能为你提供宝贵的学习和实践机会。通过学习和运行这些源码,你可以掌握各平台开发的基础知识,提升编程能力和项目实战经验。 使用场景及目标: 在学习阶段,你可以利用这些源码资源进行课程实践、课外项目或毕业设计。通过分析和运行源码,你将深入了解各平台开发的技术细节和最佳实践,逐步培养起自己的项目开发和问题解决能力。此外,在求职或创业过程中,具备跨平台开发能力的大学生将更具竞争力。 其他说明: 为了确保源码资源的可运行性和易用性,特别注意了以下几点:首先,每份源码都提供了详细的运行环境和依赖说明,确保用户能够轻松搭建起开发环境;其次,源码中的注释和文档都非常完善,方便用户快速上手和理解代码;最后,我会定期更新这些源码资源,以适应各平台技术的最新发展和市场需求。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值