MATLAB机器视觉工具箱的应用_computer vision toolbox-CSDN博客

本文链接：https://blog.csdn.net/Simuworld/article/details/131397876

MATLAB机器视觉工具箱是MATLAB软件的一个扩展工具箱，主要用于处理和分析图像和视频。该工具箱包含了许多用于图像处理、特征提取、目标检测、图像配准、三维重建以及机器学习等领域的算法和函数。MATLAB机器视觉工具箱不仅提供了快速、高效的图像处理工具，还提供了丰富的可视化功能，使用户能够更加直观地了解图像和视频数据。本文将介绍MATLAB机器视觉工具箱的使用方法，并给出5个使用案例。

1、使用方法

1.1安装和启用工具箱

在使用MATLAB机器视觉工具箱之前，需要先安装并启用该工具箱。可以通过以下步骤来安装和启用MATLAB机器视觉工具箱：

（1）在MATLAB软件中，点击“Home”选项卡，然后选择“Add-Ons”选项卡。

（2）在“Add-On Explorer”界面中，选择“Get Add-Ons”选项卡，并搜索“Computer Vision Toolbox”。

（3）选择“Computer Vision Toolbox”后，点击“Install”按钮进行安装。

（4）在安装完成后，可以在MATLAB软件的命令窗口中输入“ver”命令来查看是否已经安装成功。

（5）安装成功后，可以通过在MATLAB命令窗口中输入“cv”命令来启用MATLAB机器视觉工具箱。

1.2图像读取和显示

在MATLAB机器视觉工具箱中，可以使用imread函数来读取图像，使用imshow函数来显示图像。以下是一个简单的示例：

% 读取图像
I = imread('lena.png');

% 显示图像
imshow(I);

MATLAB机器视觉工具箱提供了许多用于图像处理的函数，例如imresize、imrotate、imcrop等。以下是一个简单的示例：

% 缩放图像
J = imresize(I, 0.5);

% 旋转图像
K = imrotate(I, 45);

% 裁剪图像
L = imcrop(I, [100, 100, 200, 200]);

% 显示结果
figure
subplot(2,2,1),imshow(I),title('原始图像');
subplot(2,2,2),imshow(J),title('缩放图像');
subplot(2,2,3),imshow(K),title('旋转图像');
subplot(2,2,4),imshow(L),title('裁剪图像');

1.3 特征提取

MATLAB机器视觉工具箱提供了许多用于特征提取的函数，例如detectSURFFeatures、extractHOGFeatures、extractLBPFeatures等。以下是一个简单的示例：

% 提取SURF特征
points = detectSURFFeatures(I);
features = extractFeatures(I, points);

% 提取HOG特征
[hog, visualization] = extractHOGFeatures(I, 'CellSize', [8 8]);

% 提取LBP特征
lbpFeatures = extractLBPFeatures(I);

% 显示结果
figure
subplot(2,2,1),imshow(I),title('原始图像');
subplot(2,2,2),imshow(I); hold on; plot(points.selectStrongest(50));
title('SURF特征');
subplot(2,2,3),imshow(visualization),title('HOG特征');
subplot(2,2,4),bar(lbpFeatures),title('LBP特征');

1.4目标检测

MATLAB机器视觉工具箱提供了许多用于目标检测的函数，例如trainCascadeObjectDetector、vision.CascadeObjectDetector等。以下是一个简单的示例：

% 训练分类器
positiveInstances = objectDetectorTrainingData(gTruth);
negativeFolder = fullfile(matlabroot, 'toolbox', 'vision', 'visiondata', 'nonObjectImages');
negativeImages = imageDatastore(negativeFolder);
trainCascadeObjectDetector('detector.xml', positiveInstances, negativeFolder, 'FalseAlarmRate', 0.1, 'NumCascadeStages', 5);

% 加载分类器
detector = vision.CascadeObjectDetector('detector.xml');

% 检测目标
bbox = step(detector, I);

% 显示结果
figure
imshow(I); hold on;
for i = 1:size(bbox,1)
    rectangle('Position', bbox(i,:), 'EdgeColor', 'r', 'LineWidth', 2);
end
title('检测结果');

1.5图像配准

MATLAB机器视觉工具箱提供了许多用于图像配准的函数，例如imregtform、imwarp、imregister等。以下是一个简单的示例：

% 读取图像
I1 = imread('image1.png');
I2 = imread('image2.png');

% 提取SURF特征
points1 = detectSURFFeatures(I1);
points2 = detectSURFFeatures(I2);
features1 = extractFeatures(I1, points1);
features2 = extractFeatures(I2, points2);

% 匹配特征
indexPairs = matchFeatures(features1, features2);

% 选择最佳匹配
matchedPoints1 = points1(indexPairs(:,1), :);
matchedPoints2 = points2(indexPairs(:,2), :);
[tform, inlierPoints1, inlierPoints2] = estimateGeometricTransform(matchedPoints1, matchedPoints2, 'affine');

% 图像配准
outputView = imref2d(size(I1));
I2Registered = imwarp(I2, tform, 'OutputView', outputView);

% 显示结果
figure
imshowpair(I1, I2Registered, 'blend');
title('图像配准');

1.6 三维重建

MATLAB机器视觉工具箱提供了许多用于三维重建的函数，例如triangulate、pointCloud、pcshow等。以下是一个简单的示例：

% 读取图像
I1 = imread('image1.png');
I2 = imread('image2.png');

% 提取SURF特征
points1 = detectSURFFeatures(I1);
points2 = detectSURFFeatures(I2);
features1 = extractFeatures(I1, points1);
features2 = extractFeatures(I2, points2);

% 匹配特征
indexPairs = matchFeatures(features1, features2);

% 选择最佳匹配
matchedPoints1 = points1(indexPairs(:,1), :);
matchedPoints2 = points2(indexPairs(:,2), :);
[tform, inlierPoints1, inlierPoints2] = estimateGeometricTransform(matchedPoints1, matchedPoints2, 'affine');

% 三维重建
worldPoints = triangulate(inlierPoints1, inlierPoints2, tform);
ptCloud = pointCloud(worldPoints);
pcshow(ptCloud);

% 显示结果
title('三维重建');

2、使用案例

2.1目标跟踪

目标跟踪是指在视频序列中检测和跟踪一个特定目标的过程。MATLAB机器视觉工具箱提供了许多用于目标跟踪的函数，例如vision.KalmanFilter、vision.PointTracker等。以下是一个简单的示例：

% 读取视频
video = VideoReader('traffic.avi');

% 创建点跟踪器
tracker = vision.PointTracker('MaxBidirectionalError', 2);

% 选择第一帧图像中的初始点
frame = readFrame(video);
points = detectMinEigenFeatures(rgb2gray(frame), 'ROI', [100, 100, 300, 300]);
points = points.Location;
initialize(tracker, points, frame);

% 跟踪目标
while hasFrame(video)
    frame = readFrame(video);
    [points, isFound] = step(tracker, frame);
    visiblePoints = points(isFound, :);
    oldInliers = oldPoints(isFound, :);
    if size(visiblePoints, 1) >= 2
        [tform, oldInliers, visiblePoints] = estimateGeometricTransform(oldInliers, visiblePoints, 'similarity');
        bbox = [min(visiblePoints) max(visiblePoints) -

min(visiblePoints)];
frame = insertShape(frame, 'Rectangle', bbox, 'LineWidth', 2);
oldPoints = visiblePoints;
setPoints(tracker, oldPoints);
end
imshow(frame);
end

2. 2人脸识别

人脸识别是指在图像或视频中识别出人脸并进行分类的过程。MATLAB机器视觉工具箱提供了许多用于人脸识别的函数，例如vision.CascadeObjectDetector、trainImageCategoryClassifier等。以下是一个简单的示例：

% 读取图像
faceDetector = vision.CascadeObjectDetector();
I = imread('faces.jpg');

% 检测人脸
bbox = step(faceDetector, I);

% 显示结果
figure
imshow(I); hold on;
for i = 1:size(bbox,1)
    rectangle('Position', bbox(i,:), 'EdgeColor', 'r', 'LineWidth', 2);
end
title('检测结果');

% 训练分类器
imds = imageDatastore('FaceDatabase', 'IncludeSubfolders', true, 'LabelSource', 'foldernames');
[trainingSet, testSet] = splitEachLabel(imds, 0.8, 'randomize');
bag = bagOfFeatures(trainingSet);
categoryClassifier = trainImageCategoryClassifier(trainingSet, bag);
confMatrix = evaluate(categoryClassifier, testSet);

% 测试分类器
I = imread('test.jpg');
[labelIdx, scores] = predict(categoryClassifier, I);
label = categoryClassifier.Labels(labelIdx);

% 显示结果
figure
imshow(I); title(char(label));

2.3 文字识别

文字识别是指在图像或视频中识别出文字并进行识别的过程。MATLAB机器视觉工具箱提供了许多用于文字识别的函数，例如ocr等。以下是一个简单的示例：

% 读取图像
I = imread('text.png');

% 进行文字识别
results = ocr(I);

% 显示结果
figure
imshow(I); hold on;
for i = 1:length(results.Words)
    text(results.WordBoundingBoxes(i,1), results.WordBoundingBoxes(i,2), results.Words{i}, 'FontSize', 20, 'Color', 'r', 'HorizontalAlignment', 'left', 'VerticalAlignment', 'top');
end
title('文字识别结果');

2.4 图像分割

图像分割是指将图像分成若干个不同的区域或对象的过程。MATLAB机器视觉工具箱提供了许多用于图像分割的函数，例如activecontour、kmeans等。以下是一个简单的示例：

% 读取图像
I = imread('coins.png');

% 进行图像分割
bw = imbinarize(I);
D = -bwdist(~bw);
D(~bw) = -Inf;
L = watershed(D);
rgb = label2rgb(L, 'jet', 'w', 'shuffle');

% 显示结果
figure
imshow(rgb); title('图像分割结果');

2.5 视频稳定

视频稳定是指在视频序列中去除摄像头抖动或手持摄像时的晃动效果。MATLAB机器视觉工具箱提供了许多用于视频稳定的函数，例如opticalFlowFarneback、vision.GeometricTransformEstimator等。以下是一个简单的示例：

% 读取视频
video = VideoReader('shaky.avi');

% 创建光流估计器
opticFlow = opticalFlowFarneback('PyrScale', 0.5, 'NumLevels', 4, 'NumIterations', 2, 'PolyN', 7, 'PolySigma', 1.5, 'FastPyramids', true);

% 对每一帧图像进行稳定
while hasFrame(video)
    frame = readFrame(video);
    flow = estimateFlow(opticFlow, rgb2gray(frame));
    [x, y] = meshgrid(1:size(frame,2), 1:size(frame,1));
    warpedFrame = interp2(x, y, double(frame), x+flow.Vx, y+flow.Vy);
    tform = vision.GeometricTransformEstimator('Transform', 'Affine');
    [tform, intermediatePoints] = step(tform, reshape([x(:) y(:)], [], 2), reshape([x(:)+flow.Vx(:) y(:)+flow.Vy(:)], [], 2));
    stabilizedFrame = imwarp(frame, affine2d(tform.T), 'OutputView', imref2d(size(frame)));
    imshow(stabilizedFrame);
end

以上是MATLAB机器视觉工具箱中的一些常见应用示例，当然还有很多其他的应用场景，如目标跟踪、深度学习等等。具体的应用需要根据实际问题和需求来选择相应的函数和算法。