图像形状尺寸的常用整理方法

最新推荐文章于 2023-08-29 18:49:04 发布

天羽东臻

最新推荐文章于 2023-08-29 18:49:04 发布

阅读量334

点赞数

分类专栏： # 深度学习成长之路

本文链接：https://blog.csdn.net/lvxingvir_sigma/article/details/109666425

版权

成长之路同时被 2 个专栏收录

19 篇文章 0 订阅

订阅专栏

深度学习

8 篇文章 0 订阅

订阅专栏

在进行网络训练的时候，经常需要对图像进行一些处理，形状的改变是最常见的一项，这里有几种办法来进行：

1. opencv:

https://www.tutorialkart.com/opencv/python/opencv-python-resize-image/

cv2.resize(src, dsize[, dst[, fx[, fy[, interpolation]]]])

import cv2
 
img = cv2.imread('/home/img/python.png', cv2.IMREAD_UNCHANGED)
 
print('Original Dimensions : ',img.shape)
 
scale_percent = 60 # percent of original size
width = int(img.shape[1] * scale_percent / 100)
height = int(img.shape[0] * scale_percent / 100)
dim = (width, height)
# resize image
resized = cv2.resize(img, dim, interpolation = cv2.INTER_AREA)
 
print('Resized Dimensions : ',resized.shape)
 
cv2.imshow("Resized image", resized)
cv2.waitKey(0)
cv2.destroyAllWindows()

2. PIL中的Image对象中的resize()方法

Syntax: Image.resize(size, resample=0)
Parameters:

size – The requested size in pixels, as a 2-tuple: (width, height).
resample – An optional resampling filter. This can be one of PIL.Image.NEAREST (use nearest neighbour), PIL.Image.BILINEAR (linear interpolation), PIL.Image.BICUBIC (cubic spline interpolation), or PIL.Image.LANCZOS (a high-quality downsampling filter). If omitted, or if the image has mode “1” or “P”, it is set PIL.Image.NEAREST.

Returns type: An Image object.

# Improting Image class from PIL module  
from PIL import Image  
  
# Opens a image in RGB mode  
im = Image.open(r"C:\Users\System-Pc\Desktop\ybear.jpg")  
  
# Size of the image in pixels (size of orginal image)  
# (This is not mandatory)  
width, height = im.size  
  
# Setting the points for cropped image  
left = 4
top = height / 5
right = 154
bottom = 3 * height / 5
  
# Cropped image of above dimension  
# (It will not change orginal image)  
im1 = im.crop((left, top, right, bottom)) 
newsize = (300, 300) 
im1 = im1.resize(newsize) 
# Shows the image in image viewer  
im1.show()

3. pytorch对于多维图像变化最为方便，插值并变换成任意尺寸大小：

torch.nn.functional.interpolate(input, size=None, scale_factor=None, mode='nearest', align_corners=None, recompute_scale_factor=None)

Down/up samples the input to either the given size or the given scale_factor

The algorithm used for interpolation is determined by mode.

Currently temporal, spatial and volumetric sampling are supported, i.e. expected inputs are 3-D, 4-D or 5-D in shape.

The input dimensions are interpreted in the form: mini-batch x channels x [optional depth] x [optional height] x width.

The modes available for resizing are: nearest, linear (3D-only), bilinear, bicubic (4D-only), trilinear (5D-only), area

Example:

x = nn.functional.interpolate(x, scale_factor=8, mode='bilinear', align_corners=False)

img_r = F.interpolate(img_t, [self.imgsz, self.imgsz, con_len])

天羽东臻

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
图像形状尺寸的常用整理方法

在进行网络训练的时候，经常需要对图像进行一些处理，形状的改变是最常见的一项，这里有几种办法来进行：1. opencv:https://www.tutorialkart.com/opencv/python/opencv-python-resize-image/cv2.resize(src, dsize[, dst[, fx[, fy[, interpolation]]]])import cv2 img = cv2.imread('/home/img/python.png', cv2.
复制链接

扫一扫

专栏目录