文本检测时对图片进行方向矫正

最新推荐文章于 2024-05-31 18:00:00 发布

真不会修电脑

最新推荐文章于 2024-05-31 18:00:00 发布

阅读量3.3k

点赞数 1

分类专栏：文字检测相关

本文链接：https://blog.csdn.net/m0_37382341/article/details/104971942

版权

文字检测相关专栏收录该内容

12 篇文章 0 订阅

订阅专栏

在使用ctpn或者pse等文本检测算法时，首先要对文本图片进行矫正，本篇博文使用的是opencv2调用Tensorflow矫正模型达到矫正文本图片的目的，逻辑是使用矫正模型检测是图片的翻转角度，根据矫正模型输出的度数来利用opencv做flip操作。




angleNet = cv2.dnn.readNetFromTensorflow(config['pse']['AngleModelPb'], config['pse']['AngleModelPbtxt'])

def angle_detect(self, img, adjust=False):
    h, w = img.shape[:2]
    ROTATE = [0, 90, 180, 270]
    if adjust:
        thesh = 0.05
        xMin, yMin, xMax, yMax = int(thesh * w), int(thesh * h), w - int(thesh * w), h - int(thesh * h)
        img = img[yMin: yMax, xMin: xMax]  # cut the edge of image

    inputBlob = cv2.dnn.blobFromImage(img, scalefactor=1.0, size=(224, 224), swapRB=True,
                                      mean=[103.939, 116.779, 123.68], crop=False)
    angleNet.setInput(inputBlob)
    pred = self.angleNet.forward()
    index = np.argmax(pred, axis=1)[0]

    return ROTATE[index]


def rotate(self, img):
    # rotate the image and return the angle
    angle = 0
    img = np.array(img)
    (h, w) = img.shape[:2]

    angle = angle_detect(img=np.copy(img))  # angle detection
    if angle == 90:
        img = cv2.transpose(img)
        img = cv2.flip(img, flipCode=0)  # counter clock wise

    elif angle == 180:
        img = cv2.flip(img, flipCode=-1)  # flip the image both horizontally and vertically

    elif angle == 270:
        img = cv2.transpose(img)
        img = cv2.flip(img, flipCode=1)  # clock wise rotation

    return angle, img

if __name__ == '__main__':

    rotate(img)

PS：如果想要模型的话可以去官网找，或者留言。

真不会修电脑

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
10
评论
文本检测时对图片进行方向矫正

在使用ctpn或者pse等文本检测算法时，首先要对文本图片进行矫正，本篇博文使用的是opencv2调用Tensorflow矫正模型达到矫正文本图片的目的，逻辑是使用矫正模型检测是图片的翻转角度，根据矫正模型输出的度数来利用opencv做flip操作。angleNet = cv2.dnn.readNetFromTensorflow(config['p...
复制链接

扫一扫

专栏目录