最近在写爬虫,但爬下来的很多图片被加了水印,而且水印的位置不是固定的。本来不是搞图形图像处理的我,只好google了一把,看到很多用opencv来解决类似问题。这里记录一下水印识别的代码.
首先,必须安装如下依赖, 用了豆瓣的镜像,速度快些.pip install matplotlib -i https://pypi.douban.com/simple
pip install numpy
pip install opencv-python -i https://pypi.douban.com/simple
识别水印的代码如下:
import cv2
import numpy as np
# Read the main image
img_rgb = cv2.imread('maven-spotbugs-static-code.png')
# Convert it to grayscale
img_gray = cv2.cvtColor(img_rgb, cv2.COLOR_BGR2GRAY)
# Read the template
template = cv2.imread('mark.jpg',0)
# Store width and heigth of template in w and h
w, h = template.shape[::-1]
# Perform match operations.
res = cv2.matchTemplate(img_gray,template,cv2.TM_CCOEFF_NORMED)
# Specify a threshold
threshold = 0.8