darknet-ocr:
web服务启动部分:
源代码:添加链接描述
该部分需要从该网站下载镜像并启动:
https://pan.baidu.com/s/12F9AYVyBTz34UCXeWj3ATA 密码:5lyz
docker load -i darknet-ocr.tar
docker run -it -p 8080:8080 darknet-ocr:1.0 python app.py
启动镜像后运行docker ps, 得到:
可通过docker logs 容器id 查看识别的日志:
在浏览器中运行http://宿主机id:8080/text, 得到:
另外,有一个crnn单独识别英文字体的脚本:
import torch
from torch.autograd import Variable
import utils
import dataset
from PIL import Image
import keys
import crnn.network_torch as crnn
model_path = './models/crnn.pth'
img_path = './test/test.png'
alphabet = keys.alphabet
alphabet = '0123456789abcdefghijklmnopqrstuvwxyz'
model = crnn.CRNN(32, 1, 37, 256)
if torch.cuda.is_available():
model = model.cuda()
print('loading pretrained model from %s' % model_path)
model.load_state_dict(torch.load(model_path))
converter = utils.strLabelConverter(alphabet)
transformer = dataset.resizeNormalize((100, 32))
image = Image.open(img_path).convert('L')
image = transformer(image)
if torch.cuda.is_available():
image = image.cuda()
image = image.view(1, *image.size())
image = Variable(image)
model.eval()
preds = model(image)
_, preds = preds.max(2)
preds = preds.transpose(1, 0).contiguous().view(-1)
preds_size = Variable(torch.IntTensor([preds.size(0)]))
raw_pred = converter.decode(preds.data, preds_size.data, raw=True)
sim_pred = converter.decode(preds.data, preds_size.data, raw=False)
print('%-20s => %-20s' % (raw_pred, sim_pred))
也可以通过修改alphabet的参数识别中文字体。