java图片开源框架tesseract调用OCR实现图片文件识别代码下载

原文:java图片开源框架tesseract调用OCR实现图片文件识别代码下载


项目实现的是身份证识别处理,将正确识别的的身份证图片以姓名和身份号命名,输出到一个suc文件夹下,不能识别的输出到fai文件夹下,传入文件库路径,使用递归调用实现文件夹下逐级查找,工程已在实际中应用,OCR用的是tesseract识别对图片要求比较高,详见代码,下载云盘tesseract http://pan.baidu.com/s/1pJ7DD7X放在工程路径下,直接可以运行。。。

Java OCR Framework An Optical Character Recognition Framework written purely in Java. Installation Build the project and add the jar for the project along with all the jars in the jar directory to your compile-time libraries. Usage There are 4 main parts to OCR: Normalization Segmentation Feature Extraction Classification Feature Extraction and Classification are the only required parts. For Feature Extraction there are 5 algorithms at your disposal Horizontal Celled Projection Vertical Celled Projection Horizontal Projection Histogram Vertical Projection Histogram Local Line Fitting This framework loosely uses a Fluent Interface Builder syntax. Example: OCR ocr = OCRBuilder .create() .normalization(new Normalization()) .segmentation(new Segmentation()) .featureExtraction( FeatureExtractionBuilder .create() .children( new HorizontalCelledProjection(5), new VerticalCelledProjection(5), new HorizontalProjectionHistogram(), new VerticalProjectionHistogram(), new LocalLineFitting(49)) .build()) .neuralNetwork( NeuralNetworkBuilder .create() .fromFile("neural_network.eg") .build()) .build(); Contributing Want to help out? Feel free to share your ideas. Fork it. Create a branch (git checkout -b my_fancy_feature) Commit your changes (git commit -am "Added amazing feature") Push to the branch (git push origin my_fancy_feature) Open a Pull Request References Arora, Sandhya (2008). “Combining Multiple Feature Extraction Techniques for Handwritten Devnagari Character Recognition”, IEEE Region 10 Colloquium. pp. 342-348 Haykin, Simon (1999). “Neural Networks A Comprehensive Foundation”, 2nd Edition. Pearson Education. Perez, Juan-Carlos ; Vidal, Enrique ; Sanchez, Lourdes (1994). “Simple and Effective Feature Extraction for Optical Character Recognition”, Selected Paper From the 5th Spanish Symposium on Pattern Recognition and Image Analysis. Zahid Hossain, M. ; Ashraful Amin, M. ; Yan, Hong (2012). “Rapid Feature Extraction for Optical Character Recognition”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, No. 6. pp. 801-813 Thanks Thanks to Heaton Research for providing an amazing Neural Network framework. Also thanks to Apache Math Commons for doing all the math without the mess.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值