IronOCR 2022.1 Crack

最新推荐文章于 2025-03-16 16:56:11 发布

sdk大全

最新推荐文章于 2025-03-16 16:56:11 发布

阅读量1.1k

点赞数

分类专栏： IronOCR 文章标签： IronOCR 2022

本文链接：https://blog.csdn.net/john_dwh/article/details/122905314

版权

IronOCR 专栏收录该内容

81 篇文章

订阅专栏

IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET

IronOCR provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for:

.Net Framework 4.0 +
.Net Standard 2.0 +
.Net Core 2.0 +
.Net 5
Mono for MacOS and Linux
Xamarin for MacOS

IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes.

IronOCR's Unique Features

Pure .Net OCR API
All OCR tasks run locally (no SAAS)
125 languages
Barcode & QR Code reading
Corrects low quality, noisy and distorted scans
Performance tuned above and beyond any other known build of Tesseract OCR.
Reads PDFs and multi-page TIFFs
Can save any OCR Scan to a searchable PDF document or XHTML

Data Output Options Include

Output Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.

International Language Support

125 Languages supported including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created.

Licensing & Support available

For code examples, documentation & more visit C# OCR Library [125 Languages Supported] | Iron OCR Email: developers@ironsoftware.com

Get Started Code Example

string Result = new IronOcr.IronTesseract().Read("scan.pdf").Text;

Why C# developers choose IronOCR over Vanilla Tesseract:

Achieve 99.8%+ OCR accuracy without using external web services.
Includes for Tesseract 5 , 4 and 3 Engines out of the box.
Blazing Speed and MultiThreading
MVC, WebApp, Desktop, Console & Server Application compatible
No Exes or C++ code to work with
Full PDF OCR support
To perform OCR an almost any Image file or PDF
Full .Net Core, Standard and FrameWork support
Deploy on Windows, Mac, Linux, Azure, Docker, Lambda, AWS
Read barcodes and QR codes
Export OCR as to XHTML
Export OCR to searchable PDF documents
Multithreading support
125 international languages all managed via Nuget or OcrData files
Extract Images, Coordinates, Statistics and Fonts. Not just text.
Can be used to redistribute Tesseract OCR inside commercial & proprietary applications.
Supports:

IronOCR shines when working with real world images and imperfect documents such as photographs, or scans of low resolution which may have digital noise or imperfections. Other free OCR libraries for the .NET platform such other .Net Tesseract APIs and web services do not perform so well on these real world use cases.

OCR with Tesseract 5 - Start Coding in C#

These code examples below shows how easy it is to read text from an image using C# or VB .NET.

Configurable Hello World

using IronOcr;

var Ocr = new IronTesseract();
using (var Input = new OcrInput()){
    Input.AddImage("images/sample.jpeg")
    //... you can add any number of images
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}

C# PDF OCR

The same approach can similarly be used to extract text from any PDF document.

var Ocr = new IronTesseract();
using (var input = new OcrInput())
{

    input.AddPdf("example.pdf","password");
    // We can also select specific PDF page numnbers to OCR
        
    var Result = Ocr.Read(input);

    Console.WriteLine(Result.Text);
    Console.WriteLine($"{Result.Pages.Count()} Pages");
    // 1 page for every page of the PDF
}