Search for text within images with the OCRize Image Text Finder, a powerful .NET OCR plug-in. Identify differences in image texts, regardless of styles, resolution, font, format, or other factors. Perform regular expression searches, case-insensitive searches, and comparisons with a single line of code. Ideal for detecting PII in digital archives, analyzing contracts, classifying large amounts of non-textual data, and streamlining business processes.
Get the respective assembly files from the Releases or fetch the package from NuGet to add OCRize
directly to your workspace.
By default, our library can automatically recognize a wide range of languages based on the Extended Latin alphabet. However, providing a specific language can significantly enhance recognition accuracy. Explicitly specify the language when recognizing Cyrillic, Chinese, and Hindi texts.
You can use any popular format from a scanner or camera, including PDF, JPEG, PNG, and TIFF, including multi-page documents. Recognition results are returned in plain text, HTML, Microsoft Word, PDF, JSON, and XML.
Good image quality is crucial for accurate OCR. Use a scanner or high-resolution camera. The library has advanced filters to automatically improve image quality before recognition.