OCRize’s .NET OCR plug-in extracts text from scanned or photographed tables. Advanced machine learning algorithms and specially trained neural networks accurately identify the structure of tables, extract text from cells, and organize it into searchable and editable spreadsheets.
OcrInput
object.Get the respective assembly files from the Releases or fetch the package from NuGet to add OCRize directly to your workspace.
By default, OCRize can automatically recognize a wide range of languages based on the Extended Latin alphabet. However, providing a specific language can significantly enhance recognition accuracy. Explicitly specify the language when recognizing Cyrillic, Chinese, and Hindi texts.
OCRize supports popular formats from scanners or cameras, including PDF, JPEG, PNG, and TIFF. Recognition results are returned in plain text, HTML, Microsoft Word, PDF, JSON, and XML.
Good image quality is crucial for accurate OCR. Use a scanner or high-resolution camera. The library includes advanced filters to automatically improve image quality before recognition.