Python 3 library for processing historical English
Updated Aug 10, 2024 - Python
Development repository for an article
Given a text, split it into phrases and send each one to Yandex's search engine. If a query yields a "did you mean:" suggestion, replace the original phrase with that suggestion. The software was originally developed for correcting OCR output.
Notebooks that use LLMs to work with historical documents and artefacts
CoNLL 2018: Post-OCR Text Correction in Romanised Sanskrit
Graphical hOCR editor that produces minimal diffs for proofreading Tesseract OCR output
An OCR post-processing tool for scanned medical reports that reconstructs fragmented text lines into semantically coherent paragraphs.
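The search-engine approach described above (split the text into phrases, query each one, and take any "did you mean:" suggestion) can be sketched roughly as follows. This is a minimal illustration, not that project's code: `split_into_phrases`, `correct_text`, the `suggest` callback, and the fixed phrase length are all hypothetical names and choices, and the lookup table stands in for a real search-engine query, which is not shown here.

```python
from __future__ import annotations

from typing import Callable, Optional


def split_into_phrases(text: str, words_per_phrase: int = 4) -> list[str]:
    """Split text into consecutive phrases of at most `words_per_phrase` words."""
    words = text.split()
    return [
        " ".join(words[i : i + words_per_phrase])
        for i in range(0, len(words), words_per_phrase)
    ]


def correct_text(
    text: str,
    suggest: Callable[[str], Optional[str]],
    words_per_phrase: int = 4,
) -> str:
    """Replace each phrase with the suggestion returned for it, if any.

    `suggest` is a hypothetical callback: it takes a phrase and returns the
    "did you mean:" suggestion, or None when there is no suggestion.
    """
    corrected = []
    for phrase in split_into_phrases(text, words_per_phrase):
        suggestion = suggest(phrase)
        # Keep the original phrase when no suggestion is offered.
        corrected.append(suggestion if suggestion else phrase)
    return " ".join(corrected)


# Stand-in for a search-engine query (assumption: a real implementation
# would send the phrase to the search engine and parse its response).
_FAKE_SUGGESTIONS = {"tbe quick brown": "the quick brown"}


def fake_suggest(phrase: str) -> Optional[str]:
    return _FAKE_SUGGESTIONS.get(phrase)


result = correct_text("tbe quick brown fox jumps over", fake_suggest, words_per_phrase=3)
```

Passing the lookup as a callback keeps the correction loop independent of any particular search engine and makes it testable offline. Note that this sketch normalizes whitespace, so line breaks in the input are not preserved.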