ocr-post-processing

Here are 7 public repositories matching this topic...

mikahama / natas

Python 3 library for processing historical English

english digital-humanities nlp-library spelling-correction historical-data historical-linguistics ocr-post-processing ocr-correction spelling-normalization non-standard-data historical-english

Updated Aug 10, 2024
Python

sergiocorreia / quipucamayoc

Star

dev repo for article

ocr poppler textract table-extraction ocr-python ocr-post-processing table-ocr

Updated Mar 14, 2023
Python

PedroBarcha / context-spelling-correction

Star

Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phrase for the suggestion. The software was originally developed for correcting OCR output.

spelling-correction context-aware context-awareness ocr-post-processing online-spelling-correction

Updated Dec 13, 2018
Python

soberbichler / Notebooks4Historical_Newspapers

Star

Notebooks that use LLMs to work with historical documents and artefacts

history event-detection historical-data newspapers article-extractor historical-research ocr-post-processing llms genai

Updated Jan 13, 2025
Jupyter Notebook

majumderb / sanskrit-ocr

Star

CoNLL 2018: Post-OCR Text Correction in Romanised Sanskrit

conll encoder-decoder copynet sanskrit-language nmt-model ocr-post-processing

Updated Feb 3, 2019
PLSQL

milahu / hocr-editor-qt

Star

graphical HOCR editor to produce minimal diffs for proofreading of tesseract OCR output

tesseract tesseract-ocr hocr proofreading ocr-post-processing hocr-editor minimal-diff cst-editor ocr-proofreading ocr-postprocessing

Updated Oct 25, 2025
Python

llap4585 / RapidOcr-Paragraphizer

Star

An OCR post-processing tool for scanned medical reports that reconstructs fragmented text lines into semantically coherent paragraphs. 一款来自于医学报告扫描件处理的 OCR 后处理工具。将离散的文本行自动重建为语义连贯的段落。医療レポートのスキャン画像に特化した OCR 後処理ツール。離散的なテキスト行をセマンティック（意味的）に連結し、一貫性のある段落を再構築します。(機械翻訳)

python nlp ocr document-analysis layout-analysis ocr-post-processing medical-reports

Updated Jan 31, 2026
Python

Improve this page

Add a description, image, and links to the ocr-post-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ocr-post-processing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ocr-post-processing

Here are 7 public repositories matching this topic...

mikahama / natas

sergiocorreia / quipucamayoc

PedroBarcha / context-spelling-correction

soberbichler / Notebooks4Historical_Newspapers

majumderb / sanskrit-ocr

milahu / hocr-editor-qt

llap4585 / RapidOcr-Paragraphizer

Improve this page

Add this topic to your repo