Skip to content
#

layoutlm

Here are 13 public repositories matching this topic...

The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, where we are actively working. This repository is actively maintained, and new features are continuously being added.

  • Updated Jul 16, 2025
  • Python

SamvidAI — Enterprise Contract Intelligence powered by OpticalRAG Multimodal document understanding system for clause extraction, legal risk scoring, and explainable contract analysis using layout-aware RAG pipelines.

  • Updated Jan 30, 2026
  • Python

Improve this page

Add a description, image, and links to the layoutlm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the layoutlm topic, visit your repo's landing page and select "manage topics."

Learn more