Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
-
Updated
Feb 27, 2023 - Python
{{ message }}
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
A curated list of resources for Document Understanding (DU) topic
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
A Repo For Document AI
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Official Implementation of Web-based Visual Corpus Builder (Webvicob)
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
ReadingBank: A Benchmark Dataset for Reading Order Detection
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Table detection and table structure recognition using Yolov5
Exploring LayoutLM for Smart OCR Capabilities
Add a description, image, and links to the document-ai topic page so that developers can more easily learn about it.
To associate your repository with the document-ai topic, visit your repo's landing page and select "manage topics."