FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction Paper • 2203.08411 • Published Mar 16, 2022 • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction Paper • 2305.02549 • Published May 4, 2023 • 6
ETC: Encoding Long and Structured Inputs in Transformers Paper • 2004.08483 • Published Apr 17, 2020 • 1
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking Paper • 2204.08387 • Published Apr 18, 2022 • 2
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models Paper • 2404.06209 • Published Apr 9 • 4
Text Role Classification in Scientific Charts Using Multimodal Transformers Paper • 2402.14579 • Published Feb 8 • 1
An inclusive review on deep learning techniques and their scope in handwriting recognition Paper • 2404.08011 • Published Apr 10 • 1
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement Paper • 2404.05669 • Published Apr 8 • 1