LayoutLM - Multimodal Pre-training for Document Understanding

A multimodal Transformer model pre-trained on text and layout for document image understanding, optimized for form processing, receipt understanding, and structured document analysis.

About Model

LayoutLM is a Transformer-based model for document AI tasks that combines text with layout information (the position of each word on the page) to improve document image understanding. Pre-trained on the IIT-CDIP dataset, it reported state-of-the-art results at the time of its release on structured document tasks such as form understanding, receipt recognition, and document classification. Because the model captures the spatial relationships between text elements, it is particularly useful for OCR-based workflows, invoice processing, and automated document analysis. LayoutLM is available in base and large configurations and serves as a foundation for many AI-driven document processing solutions.
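To make "text plus layout" concrete, the sketch below shows one common way OCR output is prepared for a LayoutLM-style model: each word is paired with a bounding box rescaled to a 0-1000 grid, which is the coordinate convention used by the Hugging Face Transformers implementation of LayoutLM. The function names (`normalize_box`, `words_to_layout_inputs`), the sample words, and the page size are hypothetical and chosen for illustration only; this is not code from the model authors.

```python
from typing import Dict, List, Sequence, Tuple, Union

# Pixel-space box as produced by a typical OCR engine: (x0, y0, x1, y1).
Box = Tuple[int, int, int, int]


def normalize_box(box: Box, page_width: int, page_height: int) -> List[int]:
    """Rescale a pixel-space box to the 0-1000 grid (hypothetical helper)."""
    x0, y0, x1, y1 = box
    return [
        int(1000 * x0 / page_width),
        int(1000 * y0 / page_height),
        int(1000 * x1 / page_width),
        int(1000 * y1 / page_height),
    ]


def words_to_layout_inputs(
    words: Sequence[str],
    boxes: Sequence[Box],
    page_width: int,
    page_height: int,
) -> List[Dict[str, Union[str, List[int]]]]:
    """Pair every OCR word with its normalized box (hypothetical helper)."""
    if len(words) != len(boxes):
        raise ValueError("each word needs exactly one bounding box")
    return [
        {"word": word, "bbox": normalize_box(box, page_width, page_height)}
        for word, box in zip(words, boxes)
    ]


# Illustrative OCR output for an 850 x 1100 pixel page (made-up values).
example = words_to_layout_inputs(
    words=["Invoice", "Total:", "42.00"],
    boxes=[(50, 40, 210, 80), (50, 900, 140, 940), (600, 900, 700, 940)],
    page_width=850,
    page_height=1100,
)

for item in example:
    print(item["word"], item["bbox"])
```

Normalizing to a fixed grid keeps the layout signal comparable across scans of different resolutions. Before being passed to a model, each word's box would still need to be repeated for every sub-word token the tokenizer produces.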

Metadata

MIT

Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou

Transformers

N.A.

Open

Sector Agnostic

12/03/25 06:35:02

0

Activity Overview

  • Downloads 0
  • Redirect 7
  • File Size 0
  • Views 568

Tags

  • PyTorch
  • Transformers
  • English
  • safetensors
  • inference endpoints
  • tensorflow
  • layoutlm

License Control

MIT

More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to simulate neural SQL execution, enabling the execution of SQL queries on given tables.
DataRetrieval
Transformers
BART
TAPEX
PreTrainedModel
NeuralExecutor
SQLExecution
  • Upvoters 0
  • Downloads 6
  • File Size 0
  • Views 214
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
LargeModel
  • Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 169
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
Transformers
TabFact
DataValidation
FactVerification
TAPEX
FineTunedModel
NaturalLanguageProcessing
BART
  • Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 131
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
BART
Transformers
WikiTableQuestions
TableQuestionAnswering
DataExtraction
TAPEX
FineTunedModel
NaturalLanguageProcessing
  • Upvoters 0
  • Downloads 2
  • File Size 0
  • Views 205
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
TabularData
  • Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 122
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
WikiTableQuestions
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
DataExtraction
TableQuestionAnswering
  • Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 157
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
DataRetrieval
SQLQueryGeneration
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
WikiSQL
  • Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 146
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
FactVerification
DataValidation
TabFact
  • Upvoters 0
  • Downloads 4
  • File Size 0
  • Views 122
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
WikiSQL
TableQuestionAnswering
DataExtraction
Transformers
FineTunedModel
NaturalLanguageProcessing
BART
  • Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 174
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
exbert
Fill-Mask
inference endpoints
Bert
English
JAX
PyTorch
  • Upvoters 0
  • Downloads 91
  • File Size 0
  • Views 1,471
Updated 1 year(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.