LayoutLMv2 - Multimodal Document Understanding

An improved multimodal Transformer model for document AI that is pre-trained jointly on text, layout, and image inputs, targeting document understanding tasks such as form recognition, document classification, and visual question answering.

About Model

LayoutLMv2 is the successor to the LayoutLM model, designed for document AI applications. It combines text, layout, and image information in a single multimodal framework, and its pre-training tasks are designed to strengthen the interaction among these modalities, which improves performance on visually rich documents. The authors report state-of-the-art results at the time of publication on benchmark tasks that include form recognition, receipt and invoice understanding, and document-based question answering. Built for tasks that require OCR and document layout comprehension, LayoutLMv2 is applicable to automated document processing, financial data extraction, and AI-driven document analysis.
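
To make the "text plus layout" part of the description concrete, here is a minimal sketch of how OCR output might be prepared before it reaches a LayoutLM-family model. It assumes the convention, documented for these models in Hugging Face Transformers, that each word carries a bounding box scaled to a 0-1000 coordinate range. The function names `normalize_bbox` and `build_layout_inputs` and the sample words and coordinates are hypothetical and do not come from this page; the image modality and the model call itself are left out.

```python
from typing import Dict, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1) in page pixels


def normalize_bbox(bbox: Box, page_width: float, page_height: float) -> List[int]:
    """Scale a pixel-space box to the 0-1000 range (assumed convention)."""
    x0, y0, x1, y1 = bbox
    scaled = [
        1000 * x0 / page_width,
        1000 * y0 / page_height,
        1000 * x1 / page_width,
        1000 * y1 / page_height,
    ]
    # Clamp so boxes that slightly overshoot the page stay inside the range.
    return [max(0, min(1000, int(v))) for v in scaled]


def build_layout_inputs(
    words: Sequence[str],
    boxes: Sequence[Box],
    page_size: Tuple[float, float],
) -> Dict[str, list]:
    """Pair each OCR word with its normalized box (hypothetical helper)."""
    if len(words) != len(boxes):
        raise ValueError("each word needs exactly one bounding box")
    width, height = page_size
    return {
        "words": list(words),
        "boxes": [normalize_bbox(b, width, height) for b in boxes],
    }


# Illustrative OCR output for a 500 x 1000 pixel page (made-up values).
example = build_layout_inputs(
    words=["Invoice", "Total:", "42.00"],
    boxes=[(50, 100, 150, 200), (50, 800, 125, 850), (400, 800, 500, 850)],
    page_size=(500, 1000),
)
```

The resulting `words` and `boxes` lists are the kind of paired text and layout input that a LayoutLMv2 tokenizer or processor would then combine with the page image.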

Metadata

Attribution-Non-Commercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou

Transformers

Other

Open

Sector Agnostic

20/08/25 11:45:33

0

Activity Overview

  • Downloads 0
  • Redirect 9
  • Views 152
  • File Size 0

Tags

  • Transformers
  • PyTorch
  • tensorflow
  • ONNX
  • safetensors
  • English
  • layoutlmv3
  • inference endpoints

License Control

Attribution-Non-Commercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to act as a neural SQL executor, predicting the results of SQL queries over given tables.
Transformers
SQLExecution
PreTrainedModel
TAPEX
DataRetrieval
NeuralExecutor
BART
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 143
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
BART
TableQuestionAnswering
FactVerification
PreTrainedModel
LargeModel
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 96
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
FactVerification
NaturalLanguageProcessing
Transformers
BART
DataValidation
FineTunedModel
TabFact
TAPEX
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 73
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
TableQuestionAnswering
NaturalLanguageProcessing
Transformers
BART
DataExtraction
FineTunedModel
WikiTableQuestions
  • See Upvoters 0
  • Downloads 2
  • File Size 0
  • Views 96
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
BART
TableQuestionAnswering
FactVerification
PreTrainedModel
TabularData
  • See Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 69
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
NaturalLanguageProcessing
TableQuestionAnswering
TAPEX
WikiTableQuestions
FineTunedModel
DataExtraction
BART
Transformers
  • See Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 81
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
Transformers
NaturalLanguageProcessing
SQLQueryGeneration
TAPEX
WikiSQL
FineTunedModel
DataRetrieval
BART
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 82
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
FactVerification
TAPEX
TabFact
FineTunedModel
DataValidation
BART
Transformers
NaturalLanguageProcessing
  • See Upvoters 0
  • Downloads 4
  • File Size 0
  • Views 74
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
DataExtraction
NaturalLanguageProcessing
Transformers
BART
TableQuestionAnswering
FineTunedModel
WikiSQL
TAPEX
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 104
Updated 7 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
inference endpoints
exbert
Bert
English
JAX
PyTorch
Fill-Mask
  • See Upvoters 0
  • Downloads 72
  • File Size 0
  • Views 967
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.