Indian Flag
Government Of India
A-
A
A+

microsoft/Phi-3-vision-128k-instruct

Phi-3-vision-128k-instruct is a state-of-the-art multimodal model by Microsoft, designed to process both text and visual inputs with a context length of up to 128,000 tokens.

About Model

As part of the Phi-3 model family, Phi-3-vision-128k-instruct combines text and vision modalities to perform complex reasoning tasks across large contexts. The model is trained on high-quality, reasoning-rich datasets, including synthetic data and filtered publicly available web content. It has undergone supervised fine-tuning and direct preference optimization to enhance instruction-following accuracy and safety. Phi-3-vision-128k-instruct excels in tasks such as image captioning, visual question answering, and document analysis. The model is available on platforms like Hugging Face and Azure AI Foundry, supporting applications that require deep multimodal comprehension.

microsoft/Phi-3-vision-128k-instruct

Metadata Metadata

MIT

Microsoft

Multimodal Language Model

N.A.

Open

Sector Agnostic

12/03/25 06:35:30

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 9
  • File Size 0
  • Views 170

Tags Tags

  • Transformers
  • Microsoft
  • NLP
  • Text Generation
  • Reasoning
  • Multimodal
  • Visual Question Answering
  • Image-to-Text
  • Long Context

License Control License Control

MIT

More Models from Microsoft Corporation (India) Pvt. Ltd. More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to simulate neural SQL execution, enabling the execution of SQL queries on given tables.
DataRetrieval
Transformers
BART
TAPEX
PreTrainedModel
NeuralExecutor
SQLExecution
  • See Upvoters0
  • Downloads6
  • File Size0
  • Views214
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
LargeModel
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views169
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
Transformers
TabFact
DataValidation
FactVerification
TAPEX
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views131
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
BART
Transformers
WikiTableQuestions
TableQuestionAnswering
DataExtraction
TAPEX
FineTunedModel
NaturalLanguageProcessing
  • See Upvoters0
  • Downloads2
  • File Size0
  • Views206
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
TabularData
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views122
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
WikiTableQuestions
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
DataExtraction
TableQuestionAnswering
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views157
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
DataRetrieval
SQLQueryGeneration
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
WikiSQL
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views147
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
FactVerification
DataValidation
TabFact
  • See Upvoters0
  • Downloads4
  • File Size0
  • Views123
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
WikiSQL
TableQuestionAnswering
DataExtraction
Transformers
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views174
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
exbert
Fill-Mask
inference endpoints
Bert
English
JAX
PyTorch
  • See Upvoters0
  • Downloads91
  • File Size0
  • Views1,473
Updated 1 year(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.