Government Of India

Phi-3-Medium-4K-Instruct ONNX-CUDA - Optimized AI Model for NVIDIA GPU Inference

An ONNX-optimized version of Phi-3-Medium-4K-Instruct, designed for fast, efficient inference on NVIDIA GPUs and available in FP16 and INT4 precision.

About Model

Phi-3-Medium-4K-Instruct ONNX-CUDA is a Microsoft model packaged for ONNX Runtime and optimized for execution on NVIDIA GPUs. This version is provided in FP16 and INT4 precision for low-latency inference, and it supports a 4K-token context length for structured reasoning and instruction-following tasks.
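To give a feel for why the choice between FP16 and INT4 matters on a GPU, the sketch below estimates weight storage at each precision. The parameter count is an assumption (Phi-3-Medium is commonly described as a roughly 14-billion-parameter model; this page does not state it), and the function name `weight_memory_gib` is hypothetical. The estimate covers weights only and ignores quantization metadata, activations, and the KV cache, so real memory use will be higher.

```python
# Rough weight-memory estimate for FP16 vs. INT4 (illustrative only).
# ASSUMPTION: ~14 billion parameters; this figure is not stated on this page.
ASSUMED_PARAMS = 14_000_000_000

# Bits used to store one weight at each precision.
BITS_PER_WEIGHT = {"fp16": 16, "int4": 4}


def weight_memory_gib(num_params: int, precision: str) -> float:
    """Approximate weight storage in GiB.

    Ignores quantization metadata (scales, zero points), activations,
    and the KV cache, so it is a lower bound rather than a measurement.
    """
    bits = BITS_PER_WEIGHT[precision]
    return num_params * bits / 8 / 2**30


if __name__ == "__main__":
    for precision in BITS_PER_WEIGHT:
        gib = weight_memory_gib(ASSUMED_PARAMS, precision)
        print(f"{precision}: ~{gib:.1f} GiB of weights")
```

Under these assumptions INT4 weights take about a quarter of the space of FP16 weights, which is the main reason an INT4 variant can fit on GPUs with less memory.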


Metadata

MIT

Microsoft

Text Generation

N.A.

Open

Sector Agnostic

12/03/25 06:35:39

0

Activity Overview

  • Downloads 0
  • Redirect 0
  • File Size 0
  • Views 181

Tags

  • Transformers
  • ONNX
  • Microsoft
  • NLP
  • Text Generation
  • Reasoning
  • Instruction Following
  • CUDA

License Control

MIT

More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to simulate neural SQL execution, enabling the execution of SQL queries on given tables.
DataRetrieval
Transformers
BART
TAPEX
PreTrainedModel
NeuralExecutor
SQLExecution
  • See Upvoters 0
  • Downloads 6
  • File Size 0
  • Views 204
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
LargeModel
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 159
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
Transformers
TabFact
DataValidation
FactVerification
TAPEX
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 120
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
BART
Transformers
WikiTableQuestions
TableQuestionAnswering
DataExtraction
TAPEX
FineTunedModel
NaturalLanguageProcessing
  • See Upvoters 0
  • Downloads 2
  • File Size 0
  • Views 179
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
TabularData
  • See Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 117
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
WikiTableQuestions
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
DataExtraction
TableQuestionAnswering
  • See Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 151
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
DataRetrieval
SQLQueryGeneration
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
WikiSQL
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 132
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
FactVerification
DataValidation
TabFact
  • See Upvoters 0
  • Downloads 4
  • File Size 0
  • Views 117
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
WikiSQL
TableQuestionAnswering
DataExtraction
Transformers
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 159
Updated 9 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
exbert
Fill-Mask
inference endpoints
Bert
English
JAX
PyTorch
  • See Upvoters 0
  • Downloads 91
  • File Size 0
  • Views 1,426
Updated 1 year(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.