Indian Flag
Government Of India
A-
A
A+

Orca 2-13Billion Parameter Model

A 13-billion-parameter language model fine-tuned from LLaMA-2, designed to enhance reasoning capabilities in tasks such as reading comprehension, math problem-solving, and text summarization.

About Model

Orca-2-13B is an advanced language model developed by Microsoft, fine-tuned from Meta's LLaMA-2 13B base model. This model focuses on bolstering the reasoning skills of smaller language models through training on a synthetic dataset tailored for complex reasoning tasks. It excels in single-turn interactions, effectively handling tasks like reasoning over user-provided data, reading comprehension, mathematical problem-solving, and text summarization. Evaluations indicate that Orca-2-13B outperforms baseline models of similar size and achieves performance levels comparable to models significantly larger, particularly in zero-shot reasoning scenarios. Despite its strengths, the model is not optimized for chat functionalities and lacks training involving reinforcement learning from human feedback (RLHF) or Direct Preference Optimization (DPO). Researchers interested in deploying the model for conversational purposes or specific applications are advised to undertake additional fine-tuning. Orca-2-13B is accessible under the Microsoft Research License, promoting further exploration in the field of small language model development and alignment.

Orca 2-13Billion Parameter Model

Metadata Metadata

Microsoft-research-license

Arindam Mitra and Luciano Del Corro and Shweti Mahajan and Andres Codas and Clarisse Simoes and Sahaj Agrawal and Xuxi Chen and Anastasia Razdaibiedina and Erik Jones and Kriti Aggarwal and Hamid Palangi and Guoqing Zheng and Corby Rosset and Hamed Khanpour and Ahmed Awadallah

Fine-Tuned LLaMA-2 Model

N.A.

Open

Sector Agnostic

12/03/25 06:34:54

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 8
  • File Size 0
  • Views 131

Tags Tags

  • Microsoft
  • LanguageModel
  • Orca2
  • Reasoning
  • Research
  • LLaMA2

License Control License Control

Microsoft-research-license

More Models from Microsoft Corporation (India) Pvt. Ltd. More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to simulate neural SQL execution, enabling the execution of SQL queries on given tables.
DataRetrieval
Transformers
BART
TAPEX
PreTrainedModel
NeuralExecutor
SQLExecution
  • See Upvoters0
  • Downloads6
  • File Size0
  • Views214
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
LargeModel
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views169
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
Transformers
TabFact
DataValidation
FactVerification
TAPEX
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views131
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
BART
Transformers
WikiTableQuestions
TableQuestionAnswering
DataExtraction
TAPEX
FineTunedModel
NaturalLanguageProcessing
  • See Upvoters0
  • Downloads2
  • File Size0
  • Views206
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
FactVerification
BART
TableQuestionAnswering
PreTrainedModel
TabularData
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views122
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
WikiTableQuestions
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
DataExtraction
TableQuestionAnswering
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views157
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
DataRetrieval
SQLQueryGeneration
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
WikiSQL
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views147
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
Transformers
BART
NaturalLanguageProcessing
FineTunedModel
TAPEX
FactVerification
DataValidation
TabFact
  • See Upvoters0
  • Downloads4
  • File Size0
  • Views123
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
WikiSQL
TableQuestionAnswering
DataExtraction
Transformers
FineTunedModel
NaturalLanguageProcessing
BART
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views174
Updated 10 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
exbert
Fill-Mask
inference endpoints
Bert
English
JAX
PyTorch
  • See Upvoters0
  • Downloads91
  • File Size0
  • Views1,473
Updated 1 year(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.