Indian Flag
Government Of India
A-
A
A+

Microsoft's - LLM2CLIP Llama3.2-1B-EVA02-L-14-336

A model that integrates the Llama3.2-1B language model with the EVA02-L-14-336 visual encoder to enhance cross-modal understanding and retrieval tasks.

About Model

The LLM2CLIP-Llama3.2-1B-EVA02-L-14-336 model is part of the LLM2CLIP series, aiming to extend the capabilities of CLIP models by combining Large Language Models (LLMs) with advanced visual encoders. This integration allows the model to process more detailed and extended textual descriptions, overcoming the context window limitations of traditional CLIP text encoders. By fine-tuning the LLM in the caption space using contrastive learning, the model enhances the textual discriminability of output embeddings. This approach leads to substantial improvements in cross-modal tasks, such as image-text retrieval and zero-shot image classification. Experiments have demonstrated that this method boosts performance significantly, transforming a CLIP model trained solely on English data into a state-of-the-art cross-lingual model. Moreover, when integrated into multimodal training with models like Llava 1.5, it consistently outperforms traditional CLIP models across nearly all benchmarks, showcasing comprehensive performance enhancements.

Microsoft's - LLM2CLIP Llama3.2-1B-EVA02-L-14-336

Metadata Metadata

Apache 2.0

Weiquan Huang and Aoqi Wu and Yifan Yang and Xufang Luo and Yuqing Yang and Liang Hu and Qi Dai and Xiyang Dai and Dongdong Chen and Chong Luo and Lili Qiu

vision foundation model, feature backbone

Other

Open

Sector Agnostic

20/08/25 05:43:55

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 2
  • Views 76
  • File Size 0

Tags Tags

  • LLM2CLIP
  • Llama3.2-1B
  • EVA02
  • CrossModal
  • ImageTextRetrieval
  • ZeroShotClassification

License Control License Control

Apache 2.0

More Models from Microsoft Corporation (India) Pvt. Ltd. More Models from Microsoft Corporation (India) Pvt. Ltd.

TAPEX: Large SQL Execution Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized TAPEX model pre-trained to simulate neural SQL execution, enabling the execution of SQL queries on given tables.
Transformers
SQLExecution
PreTrainedModel
TAPEX
DataRetrieval
NeuralExecutor
BART
  • See Upvoters0
  • Downloads6
  • File Size0
  • Views177
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Large Model (Table Pre-training via Learning a Neural SQL Executor)
A large-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
BART
TableQuestionAnswering
FactVerification
PreTrainedModel
LargeModel
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views134
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Large Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the TabFact dataset, designed to enhance performance in table-based fact verification tasks.
FactVerification
NaturalLanguageProcessing
Transformers
BART
DataValidation
FineTunedModel
TabFact
TAPEX
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views101
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX (Table Pre-training via Learning a Neural SQL Executor) Large Finetuned Model
A large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
TAPEX
TableQuestionAnswering
NaturalLanguageProcessing
Transformers
BART
DataExtraction
FineTunedModel
WikiTableQuestions
  • See Upvoters0
  • Downloads2
  • File Size0
  • Views136
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Model (Table Pre-training via Learning a Neural SQL Executor)
A base-sized pre-trained model designed to enhance table-based question answering and fact verification tasks.
BART
TableQuestionAnswering
FactVerification
PreTrainedModel
TabularData
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views95
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiTable Questions Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiTableQuestions dataset, designed to enhance performance in table-based question answering tasks.
NaturalLanguageProcessing
TableQuestionAnswering
TAPEX
WikiTableQuestions
FineTunedModel
DataExtraction
BART
Transformers
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views124
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: WikiSQL Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A large-sized TAPEX model fine-tuned on the WikiSQL dataset, optimized for translating natural language questions into SQL queries for effective table-based question answering.
Transformers
NaturalLanguageProcessing
SQLQueryGeneration
TAPEX
WikiSQL
FineTunedModel
DataRetrieval
BART
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views111
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: TabFact Data enabled Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the TabFact dataset, tailored for verifying the factual accuracy of textual statements against tabular data.
FactVerification
TAPEX
TabFact
FineTunedModel
DataValidation
BART
Transformers
NaturalLanguageProcessing
  • See Upvoters0
  • Downloads4
  • File Size0
  • Views100
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

TAPEX: Base Finetuned (Table Pre-training via Learning a Neural SQL Executor) Model
A base-sized TAPEX model fine-tuned on the WikiSQL dataset, designed to enhance performance in table-based question answering tasks.
DataExtraction
NaturalLanguageProcessing
Transformers
BART
TableQuestionAnswering
FineTunedModel
WikiSQL
TAPEX
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views139
Updated 8 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.

BiomedBERT - Domain-Specific Biomedical Language Model
A biomedical NLP model pre-trained from scratch on abstracts and full-text articles from PubMed and PubMed Central, achieving state-of-the-art performance on biomedical language understanding tasks.
Transformers
inference endpoints
exbert
Bert
English
JAX
PyTorch
Fill-Mask
  • See Upvoters0
  • Downloads88
  • File Size0
  • Views1,273
Updated 11 month(s) ago

MICROSOFT CORPORATION (INDIA) PVT. LTD.