A biomedical vision-language foundation model trained on PMC-15M, using PubMedBERT as the text encoder and a Vision Transformer as the image encoder, optimized for cross-modal retrieval, image classification, and visual question answering in medical AI applications.
BiomedCLIP is a state-of-the-art biomedical vision-language model designed for multimodal learning in medical AI. Developed by Microsoft, it is pre-trained on PMC-15M, a dataset of 15 million figure-caption pairs extracted from biomedical research articles in PubMed Central. The model combines:

1. PubMedBERT as the text encoder for domain-specific language understanding.
2. A Vision Transformer (ViT) as the image encoder, with adaptations for medical imaging tasks.

BiomedCLIP significantly outperforms prior vision-language models on a range of medical AI benchmarks and supports the following applications:

1. Cross-modal retrieval (text-to-image and image-to-text search).
2. Zero-shot classification of medical images (see the sketch after this description).
3. Visual question answering (VQA) in radiology and pathology.

Trained on a diverse range of medical imaging modalities, including radiography, microscopy, and histology, BiomedCLIP establishes new performance standards in biomedical vision-language tasks. However, the model is intended for research purposes only and is not suitable for clinical decision-making or commercial deployment. It serves as a valuable tool for AI researchers exploring multimodal medical applications in radiology, pathology, and beyond.
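As a rough illustration of the zero-shot classification and cross-modal retrieval workflow, the sketch below loads the publicly released checkpoint through the open_clip library and scores one image against candidate text labels. The Hugging Face hub identifier matches Microsoft's published release; the image path, label set, and prompt template are illustrative assumptions, not part of this model card.

```python
# Minimal zero-shot classification sketch for BiomedCLIP via open_clip.
# Assumptions: open_clip_torch and torch are installed; 'example_xray.png'
# is a placeholder image path; the labels and prompt template are
# illustrative choices, not prescribed by the model card.
import torch
import open_clip
from PIL import Image

HUB_ID = 'hf-hub:microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224'

# Load the pretrained model together with its matching image preprocessing.
model, preprocess = open_clip.create_model_from_pretrained(HUB_ID)
tokenizer = open_clip.get_tokenizer(HUB_ID)
model.eval()

# Candidate labels are embedded as short natural-language prompts.
labels = ['chest X-ray', 'brain MRI', 'histopathology slide']
texts = tokenizer([f'this is a photo of a {label}' for label in labels])
image = preprocess(Image.open('example_xray.png')).unsqueeze(0)

with torch.no_grad():
    # Embed both modalities into the shared space and L2-normalize.
    image_features = model.encode_image(image)
    text_features = model.encode_text(texts)
    image_features = image_features / image_features.norm(dim=-1, keepdim=True)
    text_features = text_features / text_features.norm(dim=-1, keepdim=True)

    # Cosine similarities double as retrieval scores; a softmax turns them
    # into zero-shot class probabilities for this image.
    similarity = image_features @ text_features.T
    probs = (100.0 * similarity).softmax(dim=-1).squeeze(0)

for label, p in zip(labels, probs.tolist()):
    print(f'{label}: {p:.3f}')
```

The same normalized embeddings support cross-modal retrieval: embed a pool of images once, then rank them by cosine similarity against a query text embedding (or vice versa for image-to-text search).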
MIT
Microsoft
Zero-Shot Image Classification
N.A.
Open
Healthcare, Wellness and Family Welfare
11/04/25 06:25:10
0