Parrotlet-e is a state of the art multilingual medical embedding model designed for understanding and linking medical terms across Indian languages. It is optimised for entity-level representation of clinical concepts such as symptoms, diagnoses, and anatomical structures — enabling accurate medical coding, semantic search, and cross-lingual retrieval in healthcare applications. The model is fine-tuned from bge-m3 using weakly supervised contrastive learning with Multi-Similarity Loss on over 18 million multilingual medical term pairs aligned with SNOMED CT and UMLS. It supports both native and romanized scripts across 12 Indic languages and English, and is robust to abbreviations, spelling variations, and colloquial expressions commonly found in clinical documentation. Indic Languages support: Hindi Kannada Marathi Malayalam Tamil Telugu Odia Assamese Bengali Urdu Gujarati Punjabi
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
ekacare
Multilingual Model
PyTorch
Open
Healthcare, Wellness and Family Welfare
13/11/25 06:12:02
0
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.