It is a high-quality parallel corpus for Hindi to 19+ Indian languages, specifically curated for the Health and Public Health domain. It covers disease awareness, symptoms, prevention, immunization, and maternal-child health advisories.
Following the HIN- convention, this dataset contains manually verified translations: Path: Health_v2 / HIN-XXX / source_reviewed / HEALTH / *.txt Quality: All data is source-reviewed (manually translated and verified). Data Format (TSV) Tab-separated files with the following headerless structure: id | src_hi | tgt_ | domain="health"
Nmt Training: Fine-tuning Models For Medical And Healthcare-related Terminology. Health-tech: Powering Multilingual Health Assistants, Symptom Checkers, And Public Awareness Tools.
Attribution 4.0 International (CC BY- 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.