The Hindi to Indian Languages Science and Technology Translation Dataset is a parallel corpus for translating Hindi into multiple Indian languages in the science and technology domain. It includes content such as popular science explainers, technology magazines, digital service instructions, research announcements, and STEM education snippets, and it supports multilingual machine translation and cross-lingual NLP research.
Science-and-Technology_v2: Multilingual STEM Translation

Science-and-Technology_v2 (SAT_v2) is a specialized parallel corpus covering popular science explainers, research announcements, and digital service instructions.

Folder Structure & Quality

Following the - convention, this dataset includes manually verified translations:

Path: Science-and-Technology_v2 / - / source_reviewed / SAT / *.txt
Quality: All files in this version are source-reviewed…

See the full description on the dataset page: https://huggingface.co/datasets/coild-aikosh/Science-and-Technology_v2.
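The path layout described above can be walked with a few lines of Python. This is a minimal sketch, assuming the dataset has already been downloaded to a local folder. The function name `list_sat_files` is hypothetical, and because the naming of the directory directly under the dataset root is not spelled out in the description, the sketch treats it as an opaque wildcard rather than guessing its format.

```python
from collections import defaultdict
from pathlib import Path


def list_sat_files(root):
    """Group SAT .txt files by the directory directly under the dataset root.

    Assumes the layout <root>/<dir>/source_reviewed/SAT/*.txt. The meaning
    of <dir> is not specified in the description, so it is used only as a
    grouping key.
    """
    root = Path(root)
    files_by_dir = defaultdict(list)
    # Match only files that sit under source_reviewed/SAT, one level below root.
    for path in sorted(root.glob("*/source_reviewed/SAT/*.txt")):
        # Relative parts are (<dir>, "source_reviewed", "SAT", <file name>).
        top_dir = path.relative_to(root).parts[0]
        files_by_dir[top_dir].append(path)
    return dict(files_by_dir)
```

Calling `list_sat_files("Science-and-Technology_v2")` on a local copy returns a dictionary keyed by each top-level directory name, with the sorted list of text files found under its `source_reviewed/SAT` folder. An empty dictionary means no files matched the assumed layout.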
NMT training: fine-tuning models for accuracy in technical, scientific, and digital domains. Digital literacy: powering tools that translate complex technology concepts into regional Indian languages.
Attribution 4.0 International (CC BY 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.