Bhashini's Bengali-Punjabi Translation Benchmark is a detailed text dataset for testing machine translation quality. It includes document-level information and helps researchers build better multilingual translation systems.
The dataset NTREX_bn_pa_benchmark provides news test references for Machine Translation (MT) evaluation, focusing on translations from Bengali to Punjabi. Part of a broader collection supporting translations into 128 target languages, this dataset includes document-level information, making it ideal for multilingual MT benchmarking. Tailored for the news domain, it serves as a comprehensive resource for assessing translation quality and advancing translation systems. Submitted by Microsoft, this dataset is a critical tool for researchers and developers working on Bengali-to-Punjabi translation tasks.
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.