Bhashini's Bengali-English Translation Benchmark is a text dataset for evaluating machine translation quality. It includes document-level information and supports research on multilingual translation systems.
The NTREX_bn_en_benchmark dataset provides news test references for evaluating Machine Translation (MT) from Bengali to English. It is part of the NTREX collection, which covers 128 target languages, and it includes document-level information, so it can be used to benchmark multilingual MT models at the document level as well as the sentence level. The dataset is designed for the news domain and serves as a testbed for evaluating translation quality. It was submitted by Microsoft and is intended for researchers and developers working on Bengali-to-English MT tasks.
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.