Bengali to Sindhi Translation Benchmark Dataset

Bhashini's Bengali-Sindhi Translation Benchmark is a detailed text dataset for testing machine translation quality. It includes document-level information and helps researchers build better multilingual translation systems.

About Dataset

The NTREX_bn_sd_benchmark dataset provides news test references for Machine Translation (MT) evaluation from Bengali to Sindhi. It is part of a larger collection that covers 128 target languages, and it includes document-level information, which makes it suitable for multilingual MT benchmarking. The data comes from the news domain and is intended for assessing translation quality and for developing more robust translation systems. The dataset was submitted by Microsoft.
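As a rough illustration of how a benchmark like this might be used, the sketch below loads source/reference pairs and scores a translation function against the references. The file schema is not documented on this page, so the field names `source` and `target` are assumptions, as are the helper names `load_pairs`, `char_f1`, and `evaluate`. The scoring function is a simplified character n-gram F1, a stand-in for an established metric such as chrF, not a replacement for it.

```python
import json
from collections import Counter

# ASSUMPTION: data.json is a JSON array of objects with a Bengali "source"
# string and a Sindhi "target" string. The real schema may differ.
SRC_KEY = "source"
TGT_KEY = "target"


def load_pairs(json_text, src_key=SRC_KEY, tgt_key=TGT_KEY):
    """Parse JSON text and return a list of (source, reference) tuples."""
    records = json.loads(json_text)
    return [(r[src_key], r[tgt_key]) for r in records]


def char_ngrams(text, n):
    """Count character n-grams after removing all whitespace."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def char_f1(hypothesis, reference, n=2):
    """Character n-gram F1 between one hypothesis and one reference."""
    hyp = char_ngrams(hypothesis, n)
    ref = char_ngrams(reference, n)
    overlap = sum((hyp & ref).values())  # clipped n-gram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def evaluate(translate, pairs, n=2):
    """Average char_f1 of translate(source) against each reference."""
    scores = [char_f1(translate(src), ref, n) for src, ref in pairs]
    return sum(scores) / len(scores) if scores else 0.0
```

To try it on the real file, one would read `data.json` as UTF-8 text, pass it to `load_pairs` with the actual field names, and call `evaluate` with the translation system under test as `translate`.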

Activity Overview

  • Downloads 6
  • Views 79
  • File Size 1.13 MB

Tags

  • Translation
  • Document-Level Evaluation
  • NLP Dataset
  • Language Modeling
  • Bilingual Translation
  • Bengali-Sindhi
  • Benchmark
  • News Domain
  • Machine Translation
  • Microsoft

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

data.json (1.13 MB)


Data Quality Score (Beta)

Version Control

Version 1 (1.13 MB)
  • admin · 11 month(s) ago
    • application/json
      data.json
    • application/json
      params.json