Indian Flag
Government Of India
A-
A
A+
IndicCorpV2

IndicCorpV2

Monolingual Corpora for Indic Languages

About Dataset

Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages

This repository contains the pretraining data for the paper published at ACL 2023.

Activity Overview Activity Overview

  • Downloads0
  • Redirect 11
  • Views 33
  • File Size 0

Tags Tags

  • text
  • multilingual corpus
  • Indic Languages

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)