Government Of India
bhasha‑wiki-Kannada

About Dataset

Kannada is written in the Kannada script and is spoken mainly in the southern Indian state of Karnataka. It is a major regional language with a strong presence in literature, education, and governance. Including Kannada in the soketlabs/bhasha-wiki dataset helps language models engage with local users, understand regional contexts, and support linguistic diversity in the Indian subcontinent.


Note on Encoding:
This dataset is encoded in UTF-8.

  • Windows users:
    To display non-ASCII characters correctly in Excel, first download the .csv file, open it in Notepad, choose File → Save As, and select "UTF-8 with BOM" as the encoding. Then open the saved file in Excel.

  • macOS users:
    You can open the CSV file directly in Excel or any other spreadsheet software without issues.
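
The Notepad step for Windows users can also be scripted. The following is a minimal sketch, assuming Python 3 is installed; the function name add_utf8_bom and the file names in the usage note are hypothetical and not part of the dataset. It prepends the UTF-8 byte order mark to a copy of the file so that Excel detects the encoding, and leaves the content unchanged otherwise.

```python
import codecs
from pathlib import Path


def add_utf8_bom(src: str, dst: str) -> None:
    """Copy a UTF-8 CSV file to dst, adding a UTF-8 BOM if it has none."""
    data = Path(src).read_bytes()
    if not data.startswith(codecs.BOM_UTF8):
        # Fail early (UnicodeDecodeError) if the input is not valid UTF-8.
        data.decode("utf-8")
        data = codecs.BOM_UTF8 + data
    # Writing bytes keeps the original line endings untouched.
    Path(dst).write_bytes(data)
```

For example, add_utf8_bom("kannada.csv", "kannada_bom.csv") would write a copy that can then be opened in Excel. Working on raw bytes rather than decoded text avoids any newline translation, so the rows are preserved exactly.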

Activity Overview

  • Downloads 0
  • Redirect 8
  • Views 102
  • File Size 0

Tags

  • Kannada
  • cross-lingual NLP
  • multilingual NLP
  • indicnlp
  • natural language processing (NLP)
  • language research
  • multi-modal language resources

License Control

Attribution 4.0 International (CC BY 4.0)