**Santham** is a high-quality, curated parallel corpus for Sanskrit-Tamil machine translation. It also includes 1,000 benchmark sentences.
Sanskrit poetry frequently relies on a complex metrical order that hinders direct translation. This repository contains *anvaya* (prose-order) data mapped to the poetry, which serves as an intermediate translation bridge.

| Subset | Count | Description |
| --- | --- | --- |
| *Anvaya* | 10,146 | Poetry data mapped to *anvaya* (reordered) data. |
Sanskrit-Tamil translation dataset using *anvaya* as the source.
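As a rough illustration of what "using *anvaya* as the source" could mean in practice, the sketch below builds (source, target) training pairs from records that carry a verse, its *anvaya*, and a Tamil translation. The record layout, the field names (`poetry`, `anvaya`, `tamil`), and the helper `build_pairs` are all hypothetical: this page does not describe the file format, so the code would need adapting to the actual files.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class AnvayaRecord:
    """One corpus entry. The three field names are assumptions, not the dataset's schema."""
    poetry: str  # verse in its original metrical word order
    anvaya: str  # the same verse reordered into prose order
    tamil: str   # Tamil translation


def build_pairs(records: Iterable[AnvayaRecord],
                use_anvaya: bool = True) -> List[Tuple[str, str]]:
    """Return (source, target) pairs, taking either the anvaya or the raw verse as source."""
    pairs: List[Tuple[str, str]] = []
    for rec in records:
        source = (rec.anvaya if use_anvaya else rec.poetry).strip()
        target = rec.tamil.strip()
        # Skip entries where either side is empty after trimming whitespace.
        if source and target:
            pairs.append((source, target))
    return pairs
```

Calling `build_pairs(records)` yields pairs with the prose-order text on the source side, while `build_pairs(records, use_anvaya=False)` yields the direct poetry-to-Tamil pairs, which makes it easy to compare the two setups on the same records.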
Attribution 4.0 International (CC BY 4.0)
2 files