Indian Flag
Government Of India
A-
A
A+

Text Analytics Assistance by NIC (TAANI)

TAANI is a suite of AI-based text analytics services for transliteration, translation, summarization, text deduplication, transcription, and PII masking in Indian languages.

About Use Case

TAANI is a comprehensive AI-based text analytics platform developed by NIC, offering modular language solutions across 22 Indian languages with 14+ crore hits in FY 2024–25. The platform includes:

  1. AI Matra – Transliteration service for converting between Indic languages and English, allowing users to input text using an English keyboard. It is widely used in health services (e.g., for Sickle Cell Anaemia cards). It also supports multilingual -script recognition for both native and romanised mixed mode text  models built in collaboration with IIT Roorkee.
  2. AI Panini – Translation services across 22 regional languages. A specialized instance trained on legal corpus is deployed at the Supreme Court for English-Hindi translation.
  3. AI Saransh – Extractive & Abstractive text summarization in English, used in multilingual summarization pipelines to generate summaries that can  be translated in desired Indic language.
  4. AI Shruti – Real-time transcription of streaming audio into text in 9 Indian languages including Hindi, Marathi, Tamil, Bengali, and more.
  5. AI Nibhrit – AI-driven PDF redaction tool to mask PAN, Aadhaar, and fingerprints data, and extract handwritten annotations and highlights. Deployed by RERA in Himachal and the Directorate of Stamps and Revenue in 12 states. It is a copyrighted service of NIC.

Key Differentiators

  • Full-stack multilingual text analytics covering transliteration, translation, summarization, deduplication and transcription
  • Developed and hosted on Meghraj Cloud, accessible over NICNET
  • Legal domain translation engine adopted by the Supreme Court
  • Real-time streaming transcription across 9 Indian languages
  • Advanced PDF redaction including handwritten notes and image-based data masking
  • Nibhrit Copyrighted by NIC

Source Organization Source Organization

National Informatics Centre

Tags Tags

  • Translation
  • Legal Tech
  • Language AI
  • Text Analytics
  • PII Masking

Tags Sector

Governance and Administration