Sooktam-2 is a multilingual Indic Text-to-Speech model by BharatGen supporting 12 languages including Hindi, Marathi, Tamil, Telugu, Bengali, Urdu, Punjabi and Indian English. It enables high-quality speech synthesis with reference-guided voice conditioning, preserving speaker voice, accent and prosody for natural and expressive generation.
Sooktam-2 is a sovereign multilingual Text-to-Speech (TTS) model developed by BharatGen. It generates natural and expressive speech across major Indian languages using reference-guided voice conditioning, preserving the speaker’s voice, accent, and cultural cadence. Supported Languages (12) Hindi · Marathi · Gujarati · Tamil · Telugu · Kannada · Bengali · Malayalam · Odia · Urdu · Punjabi · Indian English Key Capabilities Reference-guided voice cloning Multilingual Indic speech synthesis Natural prosody and expressive speech generation Language-aware CLS tokenization for accurate Indic phonetics Production-ready audio quality for scalable deployment
Other
bharatgenai
Transformers
PyTorch
Restricted
Other
09/03/26 08:11:25
1.25 GB
Other
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.