ORGANISATION

Shrutam-2

Shrutam-2 is an LLM-based automatic speech recognition (ASR) system for 12 major Indian languages. It bridges a Conformer speech encoder with a pretrained LLM decoder through a Mixture-of-Experts (MoE) projection layer, enabling high-quality, prompt-controllable transcription across diverse Indic languages.

BharatGen

About Model

Unlike conventional CTC or attention-based ASR systems that map audio directly to text tokens, Shrutam-2 reframes speech recognition as a conditional language generation task. A Conformer speech encoder produces frame-level audio representations, which the MoE projection layer maps into the LLM's embedding space. The projected audio embeddings are then fed to a frozen LLM decoder alongside a text prompt.
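The data flow described above can be illustrated with a small, dependency-free sketch. It is not the Shrutam-2 implementation: the class name `MoEProjector`, the helper `build_decoder_inputs`, the toy dimensions, the softmax-weighted (dense) mixing of experts, and the prompt-before-audio ordering are all assumptions made for illustration, since the page does not specify the routing scheme or the input layout.

```python
import math
import random


def matvec(w, x):
    """Multiply matrix w (a list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def rand_matrix(rows, cols, rng):
    """Small random weights standing in for trained parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class MoEProjector:
    """Hypothetical MoE projection from encoder space to LLM embedding space."""

    def __init__(self, d_enc, d_llm, n_experts, seed=0):
        rng = random.Random(seed)
        self.gate = rand_matrix(n_experts, d_enc, rng)
        self.experts = [rand_matrix(d_llm, d_enc, rng) for _ in range(n_experts)]

    def gate_weights(self, frame):
        """One mixing weight per expert for this frame; weights sum to 1."""
        return softmax(matvec(self.gate, frame))

    def project(self, frame):
        """Gate-weighted sum of all expert outputs (dense mixing is an assumption)."""
        weights = self.gate_weights(frame)
        outputs = [matvec(w, frame) for w in self.experts]
        d_llm = len(outputs[0])
        return [sum(g * out[i] for g, out in zip(weights, outputs))
                for i in range(d_llm)]


def build_decoder_inputs(prompt_embeddings, frames, projector):
    """Concatenate prompt embeddings with projected audio frames.

    Placing the prompt first is an assumption; the page only says the
    audio embeddings are fed to the decoder alongside a text prompt.
    """
    audio_embeddings = [projector.project(f) for f in frames]
    return prompt_embeddings + audio_embeddings


# Toy usage: 5 encoder frames of width 8, 3 prompt embeddings of width 16.
rng = random.Random(1)
D_ENC, D_LLM = 8, 16
frames = [[rng.gauss(0, 1) for _ in range(D_ENC)] for _ in range(5)]
prompt = [[0.0] * D_LLM for _ in range(3)]
projector = MoEProjector(D_ENC, D_LLM, n_experts=4)
decoder_inputs = build_decoder_inputs(prompt, frames, projector)
print(len(decoder_inputs), len(decoder_inputs[0]))  # prints: 8 16
```

In this sketch only the projector would carry trainable weights between the encoder and the decoder, which is consistent with the decoder being described as frozen. A real system would use learned parameters and tensor operations rather than Python lists.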


Metadata

License

Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

Hosted By

bharatgenai

Model Type

Automatic Speech Recognition Model

Model Format

PyTorch

Visibility

Restricted

Source Organisation

BharatGen

Sector

Sector Agnostic

Updated Date & Time

18/05/26 11:24:38

Created By

Abhay Vijayvargiya

Size

8.37 GB

Shrutam-2 (1 directory)

Shrutam-2

13 files, 1 directory

Activity Overview

0
1
8.37 GB
146

License Control

Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

Version Control

Version 1 (8.37 GB)

admin · 1 month(s) ago
- Shrutam-2

More Models from BharatGen

sooktam2

Sooktam-2 is a multilingual Indic Text-to-Speech model by BharatGen supporting 12 languages including Hindi, Marathi, Tamil, Telugu, Bengali, Urdu, Punjabi and Indian English. It enables high-quality speech synthesis with reference-guided voice conditioning, preserving speaker voice, accent and prosody for natural and expressive generation.

Text to Speech

Multilingual

f5-tts

sooktam2

tts

indic

Updated 1 month(s) ago

BHARATGEN


Shrutam-2

Speech-to-Text

Automatic Speech Recognition

0
1
8.37 GB
147

Updated 1 month(s) ago

BHARATGEN


Param-1-5B

Param-1-5B is a bilingual (English–Hindi) large language model developed under the Param-1 family. With 5 billion parameters, it extends the capabilities of Param-1-2.9B with enhanced mathematical reasoning and code understanding and generation. The model is pretrained from scratch and designed to serve as a strong foundation for downstream tasks such as mathematical problem solving and code understanding and generation.

pretrained

0
1
10.42 GB
86

Updated 1 month(s) ago

BHARATGEN


Param-1-Instruct

BharatGen introduces an early Supervised Fine-Tuned (SFT) checkpoint of Param 1, a bilingual language model trained from scratch in English and Hindi. With 2.9 billion parameters, this checkpoint builds on the pretraining phase and serves as a foundation for further downstream tasks, safety testing, and customization.

QnA

Instruction-Tuning

Model Fine-Tuning

0
17
5.36 GB
70

Updated 1 month(s) ago

BHARATGEN


BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Param1 is a 2.9 billion parameter language model pretrained on English and Hindi, designed for text completion.

Large Language Model

4
713
13.79 GB
20,390

Updated 1 month(s) ago

BHARATGEN


Param2-17B-Thinking

BharatGen presents Param-2-17B-MoE-A2.4B, a large-scale Mixture-of-Experts (MoE) language model designed to deliver high model capacity while retaining the inference efficiency of a much smaller dense model. It uses a Hybrid MoE architecture with 17B total parameters, while activating only 2.4B parameters per token.

Mixture of Experts

pretrained

Multilingual Text

1
62
57.29 GB
2,195

Updated 3 month(s) ago

BHARATGEN


BharatGen Multilingual TTS - Sooktam2

Text-to-Speech

Audio Synthesis

sooktam2

Multilingual Speech

multilingual-TTS

0
22
1.25 GB
1,544

Updated 3 month(s) ago

BHARATGEN


BharatGen - Param-1-7B-MoE Advancing Multilingual GenAI for India

Param-1-7B-MoE is a multilingual large language model developed under the Param-1 family as part of BharatGen – A Suite of Generative AI Technologies for India. With 7 billion parameters and a Mixture of Experts (MoE) architecture, the model is designed to better understand and generate text across English, Hindi, and 14 additional Indian languages. The model is pretrained from scratch with a strong focus on linguistic diversity, cultural context, and large-scale multilingual representation.

safetensors

mixtral

region:us

1
83
0
1,449

Updated 5 month(s) ago

BHARATGEN


BharatGen-AgriParam

Large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality, India-centric agriculture dataset.

Multiturn

QnA

Updated 6 month(s) ago

BHARATGEN


BharatGen-FinanceParam

Large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality finance dataset.

Multiturn

QnA

Updated 6 month(s) ago

BHARATGEN

