Home/Models/BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

ORGANISATION

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Param1 is a 2.9 billion parameter language model pretrained on English and Hindi, designed for text completion.

About Model

Param 1 is a 2.9-billion parameter foundation model developed for English and Hindi, capable for text generation and completion. Pretrained on high-quality, culturally rich datasets from diverse Indian domains approximately on 5 Trillion Tokens combined for English and Hindi, it delivers better performance on bilingual tasks while maintaining computational efficiency, outperforming several models of similar size and task scope on standard benchmarks. Param 1 is developed by BharatGen: A Suite of Generative AI Tech for India.
For any queries, please visit https://bharatgen.discourse.group/invites/BcouFsKk4g

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Metadata

License

MIT

Hosted By

Kundeshwar Pundalik, Piyush Sawarkar, Vedant Goswami, Ajay Nagpal, Smita Gautam, Bhagwan Panditi, Adugani Vanjari Akanksh, Pankaj Singh, Rishi Bal, Prof. Rohit Saluja, Prof. Ganesh Ramakrishnan

Task Type

Text Generation

Model Format

Transformers

Visibility

Open

Source Organisation

BharatGen

Sector

Science, Technology and Research

Updated Date & Time

18/07/25 08:53:23

Created By

Kundeshwar Vijay Pundalik

Size

13.79 GB

Param_1 ( 4 files, 1 directories )

model_extracted

2 files, 1 directories

LICENSE

1.04 KB

model.nemo

5.33 GB

nemo_inference.sh

649 Bytes

README.md

7.05 KB

Activity Overview

4
714
8.44 GB
20,422

License Control

MIT

Version Control

Version 1(8.44 GB)

admin·1 year(s) ago
- Param_1
  model_extracted
  LICENSE
  model.nemo
  nemo_inference.sh
  README.md

More Models from BharatGen

sooktam2

Sooktam-2 is a multilingual Indic Text-to-Speech model by BharatGen supporting 12 languages including Hindi, Marathi, Tamil, Telugu, Bengali, Urdu, Punjabi and Indian English. It enables high-quality speech synthesis with reference-guided voice conditioning, preserving speaker voice, accent and prosody for natural and expressive generation.

Text to Speech

Multilingual

f5-tts

sooktam2

tts

indic

Updated 1 month(s) ago

BHARATGEN

View Details

Shrutam-2

Shrutam-2 is a LLM based automatic speech recognition system for 12 major Indian languages. It bridges a Conformer speech encoder with a pretrained LLM decoder through a Mixture-of-Experts (MoE) projection layer, enabling high-quality, prompt-controllable transcription across diverse Indic languages.

Speech-to-Text

Automatic Speech Recognition

0
1
8.37 GB
149

Updated 1 month(s) ago

BHARATGEN

View Details

Param-1-5B

Param-1-5B is a bilingual (English–Hindi) large language model developed under the Param-1 family. With 5 billion parameters, this model extends the capabilities of Param-1-2.9B by incorporating enhanced mathematical reasoning and code understanding/generation. The model is pretrained from scratch and designed to serve as a strong foundation for downstream tasks such as mathematical problem solving, and code-related understanding / generation.

pretrained

0
1
10.42 GB
88

Updated 1 month(s) ago

BHARATGEN

View Details

Param-1-Instruct

BharatGen introduces the early checkpoint of SFT (Supervised Fine-Tuned) for Param 1, a bilingual language model trained from scratch in English and Hindi. With 2.9 billion parameters, this checkpoint builds upon the pretraining phase and serves as a foundation for more downstream tasks, safety testing, and customization.

QnA

Instruction-Tuning

Model Fine-Tuning

0
17
5.36 GB
72

Updated 1 month(s) ago

BHARATGEN

View Details

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Param1 is a 2.9 billion parameter language model pretrained on English and Hindi, designed for text completion.

Large Language Model

4
714
13.79 GB
20,423

Updated 2 month(s) ago

BHARATGEN

View Details

Param2-17B-Thinking

BharatGen presents Param-2-17B-MoE-A2.4B, a large-scale Mixture-of-Experts (MoE) language model designed to deliver high model capacity while retaining the inference efficiency of a much smaller dense model. It uses a Hybrid MoE architecture with 17B total parameters, while activating only 2.4B parameters per token.

Mixture of Experts

pretrained

Multilingual Text

1
62
57.29 GB
2,217

Updated 3 month(s) ago

BHARATGEN

View Details

BharatGen Multilingual TTS - Sooktam2

Text-to-Speech

Audio Synthesis

sooktam2

Multilingual Speech

multilingual-TTS

0
23
1.25 GB
1,557

Updated 3 month(s) ago

BHARATGEN

View Details

BharatGen - Param-1-7B-MoE Advancing Multilingual GenAI for India

Param-1-7B-MoE is a multilingual large language model developed under the Param-1 family as part of BharatGen – A Suite of Generative AI Technologies for India. With 7 billion parameters and a Mixture of Experts (MoE) architecture, the model is designed to better understand and generate text across English, Hindi, and 14 additional Indian languages. The model is pretrained from scratch with a strong focus on linguistic diversity, cultural context, and large-scale multilingual representation.

safetensors

mixtral

region:us

1
85
0
1,466

Updated 6 month(s) ago

BHARATGEN

View Details

BharatGen-AgriParam

Large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality, India-centric agriculture dataset.

Multiturn

QnA

Updated 6 month(s) ago

BHARATGEN

View Details

BharatGen-FinanceParam

large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality finance dataset.

Multiturn

QnA

Updated 6 month(s) ago

BHARATGEN

View Details

Accessibility options by UX4G

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

About Model

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Metadata