BharatGen - Param 1 Indic-Scale Bilingual Foundation Model

Param 1 is a 2.9-billion-parameter language model pretrained on English and Hindi and designed for text completion.

About Model

Param 1 is a 2.9-billion-parameter foundation model developed for English and Hindi, capable of text generation and completion. It was pretrained on approximately 5 trillion tokens of English and Hindi text combined, drawn from high-quality, culturally rich datasets spanning diverse Indian domains. It delivers strong performance on bilingual tasks while remaining computationally efficient, outperforming several models of similar size and task scope on standard benchmarks. Param 1 is developed by BharatGen: A Suite of Generative AI Tech for India.
For any queries, please visit https://bharatgen.discourse.group/invites/BcouFsKk4g

Metadata

MIT

Kundeshwar Pundalik, Piyush Sawarkar, Vedant Goswami, Ajay Nagpal, Smita Gautam, Bhagwan Panditi, Adugani Vanjari Akanksh, Pankaj Singh, Rishi Bal, Prof. Rohit Saluja, Prof. Ganesh Ramakrishnan

Text Generation

Transformers

Open

BharatGen

Science, Technology and Research

18/07/25 08:53:23

13.79 GB

Param_1 (4 files, 1 directory)

  • model_extracted (directory: 2 files, 1 directory)
  • LICENSE (1.04 KB)
  • model.nemo (5.33 GB)
  • nemo_inference.sh (649 bytes)
  • README.md (text/markdown, 7.05 KB)
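
The file listing above includes `model.nemo`. NeMo checkpoints with the `.nemo` extension are tar archives, so their contents can be inspected without loading the model. The sketch below is a minimal illustration under that assumption; the helper name `list_nemo_members` and the local path `Param_1/model.nemo` are hypothetical and not taken from the model's README.

```python
import tarfile
from pathlib import Path


def list_nemo_members(nemo_path):
    """Return (name, size_in_bytes) pairs for every regular file in a .nemo archive."""
    path = str(nemo_path)
    if not tarfile.is_tarfile(path):
        raise ValueError(f"{path} is not a tar archive")
    # "r:*" lets tarfile detect the compression (plain tar or gzip) on its own.
    with tarfile.open(path, "r:*") as archive:
        return [(m.name, m.size) for m in archive.getmembers() if m.isfile()]


if __name__ == "__main__":
    # Hypothetical download location; adjust to wherever model.nemo was saved.
    local_path = Path("Param_1/model.nemo")
    if local_path.exists():
        for name, size in list_nemo_members(local_path):
            print(f"{size:>14,d}  {name}")
```

Listing the members first is a cheap way to check that a multi-gigabyte download is intact before extracting it or passing it to an inference script.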

Activity Overview

  • Upvoters 4
  • Downloads 691
  • File Size 8.44 GB
  • Views 19,668

Tags

  • Large Language Model

License Control

MIT

Version Control

Version 1 (8.44 GB)
  • admin · 1 year(s) ago
    • Folder: Param_1
      • Folder: model_extracted
      • LICENSE
      • model.nemo
      • nemo_inference.sh
      • README.md (text/markdown)

More Models from BharatGen

Shrutam-2
Shrutam-2 is an LLM-based automatic speech recognition system for 12 major Indian languages. It bridges a Conformer speech encoder with a pretrained LLM decoder through a Mixture-of-Experts (MoE) projection layer, enabling high-quality, prompt-controllable transcription across diverse Indic languages.
Automatic Speech Recognition
Speech-to-Text
  • Upvoters 0
  • Downloads 0
  • File Size 8.37 GB
  • Views 23
Updated 4 day(s) ago

BHARATGEN

Param-1-5B
Param-1-5B is a bilingual (English–Hindi) large language model developed under the Param-1 family. With 5 billion parameters, this model extends the capabilities of Param-1-2.9B by incorporating enhanced mathematical reasoning and code understanding and generation. The model is pretrained from scratch and designed to serve as a strong foundation for downstream tasks such as mathematical problem solving and code understanding and generation.
pretrained
  • Upvoters 0
  • Downloads 0
  • File Size 10.42 GB
  • Views 28
Updated 14 day(s) ago

BHARATGEN

Param-1-Instruct
BharatGen introduces an early supervised fine-tuned (SFT) checkpoint of Param 1, a bilingual language model trained from scratch on English and Hindi. With 2.9 billion parameters, this checkpoint builds on the pretraining phase and serves as a foundation for further downstream tasks, safety testing, and customization.
QnA
Model Fine-Tuning
Instruction-Tuning
  • Upvoters 0
  • Downloads 10
  • File Size 5.36 GB
  • Views 24
Updated 14 day(s) ago

BHARATGEN

Param2-17B-Thinking
BharatGen presents Param-2-17B-MoE-A2.4B, a large-scale Mixture-of-Experts (MoE) language model designed to deliver high model capacity while retaining the inference efficiency of a much smaller dense model. It uses a Hybrid MoE architecture with 17B total parameters, while activating only 2.4B parameters per token.
Multilingual Text
pretrained
Mixture of Experts
  • Upvoters 1
  • Downloads 45
  • File Size 57.29 GB
  • Views 1,180
Updated 2 month(s) ago

BHARATGEN

BharatGen Multilingual TTS - Sooktam2
Sooktam-2 is a multilingual Indic Text-to-Speech model by BharatGen supporting 12 languages including Hindi, Marathi, Tamil, Telugu, Bengali, Urdu, Punjabi and Indian English. It enables high-quality speech synthesis with reference-guided voice conditioning, preserving speaker voice, accent and prosody for natural and expressive generation.
multilingual-TTS
Text-to-Speech
Multilingual Speech
Audio Synthesis
sooktam2
  • Upvoters 0
  • Downloads 9
  • File Size 1.25 GB
  • Views 891
Updated 2 month(s) ago

BHARATGEN

BharatGen - Param-1-7B-MoE Advancing Multilingual GenAI for India
Param-1-7B-MoE is a multilingual large language model developed under the Param-1 family as part of BharatGen – A Suite of Generative AI Technologies for India. With 7 billion parameters and a Mixture of Experts (MoE) architecture, the model is designed to better understand and generate text across English, Hindi, and 14 additional Indian languages. The model is pretrained from scratch with a strong focus on linguistic diversity, cultural context, and large-scale multilingual representation.
safetensors
mixtral
region:us
  • Upvoters 1
  • Downloads 82
  • File Size 0
  • Views 1,228
Updated 4 month(s) ago

BHARATGEN

BharatGen-AgriParam
Large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality, India-centric agriculture dataset.
Multiturn
QnA
  • Upvoters 0
  • Downloads 9
  • File Size 0
  • Views 106
Updated 5 month(s) ago

BHARATGEN

BharatGen-FinanceParam
Large language model fine-tuned from Param-1-2.9B-Instruct on a high-quality finance dataset.
QnA
Multiturn
  • Upvoters 0
  • Downloads 18
  • File Size 0
  • Views 175
Updated 5 month(s) ago

BHARATGEN

BharatGen-LegalParam
Large language model fine-tuned from Param-1-2.9B-Instruct on an exhaustive India-centric legal dataset.
QnA
Multiturn
Summarization
  • Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 66
Updated 5 month(s) ago

BHARATGEN