BharatGen introduces an early SFT (supervised fine-tuning) checkpoint of Param 1, a bilingual language model trained from scratch on English and Hindi. With 2.9 billion parameters, this checkpoint builds on the pretraining phase and serves as a foundation for downstream tasks, safety testing, and customization.
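Since the page lists the Transformers library, the checkpoint can presumably be loaded with the standard `AutoModel` API. The sketch below is illustrative only: the repository id is hypothetical, and the actual path should be taken from the AIKosh listing.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repository id, for illustration only; use the
# actual model path from the AIKosh page.
model_id = "bharatgenai/Param-1-SFT"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# bf16 matches the bf16-mixed precision used during training.
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

prompt = "भारत की राजधानी क्या है?"  # Hindi: "What is the capital of India?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```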
Pre-Training Details:
* Dataset: 7.5 trillion tokens
* Data Quality: Highly curated, with standard filtering and multiple processing steps
* Scheduler: Cosine annealing
* Learning Rate: 3e-4 to 3e-6
* Training Hardware: 512 H100 GPUs
* Framework: NVIDIA NeMo
* Precision: bf16-mixed
* Base Pre-Trained Checkpoint (Param 1): https://aikosh.indiaai.gov.in/home/models/details/bharatgen_param_1_indic_scale_bilingual_foundation_model.html

SFT Training Details:
* Dataset: 0.8 million samples
* Epochs: 3
* Scheduler: Cosine annealing
* Learning Rate: 5e-6 to 5e-8
* Training Hardware: 32 H200 GPUs
* Framework: NVIDIA NeMo
* Precision: bf16-mixed

A sketch of the learning-rate schedule follows the list above.
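Both phases anneal the learning rate along a cosine curve between the stated bounds. A minimal sketch of the SFT-phase schedule using PyTorch's built-in scheduler is shown below; the step count and optimizer are assumptions for illustration, since the card does not state warmup or total steps.

```python
import torch
from torch.optim.lr_scheduler import CosineAnnealingLR

model = torch.nn.Linear(8, 8)  # stand-in module, for illustration only

# SFT phase: cosine decay from 5e-6 down to 5e-8.
# T_max=1000 is an assumed step count, not from the card.
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-6)
scheduler = CosineAnnealingLR(optimizer, T_max=1000, eta_min=5e-8)

for step in range(1000):
    optimizer.step()   # a real training step would compute a loss here
    scheduler.step()   # anneal the learning rate toward eta_min
```

The pretraining phase follows the same shape with lr=3e-4 and eta_min=3e-6.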
License: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)