Indian Flag
Government Of India
A-
A
A+

shuka-v1

Multilingual audio to text model

About Model

Shuka v1 is an innovative audio understanding model for Indic languages, combining Saaras v1 encoder and Meta's Llama3-8B-Instruct as the decoder. Trained on less than 100 hours of data, it outperforms larger models in audio-based question-answering tasks and supports fine-tuning for customized use cases. Shuka v1 is available open-source, marking the start of advancements in audio language models for Indic languages.

shuka-v1

Metadata Metadata

CC0 1.0 Public Domain

sarvamai

Audio-to-text

Transformers

Open

Sarvam AI

Science, Technology and Research

24/02/25 07:45:11

0

Activity Overview Activity Overview

  • Downloads3
  • Redirect 96
  • Views 1,843
  • File Size 0

Tags Tags

  • audio-llms

License Control License Control

CC0 1.0 Public Domain

More Models from Sarvam AI More Models from Sarvam AI

sarvam-30B
Sarvam-30B is an advanced Mixture-of-Experts (MoE) model with 2.4B non-embedding active parameters, designed primarily for practical deployment. It combines strong reasoning, reliable coding ability, and best-in-class conversational quality across Indian languages. Sarvam-30B is built to run reliably in resource-constrained environments and can handle multilingual voice calls while performing tool calls.
MoE Model
  • See Upvoters4
  • Downloads128
  • File Size59.92 GB
  • Views1,748
Updated 1 month(s) ago

SARVAM AI

sarvam-105b
Sarvam-105B is an advanced Mixture-of-Experts (MoE) model with 10.3B active parameters, designed for superior performance across a wide range of complex tasks. It is highly optimized for complex reasoning, with particular strength in agentic tasks, mathematics, and coding.
region:us
license:apache-2.0
bo
ks
sat
mni
doi
mai
kok
sd
ne
sa
ur
as
or
pa
ml
kn
gu
mr
te
ta
bn
hi
en
custom_code
conversational
Text Generation
sarvam_mla
Transformers
safetensors
  • See Upvoters8
  • Downloads394
  • File Size0
  • Views6,368
Updated 1 month(s) ago

SARVAM AI

sarvamM
Multilingual, hybrid-reasoning, text-only language model built on Mistral-Small
pa
Transformers
safetensors
mistral
Text Generation
conversational
en
bn
hi
kn
gu
mr
ml
or
ta
te
base_model:mistralai/Mistral-Small-3.1-24B-Base-2503
base_model:finetune:mistralai/Mistral-Small-3.1-24B-Base-2503
license:apache-2.0
autotrain_compatible
text-generation-inference
endpoints_compatible
region:us
  • See Upvoters1
  • Downloads59
  • File Size0
  • Views973
Updated 9 month(s) ago

SARVAM AI

sarvamtranslate
Translation model for 22 Indian Languages
kn
en
gu
gom
doi
brx
bn
as
Translation
image-text-to-text
gemma3
ks
hi
Transformers
safetensors
region:us
endpoints_compatible
text-generation-inference
license:gpl-3.0
base_model:finetune:google/gemma-3-4b-it
base_model:google/gemma-3-4b-it
ur
te
ta
sd
sat
sa
pa
or
ne
mr
mni
ml
mai
  • See Upvoters1
  • Downloads73
  • File Size0
  • Views1,162
Updated 9 month(s) ago

SARVAM AI

sarvam-1
India's first indic model, pretrained on 4 trillion tokens
  • See Upvoters2
  • Downloads238
  • File Size0
  • Views4,467
Updated 1 year(s) ago

SARVAM AI

shuka-v1
Multilingual audio to text model
audio-llms
  • See Upvoters3
  • Downloads96
  • File Size0
  • Views1,844
Updated 1 year(s) ago

SARVAM AI