Indian Flag
Government Of India
A-
A
A+

shuka-v1

Multilingual audio to text model

About Model

Shuka v1 is an innovative audio understanding model for Indic languages, combining Saaras v1 encoder and Meta's Llama3-8B-Instruct as the decoder. Trained on less than 100 hours of data, it outperforms larger models in audio-based question-answering tasks and supports fine-tuning for customized use cases. Shuka v1 is available open-source, marking the start of advancements in audio language models for Indic languages.

shuka-v1

Metadata Metadata

CC0 1.0 Public Domain

sarvamai

Audio-to-text

Transformers

Open

Sarvam AI

Science, Technology and Research

24/02/25 07:45:11

0

Activity Overview Activity Overview

  • Downloads3
  • Redirect 106
  • File Size 0
  • Views 2,072

Tags Tags

  • audio-llms

License Control License Control

CC0 1.0 Public Domain

More Models from Sarvam AI More Models from Sarvam AI

sarvam-30B
Sarvam-30B is an advanced Mixture-of-Experts (MoE) model with 2.4B non-embedding active parameters, designed primarily for practical deployment. It combines strong reasoning, reliable coding ability, and best-in-class conversational quality across Indian languages. Sarvam-30B is built to run reliably in resource-constrained environments and can handle multilingual voice calls while performing tool calls.
MoE Model
  • See Upvoters4
  • Downloads150
  • File Size59.92 GB
  • Views2,288
Updated 2 month(s) ago

SARVAM AI

sarvam-105b
Sarvam-105B is an advanced Mixture-of-Experts (MoE) model with 10.3B active parameters, designed for superior performance across a wide range of complex tasks. It is highly optimized for complex reasoning, with particular strength in agentic tasks, mathematics, and coding.
license:apache-2.0
Transformers
sd
safetensors
Text Generation
conversational
gu
ta
en
or
bn
te
pa
hi
mr
ml
kn
custom_code
as
doi
mni
ne
ur
sat
mai
ks
sa
region:us
sarvam_mla
bo
kok
  • See Upvoters8
  • Downloads474
  • File Size0
  • Views7,788
Updated 2 month(s) ago

SARVAM AI

sarvamM
Multilingual, hybrid-reasoning, text-only language model built on Mistral-Small
region:us
Transformers
safetensors
endpoints_compatible
Text Generation
text-generation-inference
conversational
gu
ta
en
or
bn
te
pa
hi
mr
ml
kn
autotrain_compatible
base_model:finetune:mistralai/Mistral-Small-3.1-24B-Base-2503
base_model:mistralai/Mistral-Small-3.1-24B-Base-2503
mistral
license:apache-2.0
  • See Upvoters1
  • Downloads65
  • File Size0
  • Views1,167
Updated 10 month(s) ago

SARVAM AI

sarvamtranslate
Translation model for 22 Indian Languages
base_model:finetune:google/gemma-3-4b-it
brx
region:us
Transformers
Translation
sd
safetensors
endpoints_compatible
image-text-to-text
text-generation-inference
gu
ta
en
or
bn
te
pa
hi
mr
ml
kn
gemma3
as
doi
mni
ne
ur
sat
mai
gom
base_model:google/gemma-3-4b-it
license:gpl-3.0
ks
sa
  • See Upvoters1
  • Downloads92
  • File Size0
  • Views1,463
Updated 10 month(s) ago

SARVAM AI

sarvam-1
India's first indic model, pretrained on 4 trillion tokens
  • See Upvoters3
  • Downloads259
  • File Size0
  • Views5,067
Updated 1 year(s) ago

SARVAM AI

shuka-v1
Multilingual audio to text model
audio-llms
  • See Upvoters3
  • Downloads106
  • File Size0
  • Views2,073
Updated 1 year(s) ago

SARVAM AI