Indian Flag
Government Of India
A-
A
A+
Latest insights & developments from the world of Artificial Intelligence(AI).
Compendium Publications
AI Models
Indic language
LLM
natural language processing (NLP)
Six interesting Indian AI models from 2024​
India's 2024 AI landscape saw six breakthrough models: BharatGen's e-vikrAI for e-commerce, Sarvam-1 supporting 10 Indian languages, NVIDIA's Nemotron-4-Mini-Hindi-4B, AI4Bharat's Chitralekha for video transcreation, Everest 1.0 covering 35 languages, and Surya OCR for document processing. These models integrate local languages and cultural context, setting global AI benchmarks.
Six interesting Indian AI models from 2024​
AIKOSHAIKOSH
  • See Upvoters4
  • Views72
  • Read Time1 min read
Research Article
Academic Research
artificial intelligence
BigData
computer vision
Data Science
Deep Learning
Machine Learning
Simulation
Supercomputing
Research combines solar astronomy with AI, helping in solar observations
Researchers at the University of Hawaiʻi have developed deep learning models to analyze data from the world's most powerful solar telescope, the NSF Inouye Solar Telescope. Part of the SPIn4D project, the AI models can map the sun's 3D atmosphere in near real-time, processing tens of terabytes of daily data. Trained on 120TB of simulated data, the models aim to improve solar storm prediction and space weather monitoring.
Research combines solar astronomy with AI, helping in solar observations
AIKOSHAIKOSH
  • See Upvoters2
  • Views17
  • Read Time1 min read
Expert Speaks
artificial intelligence
Digital India
Digital Transformation
Startup Funding
Startup Innovation
From Bengaluru to Boston: The global ascent of Indian AI startups in 2024
India's AI startup ecosystem is rapidly growing, ranking 3rd globally. With 77% of startups investing in AI, ML, IoT, and blockchain, and $560 million raised in 2024, India leads in AI confidence and spending. Tier II and III cities are emerging as innovation hubs, while government initiatives like Digital India further accelerate growth, positioning India to lead the next wave of global AI innovation.
From Bengaluru to Boston: The global ascent of Indian AI startups in 2024
AIKOSHAIKOSH
  • See Upvoters0
  • Views23
  • Read Time1 min read
Community Contributions
AI Research
Indic AI
LLM
NaturalLanguageProcessing
Open Source AI
Advancing Telugu NLP: Telugu LLM Labs with native and romanized datasets
Telugu LLM Labs, led by researchers from LlamaIndex, is advancing NLP for Telugu — a language with 100M+ speakers historically underrepresented in AI. The initiative creates open datasets in both native and Romanized Telugu scripts and fine-tunes LLMs like Llama 2, Mistral, and TinyLlama, setting a precedent for other regional Indian languages in AI development.
Advancing Telugu NLP: Telugu LLM Labs with native and romanized datasets
AIKOSHAIKOSH
  • See Upvoters0
  • Views58
  • Read Time1 min read
Solution Writeup
AI in Agriculture
computer vision
drone technology
Machine Learning
Precision Agriculture
AI in agriculture in 2025: Transforming Indian farms for a sustainable future
India's agricultural sector is being transformed by AI, with the global AI in agriculture market projected to grow at 23.1% CAGR, reaching USD 4.7 billion by 2028. AI tools enable precision farming, crop disease detection, automated weed control, and livestock monitoring. Government initiatives like Kisan e-Mitra Chatbot and AI Centres of Excellence are accelerating adoption across Indian farms.
AI in agriculture in 2025: Transforming Indian farms for a sustainable future
AIKOSHAIKOSH
  • See Upvoters0
  • Views32
  • Read Time1 min read
All Articles
AI Research
Indic AI
LLM
NaturalLanguageProcessing
Open Source AI
Advancing Telugu NLP: Telugu LLM Labs with native and romanized datasets
Telugu LLM Labs, led by researchers from LlamaIndex, is advancing NLP for Telugu — a language with 100M+ speakers historically underrepresented in AI. The initiative creates open datasets in both native and Romanized Telugu scripts and fine-tunes LLMs like Llama 2, Mistral, and TinyLlama, setting a precedent for other regional Indian languages in AI development.
Advancing Telugu NLP: Telugu LLM Labs with native and romanized datasets
AIKOSHAIKOSH
  • See Upvoters0
  • Views58
  • Read Time1 min read
AI4Bharat
Indian Datasets
Indic Languages
natural language processing (NLP)
Open Source AI
Speech translation
Synthetic Data
AI4Bharat unveils BhasaAnuvaad: Speech translation dataset in 13 languages
AI4Bharat launches BhasaAnuvaad, the largest speech translation dataset for Indian languages, covering 44,400 hours of audio across 13 languages including Hindi, Tamil, Telugu, and Bengali. It tackles India-specific challenges like code-switching and dialectal diversity. A synthetic benchmark, Indic-Spontaneous-Synth, is also introduced to test real-world translation model robustness.
AI4Bharat unveils BhasaAnuvaad: Speech translation dataset in 13 languages
AIKOSHAIKOSH
  • See Upvoters0
  • Views21
  • Read Time1 min read
Big Data
Huggingface
LLM
Synthetic Data
Cosmopedia: Redefining the synthetic data landscape with the largest open dataset
Cosmopedia v0.1, hosted on HuggingFace, is the largest open synthetic dataset with 30 million samples and 25 billion tokens, generated by Mixtral 7b. It includes textbooks, blog posts, stories, and WikiHow articles across eight dataset splits. Designed to democratize AI research, it supports NLP, model training, and scalable AI development with rich metadata and diverse content.
Cosmopedia: Redefining the synthetic data landscape with the largest open dataset
AIKOSHAIKOSH
  • See Upvoters0
  • Views14
  • Read Time1 min read