Articles

Compendium Publications

Cancer Care

Compendium

NCG Catch Compendium 2025

The IndiaAI–NCG CATCH Grant for Cancer 2026 compendium documents ten winning AI solutions selected through a competitive grant program jointly run by IndiaAI (MeitY) and the National Cancer Grid's Koita Centre for Digital Oncology. It profiles AI tools spanning cancer screening, diagnostics, clinical decision support, patient engagement, and operational efficiency, ranging from pilot-ready to commercially deployed systems. Each solution is evaluated against criteria including clinical relevance, technical maturity, data governance, DPDP compliance, and deployment feasibility within Indian hospital settings. The compendium also covers the selection methodology, jury composition, responsible AI principles, and key learnings from the challenge, serving as a reference for hospitals, policymakers, and innovators seeking to adopt or scale AI-enabled cancer care across India.

AIKOSH

0
9
1 min read

View Article

Research Article

View All

ContrastiveLearning and ZeroShotClassification

CrossModal and ContrastiveLearning

LLM

Multimodal AI

InfoNCE and Beyond: The Evolution of Multimodal Representation Learning

Contrastive learning has become the dominant paradigm for aligning heterogeneous modalities in multimodal large language models (MLLMs). Information Noise Contrastive Estimation (InfoNCE) Loss and its variants serve as the foundational training objectives for such models. As models scale to billions of parameters and training batches reach millions of samples, the limitations of conventional softmax-based contrastive losses have become increasingly apparent: quadratic memory growth, batch size sensitivity, semantic noise in negative sampling, and the inherent tension between alignment and uniformity. This paper presents a comprehensive survey of the evolution from InfoNCE to next-generation objectives for multimodal LLMs. We analyze the theoretical foundations of contrastive learning through the alignment-uniformity framework, examine the practical innovations that have enabled unprecedented batch scaling, and evaluate emerging alternatives including sigmoid-based losses, non-contrastive alignment methods, and hybrid generative-discriminative objectives.

Prateek Khanna

0
6
4 min read

View Article

Expert Speaks

View All

Conversational

machine Learning

The Case for the Vertical Stack

Why owning both the silicon and the software is the only durable moat in the age of AI, and why the margin arithmetic of fragmented supply chains is finally breaking down. By Dhritiman Mallick · Founder & CEO, Vyuhaa Med Data

Dhritiman Mallick

0
3
1 min read

View Article

Community Contributions

View All

Bhashini

Digital India

geopolitics

IndiaAI

China’s Open-Source AI Surge and India’s Strategic Position in the Emerging Multipolar AI Order

The article explores how China is aggressively advancing its geopolitical influence by releasing powerful, open-source AI models, making high-tier AI capabilities freely and globally accessible. Contrasts this with the proprietary, closed-source approaches dominated by US tech giants. The piece analyzes India's strategic position within this shifting, multipolar AI ecosystem, arguing that India must carefully balance its infrastructure dependencies, leverage open-source breakthroughs for localized economic growth, and craft its own independent AI policies to avoid being caught in a digital duopoly between the US and China.

Ashish Chadha

1
33
1 min read

View Article

Solution Writeup

View All

AI for Governance

computer vision

Digital Transformation

food security

procurement

AI-Powered Image Validation in PCSAP: Enhancing Transparency and Efficiency in India’s Grain Procurement Syste

This article highlights how AI-powered image validation in PCSAP is improving India’s grain procurement monitoring system. With thousands of procurement centers and nearly 14 lakh photographs uploaded every season, manual verification had become difficult and time-consuming. The AI system validates images in real time by checking location, timestamp, quality, duplication, and content relevance. This helps improve transparency, reduce manual effort, enhance data accuracy, and support faster decision-making in procurement center assessments. The solution has been piloted in Punjab, Haryana, and Odisha, with potential for wider use in government monitoring and compliance programs.

Subham Pandey

0
7
1 min read

View Article

All Articles

View All

ContrastiveLearning and ZeroShotClassification

CrossModal and ContrastiveLearning

LLM

Multimodal AI

InfoNCE and Beyond: The Evolution of Multimodal Representation Learning

Prateek Khanna

0
6
4 min read

View Article

Deep Learning

Large Language Model

Mixture of Experts

peft

Continuously Learning Large Language Models

Large language models (LLMs) have achieved remarkable performance across diverse tasks, yet they remain fundamentally static: once trained, their knowledge is frozen, unable to adapt to evolving information, emerging domains, or shifting user needs without expensive retraining. This limitation - catastrophic forgetting - has become a critical bottleneck as LLMs transition from research artifacts to living systems deployed in dynamic real-world environments. This paper presents a comprehensive survey and unified framework for continuously learning LLMs (CL-LLMs), organizing the field into five complementary paradigms: replay-based methods, regularization-based approaches, parameter-efficient adaptation, architectural isolation, and inference-time continual learning.

Prateek Khanna

0
6
3 min read

View Article

Hardware

inference endpoints

sovereign-ai

Mapping Sovereign AI Models to Sovereign Inference Chips: A Model-Driven Co-Design Framework

The proliferation of sovereign large language models (LLMs) trained on indigenous languages, cultural contexts, and national datasets has created an urgent need for inference hardware that preserves the unique characteristics of these models while achieving deployment efficiency. Unlike generic AI accelerators optimized for English-centric transformer models, sovereign inference chips must handle multilingual tokenization, mixed-script processing, sparse mixture-of-experts (MoE) architectures, and voice-first interaction patterns that dominate local language AI deployments. This paper presents a model-driven co-design framework that maps the specific computational signatures of sovereign AI models directly to custom inference chip architectures.

Prateek Khanna

0
4
4 min read

View Article

Accessibility options by UX4G

AIKosh

Resources

Support