India’s ambition to develop a sovereign artificial intelligence framework tailored for its linguistic and cultural diversity is gaining momentum. BharatGen, the government-backed national AI initiative, is at the heart of this vision, aiming to create foundational AI models for all 22 scheduled Indian languages by June 2026. This strategic move will not only bridge the linguistic digital divide but also enhance India’s technological independence.
Current Language Coverage and Expansion Plan
At present, BharatGen covers nine major Indian languages: Hindi, Marathi, Tamil, Malayalam, Bengali, Punjabi, Gujarati, Telugu, and Kannada. By December 2025, this list is set to expand to 15 languages, with Assamese, Maithili, Nepali, Odia, Sanskrit, and Sindhi joining the roster. The complete rollout to include all 22 scheduled languages is targeted for mid-2026.
Technological Scope and Applications
BharatGen’s AI capabilities span multiple modalities, including:
Large Language Models (LLMs) for text
Text-to-Speech (TTS) systems
Automatic Speech Recognition (ASR)
Vision-Language systems
The initiative has already developed pilot applications for agriculture, governance, and defence. These will be implemented nationwide after the full deployment of the platform.
Organisational Structure and Leadership
BharatGen operates under the Department of Science and Technology’s National Mission on Interdisciplinary Cyber-Physical Systems (NM-ICPS). The TIH Foundation for IoT and IoE at IIT Bombay acts as the central hub, managing execution, academic collaboration, and ecosystem partnerships for compute resources, data, and talent.
IITM Pravartak Technologies Foundation at IIT Madras plays a critical role as an implementation partner, focusing on governance, security, and media-oriented applications.
Key Consortium Members and Contributions:
IIT Bombay: Lead institution overseeing research and integration
IIIT Hyderabad: Vision-language document modelling
IIT Madras: Speech model development and evaluation
IIT Kanpur: Legal AI research and multilingual tokenisation strategies
IIT Hyderabad: Vocabulary optimisation for multilingual LLMs
IIT Mandi: Inclusive multilingual model development and efficient training methods
IIM Indore: Bharat-centric benchmarking and multilingual data collection
Government Vision and Future Deployment
Union Minister Dr Jitendra Singh confirmed that BharatGen is still in its pilot phase and not yet accessible to the public. However, plans are in place to deploy the system across all states and districts once fully operational. The government is also exploring potential collaborations with additional research institutions in Karnataka.
Conclusion
BharatGen represents a milestone in India’s AI journey, aiming to empower millions of citizens by enabling advanced AI capabilities in their native languages. With a clear roadmap and strong institutional backing, the initiative promises to transform how AI serves India’s diverse population, reinforcing technological sovereignty and inclusivity.
Follow Before You Take on:
Latest Technology News | Updates | Latest Electric Vehicle News | Updates | Electronics News | Mobile News | Updates | Software Updates
📌 Facebook | 🐦 Twitter | 📢 WhatsApp Channel | 📸 Instagram | 📩 Telegram | 💬 Threads | 💼 LinkedIn | 🎥 YouTube
🔔 Stay informed, Stay Connected!






































































































