large language models (LLMs) such as Sarvam’s OpenHathi and Ola’s Krutrim. This even as their global counterparts like OpenAI’s GPT-4o and Meta’s Llama have started showing enterprise grade support for Hindi, Marathi, Kannada and other Indian languages across text and voice.
Union Budget 2024 Live Updates
Budget 2024 Income Tax Slab ChangesLive Updates
This development, though inevitable, raises questions about the role of Indian LLMs and what problems they can solve. Building an Indic AI model from scratch is fraught with challenges. There is not enough training data, the cost of compute is humongous, and, most importantly, funding is not easy as there is a lack of clarity on use cases. Is there, however, a need for Indian LLMs? Absolutely, say experts.
Indian LLMs
The whole narrative around large language models, referred to as foundation/frontier models – pertinent for GenAI – is currently driven by the models developed by the US tech industry such as ChatGPT and Gemini. This may be concerning, experts said. “We have a huge reliance on the top 4-5 models, which are trained on Western data from North America and Western Europe, and, therefore, they embed cultural values from that region,” said James Landay, cofounder, Stanford