generative artificial intelligence (AI) startup, Sarvam AI is working towards launching its first commercial 'voice-to-voice endpoint' in the next six to twelve months, cofounder Vivek Raghavan said on Thursday at a session at an annual event for Software-as-a-Service (SaaS) startups, SaaSBoomi.
“You can expect these kinds of voice-to-voice endpoints in at least 10 (Indian) languages and you can expect some experiences built upon this and also there are example experiences in the sense that there are ways for people to build things on top of it,” Raghavan said.
He emphasised that the LLMs need to be voice-based LLMs which are also 'agentic and action-oriented.' They key thing, he said, is that it needs to work well in colloquial languages.
“Building Indic language models is important but that alone probably won't lead to the result we are looking for which is the widespread use of Gen AI in India,” Raghavan said. “Voice is the primary way by which people will access LLMs in India. We need to have voice-driven interfaces/systems, only then can we have accessibility to a large number of people.”
Sarvam AI released its first open-source Hindi language model called OpenHathi-Hi-0.1 in December