Bengaluru-based startup Sarvam AI has been making waves in the artificial intelligence (AI) community globally with its latest innovations, Sarvam Vision and Bulbul V3. The AI model has apparently outperformed global giants like Google Gemini and ChatGPT in key areas in optical character recognition (OCR). In a post on X (formerly Twitter), the co-founder Pratyush Kumar claimed that Sarvam Vision achieved 84.3% accuracy on olmOCR-Bench, surpassing Gemini 3 Pro and DeepSeek OCR v2, and 93.28% on OmniDocBench v1.5. And when it comes to Bulbul V3, its text-to-speech model supports 35 voices, with the sample set distributed across 22 official Indian languages, from 1800 to the present. It also has different quality of scans and content. “On Indian languages, Sarvam Vision is the best model by far, while supporting all 22 scheduled Indian languages,” Kumar claimed.
Rahul Gandhi Speech | “President Trump, Talk To Us As Equals”: Rahul Gandhi’s Dig At Centre
Congress MP Rahul Gandhi today outlined what the Opposition bloc INDIA would say to US President Donald Trump on the latest trade deal, which Congress leader Shashi Tharoor had labelled…
