Google and Cohere Unveil Innovative Audio AI Models

March 27, 2026

Google LLC and Cohere Inc. have each unveiled a new AI model for audio processing. Google’s Gemini 3.1 Flash Live is built to automate customer service interactions and adapts its responses to user emotions such as frustration. It can power voice agents for tasks such as product returns, and it can process images alongside audio, for example when troubleshooting a smart device. It scored 90.8% on the ComplexFuncBench Audio benchmark, a substantial improvement over previous versions.

Cohere Transcribe focuses on high-accuracy transcription. Its 5.42% average word error rate places it first on the Hugging Face Open ASR Leaderboard. Built on the Conformer architecture, it converts audio to transcripts in over a dozen languages. Cohere Transcribe is available under an open-source license, so companies can run it on their own infrastructure or access it through Cohere’s Model Vault.
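For readers unfamiliar with the metric, word error rate is the number of word-level substitutions, deletions and insertions needed to turn a model's transcript into the reference transcript, divided by the number of words in the reference. The Python sketch below illustrates that definition only. The function name `word_error_rate` is hypothetical, and the plain whitespace tokenization is an assumption; the article does not describe how the leaderboard normalizes text before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are split on whitespace with no further
    normalization (casing, punctuation), which real evaluations
    usually apply before scoring.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into nothing = i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    hypothesis = "the cat sat on a mat"
    print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
```

Read this way, an average word error rate of 5.42% means roughly five to six word errors per 100 reference words.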

Together, the two releases highlight significant advances in AI-driven audio processing, giving businesses across industries new options for both real-time voice agents and high-accuracy transcription.
