OpenAI has updated its Realtime API with three new model snapshots aimed at improving transcription, speech synthesis, and function calling accuracy. The gpt-4o-mini-transcribe model reduces hallucinations by 89% compared to whisper-1. For text-to-speech applications, the gpt-4o-mini-tts variant cuts the word error rate by 35%, improving clarity and accuracy. The gpt-realtime-mini model shows a 22% improvement in instruction adherence and a 13% boost in function calling, making it well suited to voice assistant applications. OpenAI has also expanded support for languages including Chinese, Japanese, Indonesian, Hindi, Bengali, and Italian, broadening global accessibility. Together, these updates underscore OpenAI's focus on reliable, efficient audio processing across a range of applications.
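To illustrate where these snapshots fit into the existing SDK surface, here is a minimal sketch using the official openai Python package; the audio file name, voice choice, and sample text are placeholder assumptions, while the model identifiers are the snapshot names quoted above.

```python
# Minimal sketch (assumes the `openai` Python SDK is installed and
# OPENAI_API_KEY is set; "meeting.wav" and the "alloy" voice are placeholders).
from openai import OpenAI

client = OpenAI()

# Transcription with the snapshot aimed at fewer hallucinations.
with open("meeting.wav", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="gpt-4o-mini-transcribe",
        file=audio_file,
    )
print(transcript.text)

# Text-to-speech with the lower word-error-rate snapshot.
speech = client.audio.speech.create(
    model="gpt-4o-mini-tts",
    voice="alloy",
    input="Your order has shipped and should arrive on Thursday.",
)
speech.write_to_file("reply.mp3")
```

The gpt-realtime-mini snapshot, by contrast, is used through the Realtime (WebSocket/WebRTC) interface for live voice-assistant sessions rather than one-shot calls like those above.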