OpenAI Releases Updated Audio Models for Realtime API

OpenAI Releases Updated Audio Models for Realtime API
OpenAI has rolled out a significant update for voice application developers by introducing new audio model "snapshots" in the Realtime API. The list includes `gpt-4o-mini-transcribe-2025-12-15`, `gpt-4o-mini-tts-2025-12-15`, and `gpt-realtime-mini-2025-12-15`. This update addresses critical issues found in previous versions. Specifically, Automatic Speech Recognition (ASR) accuracy in noisy environments has been improved, and Text-to-Speech (TTS) quality has been significantly enhanced to sound even more natural and emotionally resonant.

Company engineers also note a reduction in "hallucinations" when transcribing long audio segments. For developers, this enables the creation of more reliable and responsive voice agents capable of conversing with minimal latency. The update is already available in the platform console and does not require architectural changes to existing applications—developers simply need to point to the new model IDs. Experts believe this move strengthens OpenAI's position in the conversational AI sector.

Source: OpenAI Developer Community
OpenAIRealtime APIDevToolsAudio Models
« Back to News List
Chat