APIMay 27, 2026
OpenAI Realtime API Multimodal Upgrade: Real-Time Voice + Vision + Text Triad, AI Applications Enter a New Era
OpenAI updates the Realtime API, now supporting simultaneous voice input, image input, and text output with latency reduced to under 200ms. This will fundamentally transform the development of AI customer service, real-time translation, video analysis, and other applications.
Also available in 中文.