Models · Mar 28, 2025
GPT-4o Real-Time Voice + Function Calling: Agents Enter a New Era of 'Listen, Speak, Act'
OpenAI has significantly upgraded the GPT-4o Realtime API, adding function calling in the middle of a conversation. Agents can now query databases, call tools, and execute code during a live voice conversation, with the results flowing back into the dialogue. This breakthrough removes the final barrier for voice agents: they are no longer just 'talkative assistants' but executors that can complete orders, check balances, and control systems in real time during a call. Several voice SaaS providers in China have announced integration.
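To make the 'listen, speak, act' loop concrete, here is a minimal sketch of the act step: the model asks for a tool by name, the application runs it, and the result is packaged to go back into the conversation. The event shape, the `check_balance` tool, and the account data are all hypothetical and chosen for illustration; this is not the actual Realtime API schema, and no network connection is made.

```python
import json
from typing import Any, Callable, Dict


def check_balance(account_id: str) -> Dict[str, Any]:
    """Hypothetical tool: look up a balance in an in-memory table."""
    balances = {"acct-001": 125.50}  # illustrative data only
    return {"account_id": account_id, "balance": balances.get(account_id)}


# Registry mapping tool names the model may request to local Python functions.
TOOLS: Dict[str, Callable[..., Dict[str, Any]]] = {
    "check_balance": check_balance,
}


def handle_function_call(event: Dict[str, Any]) -> Dict[str, Any]:
    """Run the requested tool and build the message returned to the model.

    `event` is an assumed shape: a tool name, a JSON string of arguments,
    and a call id used to match the result to the request.
    """
    name = event["name"]
    args = json.loads(event["arguments"])
    tool = TOOLS.get(name)
    if tool is None:
        output: Dict[str, Any] = {"error": f"unknown tool: {name}"}
    else:
        output = tool(**args)
    return {
        "type": "function_call_output",
        "call_id": event["call_id"],
        "output": json.dumps(output),
    }


if __name__ == "__main__":
    # Simulated mid-call request, as if the caller had asked for a balance.
    request = {
        "type": "function_call",
        "call_id": "call_1",
        "name": "check_balance",
        "arguments": '{"account_id": "acct-001"}',
    }
    print(handle_function_call(request))
```

In a real integration the returned message would be sent back over the live session so the model can speak the answer; here it is only printed. Returning an error object for an unknown tool, rather than raising, lets the model explain the failure to the caller instead of dropping the call.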