With releases like Operator, Deep Research, Computer-Using Agents, and the Responses API with built-in tools, we’ve invested ...
The text-to-speech and speech-to-text tools are all based on GPT-4o. OpenAI hinted it may take a similar path with video.
OpenAI has unveiled new audio models to revolutionize the development of voice agents by enhancing speech-to-text and text-to-speech capabilities.
OpenAI introduced three new AI models: gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts.
OpenAI's latest generative AI advancements redefine real-time speech capabilities, offering enhanced voice interaction for ...
OpenAI is introducing new speech-to-text and text-to-speech models via the API. This enables developers to build speech ...
OpenAI offers new text-to-speech and speech-to-text models in the API. These are designed to outperform Whisper.
All new models are now accessible to developers via OpenAI's API. Additionally, OpenAI has integrated these models with its ...
OpenAI announced new AI audio models with new capabilities for developers that want to include human-like voices in their ...
OpenAI has launched new speech-to-text and text-to-speech models in its API, providing developers with tools to build ...
The latest upgrades to voice models are an effort to make AI agents more useful by allowing deeper, more intuitive interactions ...
In one example, an AI voice with the persona of a medieval knight gave driving directions to a bakery. Here's how that can be helpful.