Wednesday, April 2, 2025

Deepgram Launches Aura, a Text-to-Speech API for Real-Time, Conversational Voice AI Agents

Related stories

Hacking the Hackers: How GenAI is Predicting and Preventing Cyber Attacks

In the high-stakes arena of cybersecurity, the rules of...

Veltris Acquires BPK to Boost AI & Digital in Healthcare

Veltris, a digital product engineering services provider backed by...

Accenture & Schaeffler Advance Industrial Humanoid Robots

Accenture has joined forces with Schaeffler AG to reshape...

EDGNEX Data Centers partners with Hyperco

The acquisition is expected to further strengthen the expansion...

Chef Robotics Raises $43M Series A to Scale AI Robotics

Chef Robotics, a leader in AI-powered robotic systems for...
spot_imgspot_img

With Aura, developers can now build production-grade, secure, and human-like voice AI applications that run faster and more efficiently than any other solution on the market

Deepgram, the leading provider of speech recognition, natural language processing, and generative AI solutions, announced the public release of Aura, a text-to-speech (TTS) API that delivers human-like quality conversation that is faster and more efficient compute-wise than all voice AI alternatives. Aura is designed for developers who want to build real-time, conversational voice AI agents that can interact with customers, employees, and other users in a natural and engaging way.

Deepgram believes that voice will become the predominant way we interact with technology – and for that to work, AI systems must be highly tuned to enable natural conversation at scale and with incredibly low latency. Aura can generate speech from any text input, including responses from LLMs like ChatGPT, in fractions of a second. This enables fluid and natural-sounding conversations with AI agents that can handle complex and dynamic scenarios. Aura offers a selection of diverse voices strongly suited for conversational use cases and preferences requiring the highest degrees of safety, security, speed, and scale.

Also Read: data.world’s Groundbreaking AI Context Engine™ Helps Teams Build AI-Powered Applications With Data and Organizational Knowledge

Aura perfectly complements Deepgram’s Nova-2 speech-to-text API, which provides industry-leading accuracy and transcription speed of audio streams and is implemented at global enterprises and organizations including Spotify, Citibank, NASA, and Twilio. With this release, Deepgram offers developers a complete voice AI platform, giving them the essential building blocks they need – from transcription to sentiment analysis to voice synthesis – to build high throughput, real-time AI agents of the future.

“We are thrilled to launch Aura, our text-to-speech API, to the public after seeing the overwhelming demand for our early access product in the fall. Aura is the result of years of research and development by our team of world-class AI scientists and engineers, who have leveraged the latest advances in deep learning and GPU technology to create a state-of-the-art TTS solution that outperforms anything else on the market,” said Scott Stephenson, CEO and co-founder of Deepgram. “With Aura, we are empowering developers to create voice AI applications that can truly understand and respond to human speech, opening up new possibilities for enhancing customer experience, productivity, and innovation.”

SOURCE: BusinessWire

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img