Wednesday, December 18, 2024

Deepgram Launches Aura, a Text-to-Speech API for Real-Time, Conversational Voice AI Agents

Related stories

Ambient Scientific unveils first ever AI module powered by a coin cell battery

 Company's first ultra-low power AI module will be commercially...

Windward & Dataminr Partner for AI Real-Time Maritime Alerts

The partnership will integrate Dataminr's real-time AI insights into...

Prequel raises $3.3M to boost app reliability

Prequel, the community-driven problem detection and management platform for...

Cohu Boosts Data Analytics and AI/ML with Tignis Acquisition

Cohu, Inc., a global supplier of equipment and services...

Lit Protocol Launches Decentralized AI Agent Stack

Lit Protocol, which empowers developers to secure, manage, and...
spot_imgspot_img

With Aura, developers can now build production-grade, secure, and human-like voice AI applications that run faster and more efficiently than any other solution on the market

Deepgram, the leading provider of speech recognition, natural language processing, and generative AI solutions, announced the public release of Aura, a text-to-speech (TTS) API that delivers human-like quality conversation that is faster and more efficient compute-wise than all voice AI alternatives. Aura is designed for developers who want to build real-time, conversational voice AI agents that can interact with customers, employees, and other users in a natural and engaging way.

Deepgram believes that voice will become the predominant way we interact with technology – and for that to work, AI systems must be highly tuned to enable natural conversation at scale and with incredibly low latency. Aura can generate speech from any text input, including responses from LLMs like ChatGPT, in fractions of a second. This enables fluid and natural-sounding conversations with AI agents that can handle complex and dynamic scenarios. Aura offers a selection of diverse voices strongly suited for conversational use cases and preferences requiring the highest degrees of safety, security, speed, and scale.

Also Read: data.world’s Groundbreaking AI Context Engine™ Helps Teams Build AI-Powered Applications With Data and Organizational Knowledge

Aura perfectly complements Deepgram’s Nova-2 speech-to-text API, which provides industry-leading accuracy and transcription speed of audio streams and is implemented at global enterprises and organizations including Spotify, Citibank, NASA, and Twilio. With this release, Deepgram offers developers a complete voice AI platform, giving them the essential building blocks they need – from transcription to sentiment analysis to voice synthesis – to build high throughput, real-time AI agents of the future.

“We are thrilled to launch Aura, our text-to-speech API, to the public after seeing the overwhelming demand for our early access product in the fall. Aura is the result of years of research and development by our team of world-class AI scientists and engineers, who have leveraged the latest advances in deep learning and GPU technology to create a state-of-the-art TTS solution that outperforms anything else on the market,” said Scott Stephenson, CEO and co-founder of Deepgram. “With Aura, we are empowering developers to create voice AI applications that can truly understand and respond to human speech, opening up new possibilities for enhancing customer experience, productivity, and innovation.”

SOURCE: BusinessWire

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img