Tuesday, November 5, 2024

Deepgram Launches Aura, a Text-to-Speech API for Real-Time, Conversational Voice AI Agents

Related stories

Absci and Twist Bioscience Collaborate to Design Novel Antibody using Generative AI

Absci Corporation a data-first generative AI drug creation company, and...

GreyNoise Intelligence Discovers Zero-Day Vulnerabilities in Live Streaming Cameras with the Help of AI

GreyNoise Intelligence, the cybersecurity company providing real-time, verifiable threat...

Medidata Launches Bundled Solutions to Support Oncology and Vaccine Trials

Medidata, a Dassault Systèmes brand and leading provider of...

Blend Appoints Mike Mischel as SVP of AI Consulting

Blend, a leader in data science and AI-powered solutions,...

Patronus AI Launches Industry-First Self-Serve API for AI Evaluation and Guardrails

Patronus AI announced the launch of the Patronus API, the first...
spot_imgspot_img

With Aura, developers can now build production-grade, secure, and human-like voice AI applications that run faster and more efficiently than any other solution on the market

Deepgram, the leading provider of speech recognition, natural language processing, and generative AI solutions, announced the public release of Aura, a text-to-speech (TTS) API that delivers human-like quality conversation that is faster and more efficient compute-wise than all voice AI alternatives. Aura is designed for developers who want to build real-time, conversational voice AI agents that can interact with customers, employees, and other users in a natural and engaging way.

Deepgram believes that voice will become the predominant way we interact with technology – and for that to work, AI systems must be highly tuned to enable natural conversation at scale and with incredibly low latency. Aura can generate speech from any text input, including responses from LLMs like ChatGPT, in fractions of a second. This enables fluid and natural-sounding conversations with AI agents that can handle complex and dynamic scenarios. Aura offers a selection of diverse voices strongly suited for conversational use cases and preferences requiring the highest degrees of safety, security, speed, and scale.

Also Read: data.world’s Groundbreaking AI Context Engine™ Helps Teams Build AI-Powered Applications With Data and Organizational Knowledge

Aura perfectly complements Deepgram’s Nova-2 speech-to-text API, which provides industry-leading accuracy and transcription speed of audio streams and is implemented at global enterprises and organizations including Spotify, Citibank, NASA, and Twilio. With this release, Deepgram offers developers a complete voice AI platform, giving them the essential building blocks they need – from transcription to sentiment analysis to voice synthesis – to build high throughput, real-time AI agents of the future.

“We are thrilled to launch Aura, our text-to-speech API, to the public after seeing the overwhelming demand for our early access product in the fall. Aura is the result of years of research and development by our team of world-class AI scientists and engineers, who have leveraged the latest advances in deep learning and GPU technology to create a state-of-the-art TTS solution that outperforms anything else on the market,” said Scott Stephenson, CEO and co-founder of Deepgram. “With Aura, we are empowering developers to create voice AI applications that can truly understand and respond to human speech, opening up new possibilities for enhancing customer experience, productivity, and innovation.”

SOURCE: BusinessWire

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img