Monday, November 25, 2024

Patronus AI Raises $17 million To Detect LLM Mistakes at Scale

Related stories

Deep Instinct Expands Zero-Day Security to Amazon S3

Deep Instinct, the zero-day data security company built on...

Foxit Unveils AI Assistant in Admin Console

Foxit, a leading provider of innovative PDF and eSignature...

Instabase Names Junie Dinda CMO

Instabase, a leading applied artificial intelligence (AI) solution for...
spot_imgspot_img

Series A financing led by Glenn Solomon at Notable Capital underscores urgent need for companies to deploy large language models with confidence

Patronus AI announced it is raising a $17 million Series A round, bringing the total amount raised to $20 million. The financing was led by Glenn Solomon at Notable Capital with participation from Lightspeed Venture Partners, Datadog, Gokul Rajaram, Factorial Capital, and several leading software and AI executives.

Founded by machine learning experts from Meta, Anand Kannappan and Rebecca Qian, Patronus AI is the first automated evaluation and security platform that helps companies detect large language model (LLM) mistakes at scale. Using proprietary AI, the platform enables enterprise development teams to score model performance, generate adversarial test cases, benchmark models and more. Patronus AI automates and scales the manual and costly model evaluation methods prevalent in the enterprise today, enabling organizations to confidently deploy LLMs while minimizing the risk of model failures and misaligned outputs.

“Model hallucinations and safety risks are here to stay,” Anand Kannappan, CEO, Patronus AI. “What enterprises need is transparency into model performance and accuracy in order to circumvent risks. For the first time, we’re giving companies a way to truly understand what they are working with so they can deploy LLMs with confidence. It’s the only path to innovation.”

Also Read: CoreWeave Secures $7.5 Billion Debt Financing Facility led by Blackstone and Magnetar

Since the company’s official launch in late 2023, the company has released industry research shedding light on the pervasive accuracy and safety problems inherent with LLM use. The first is FinanceBench, the industry’s first benchmark for LLM performance on financial questions. The research revealed that popular models, like OpenAI’s GPT-4-Turbo, only got 19 percent of answers right when reading an entire SEC filing. Patronus AI also released CopyrightCatcher, the first Copyright Detection API for LLMs, showing that LLMs are capable of plagiarizing content without the proper guardrails.

In addition, a number of large Fortune 500 companies already use Patronus AI to implement their generative AI initiatives safely within their organizations. Patronus AI plans to use the additional capital to scale its AI research, engineering, and sales teams, train evaluation models, and develop new industry standard benchmarks.

“The intense interest among enterprises to bring LLM-based applications to market makes this one of the fastest growing markets we’ve ever seen at Notable Capital. Despite this swelling demand, the persistent issues of accuracy and safety in large language models pose risks that large enterprises cannot afford to take,” said Glenn Solomon, Managing Partner, Notable Capital. “Patronus AI is not only tackling this issue with their industry-first platform, their groundbreaking research is improving the reliability of AI outputs and setting new industry benchmarks for responsible AI use. Anand and Rebecca’s background in AI research are precisely what the industry needs at this critical juncture, and we’re thrilled to help them scale their solutions to any organization in need.”

Source: PRNewsWire

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img