Sunday, December 22, 2024

DataRobot Collaborates with IMDA to Make LLM Evaluation More Accessible to AI Builders with Project Moonshot

Related stories

Doc.com Expands AI developments to Revolutionize Healthcare Access

Doc.com, a pioneering healthcare technology company, proudly announces the development...

Amesite Announces AI-Powered NurseMagic™ Growth in Marketing Reach to Key Markets

Amesite Inc., creator of the AI-powered NurseMagic™ app, announces...

Quantiphi Joins AWS Generative AI Partner Innovation Alliance

Quantiphi, an AI-first digital engineering company, has been named...
spot_imgspot_img

DataRobot, the enterprise AI platform leader, announced the integration of LLM evaluation measures aligned with a new initiative from the Singapore Government Agency, Infocomm Media Development Authority (IMDA). The “Project Moonshot” initiative unveiled at the Singapore Asia Tech x Summit offers new capabilities that help AI practitioners and system owners manage LLM deployment risks by providing a common framework for benchmarking and red teaming evaluation.

“At DataRobot, our focus is addressing the confidence gap and helping organizations scale responsible use of generative AI,” said Jay Schuren, Chief Customer Officer, DataRobot. “We’re excited to announce that our latest product release incorporates Project Moonshot’s testing toolkit and its benchmarking and evaluation tests. The result is that LLM evaluations are more accessible and help scale the responsible use of generative AI, enabling practitioners to turn on and configure guard models to change the behavior and responses of LLMs.”

Also Read: xAI announces series B funding round of $6 billion

Project Moonshot delivers three core capabilities for AI practitioners and system owners:

  1. Automated evaluation tools for generative AI solutions that easily integrate into CI/CD pipelines.
  2. A benchmark repository allowing teams to run evaluations relevant to their applications by curating the right benchmarks.
  3. A one-stop tool for AI red teaming, from jailbreaks to customized attacks.

“The development of Project Moonshot, one of the world’s first open-source tools to bring red teaming, benchmarking and baseline testing together in an easy-to-use platform, would not have been possible without the contribution of partners such as DataRobot,” said Dr. Ong Chen Hui, Chair of the Governing Committee, AI Verify Foundation. “Project Moonshot will provide developers with an intuitive toolkit to test their LLM applications. This new toolkit signals Singapore’s continued commitment to advance the global open-source efforts toward addressing generative AI safety concerns.”

“We are proud to support our portfolio company, DataRobot, on its growth journey in Southeast Asia,” said Paul Ng, Chief Executive Officer, EDBI. “The company has not only expanded its footprint in Singapore but has also fostered collaborations that benefit the local innovation ecosystem, such as this partnership with IMDA. Project Moonshot provides local enterprises with the right tools to deploy generative AI technologies confidently. As a strategic investor, we are committed to creating value for our portfolio companies while enhancing Singapore’s innovation capabilities.”

Source: DataRobot

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img