Monday, September 8, 2025

Baseten Raises $150M Series D to Advance Scalable, High-Performance AI Inference

Related stories

Cerence AI Brings Microsoft 365 Copilot to Cars for Work

Cerence, a global leader in conversational AI-powered user experiences,...

Smarsh Uses Salesforce Agentforce to Launch AI Support Agents

Smarsh, the global leader in communications data and intelligence,...

Redis to Acquire Decodable to Boost Real-Time Data Platform

Redis, the leading real-time data platform, announced its intention...

Ketryx Raises $39M Series B to Advance AI in Life Sciences

Ketryx, the AI-driven compliance platform that helps life sciences...

Snap Inc. Unveils Imagine Lens, Revolutionizing Creative Expression with AI

Snap Inc., the pioneer in camera technology and creative...
spot_imgspot_img

Baseten, a leader in AI infrastructure, announced the successful completion of a $150 million Series D funding round to accelerate the development of performant, reliable, and cost-efficient inference for AI applications. The round was led by BOND, with Jay Simons joining Baseten’s Board of Directors. New investors Conviction (Sarah Guo) and CapitalG (Jill Chase) also participated, alongside continued support from 01A, IVP, Spark, Greylock, Scribble Ventures, BoxGroup, and Premji Invest.

Baseten’s team has been focused on artificial intelligence for over 15 years, pioneering advances in machine learning from early factorization and ensemble models in competitions such as the Netflix Challenge to experimentation with OpenAI Gym. Over the years, the company explored high-impact AI applications, from disease progression prediction and fraud detection to generative art, ultimately seeking to build scalable AI infrastructure that could serve production-ready models.

Between 2019 and 2022, Baseten observed limited adoption of large-scale model serving, or inference, as teams focused primarily on feature stores, interpretability, and experimentation. The company maintained its conviction that AI products would eventually require fast, reliable, and flexible model-serving infrastructure, and strategically developed for that future with three core principles: fast models, interchangeable compute, and flexible, open, and Pythonic runtimes.

Also Read: Accenture Backs CLIKA to Expand Edge AI Capabilities

The market shifted dramatically in late 2022 with the launch of OpenAI’s ChatGPT and the growing adoption of open-source models such as Stable Diffusion and Whisper. These developments highlighted the need for scalable inference, transparency, and cost control, driving rapid adoption of AI applications across industries. Baseten positioned itself to meet this growing demand by providing high-performance infrastructure capable of running models quickly, reliably, and at scale.

“After more than six years building, we are uniquely positioned to solve today’s AI scaled inference problem,” said Baseten. “Our early investments in optimizing kernels, cold-starts, and cloud integration have resulted in the fastest, most flexible, and reliable inference infrastructure available. In 95% of head-to-head bakeoffs, we beat competitors by 40-50% in performance, and our infrastructure maintains many 9s of uptime across ten different clouds.”

Baseten’s platform empowers dynamic AI companies including Abridge, OpenEvidence, Clay, Mirage, Zed, Gamma, Sourcegraph, Writer, and Bland, enabling them to deliver high-value applications to hundreds of millions of users. By providing reliable, performant infrastructure, Baseten allows customers to focus on innovation rather than operational constraints.

The newly raised Series D funding will enable Baseten to expand its product offerings, accelerate hiring, and support the growing adoption of AI applications across industries. “This capital gives us the resources to pursue what we believe is the largest opportunity yet as AI becomes embedded in every part of our lives,” said the company.

Subscribe

- Never miss a story with notifications


    Latest stories

    spot_img