SambaNova Cloud runs Llama 3.1405B at 132 tokens per second at full precision – available to developers now
SambaNova Systems, provider of the fastest and...
Enhancing the protection and performance of enterprise AI inference solutions with F5 NGINX Plus, Intel OpenVINO, and Intel IPUs
F5 announced it is bringing robust...
Untether AI®, the leader in energy-centric AI inference acceleration, announced the availability of early access (EA) of its imAIgine® Software Development Kit (SDK) supporting the speedAI® inference...
Baseten, the leader in high-performance AI inference, is excited to announce the beta release of Chains. Built on the foundations of Baseten's open-source Truss framework, Chains...
Axelera AI, the leading provider of purpose-built AI hardware acceleration technology for generative AI and computer vision inference, announced its successful close of an...