AI inference AI News & Updates

Meta Commits to Millions of Amazon's Graviton AI CPUs in Major Cloud Deal

Meta has signed a deal with AWS to use millions of Amazon's homegrown Graviton ARM-based CPUs for AI workloads, particularly for inference and AI agent tasks. This marks a shift from GPU-dominated training workloads to CPU-intensive inference needs driven by AI agents performing real-time reasoning and multi-step coordination. The deal redirects Meta's spending back to AWS from competitors like Google Cloud, while showcasing Amazon's custom chip strategy against Nvidia's competing ARM-based AI CPUs.

Gimlet Labs Raises $80M Series A for Multi-Silicon AI Inference Optimization Platform

Gimlet Labs, founded by Stanford professor Zain Asgar, has raised an $80 million Series A led by Menlo Ventures for its multi-silicon inference cloud platform. The software orchestrates AI workloads across diverse hardware types (CPUs, GPUs, high-memory systems) to improve efficiency by 3x-10x, addressing the massive underutilization of existing data center infrastructure. The company already has eight-figure revenues and partnerships with major chip makers including NVIDIA, AMD, Intel, and Cerebras.