model deployment AI News & Updates

Google Cloud VP Outlines Three Frontiers of AI Model Capability: Intelligence, Latency, and Scalable Cost

Michael Gerstenhaber, VP of Google Cloud's Vertex AI platform, describes three distinct frontiers driving AI model development: raw intelligence for complex tasks, low latency for real-time interactions, and cost-efficient scalability for mass deployment. He explains that agentic AI adoption is slower than expected due to missing production infrastructure like auditing patterns, authorization frameworks, and human-in-the-loop safeguards, though software engineering has seen faster adoption due to existing development lifecycle protections.

OpenAI Addresses GPT-5 Launch Issues Including Router Problems and User Complaints

OpenAI CEO Sam Altman held a Reddit AMA to address widespread complaints about GPT-5's poor performance following its rollout, attributing issues to a malfunctioning automatic model router. The company promised fixes including restoring access to GPT-4o for Plus users and doubling rate limits, while also addressing embarrassing presentation errors including a widely mocked chart mistake.