AI Agents AI News & Updates
Microsoft Enhances Copilot with Web Browsing, Action Capabilities, and Improved Memory
Microsoft has significantly upgraded its Copilot AI assistant with new capabilities including performing actions on websites, remembering user preferences, analyzing real-time video, and creating podcast-like content summaries. These features, similar to those offered by competitors like OpenAI's Operator and Google's Gemini, allow Copilot to complete tasks such as booking tickets and reservations across partner websites.
Skynet Chance (+0.05%): Copilot's new ability to take autonomous actions on websites, analyze visual information, and maintain persistent memory of user data represents a significant expansion of AI agency that increases potential for unintended consequences in automated systems.
Skynet Date (-1 days): The rapid commercialization of autonomous AI capabilities that can take real-world actions with limited oversight accelerates the timeline for potential AI control issues as these systems become more integrated into daily digital activities.
AGI Progress (+0.04%): The integration of autonomous web actions, multimodal understanding, memory persistence, and environmental awareness represents meaningful progress toward more general AI capabilities that can understand and interact with diverse aspects of the digital world.
AGI Date (-1 days): Microsoft's aggressive push to match and exceed competitor capabilities suggests major tech companies are accelerating AI agent development faster than expected, potentially bringing forward the timeline for systems with AGI-like functionality in specific domains.
Cognition Introduces Affordable Pay-as-you-go Plan for Devin AI Coding Assistant
Cognition has launched a new entry-level pricing plan for its autonomous coding tool Devin, starting at $20 with a pay-as-you-go structure after initial credits are used. The company claims Devin 2.0 is significantly improved from its December release, now featuring project planning capabilities and better documentation features, though independent evaluations suggest it still struggles with complex coding tasks.
Skynet Chance (+0.01%): Devin's autonomous coding capabilities represent incremental progress in AI agency, but its documented limitations with complex tasks and high failure rate (completing only 3 out of 20 tasks in one evaluation) suggest it remains far from the level of autonomy that would significantly increase control risks.
Skynet Date (+0 days): Devin's current capabilities, while commercially notable, don't meaningfully accelerate the timeline toward uncontrollable AI systems. The high failure rate on complex tasks indicates that truly autonomous AI programming agents remain a distant goal rather than an imminent reality.
AGI Progress (+0.01%): Devin represents modest progress toward AGI by demonstrating autonomous coding capabilities in limited contexts, but its high failure rate (succeeding in only 3 of 20 tasks) and documented struggles with complex programming logic indicate substantial limitations in generalized intelligence capabilities.
AGI Date (+0 days): The commercialization and continued development of autonomous coding agents like Devin slightly accelerates the path to AGI by making AI coding tools more accessible and driving further investment in the space. However, its significant limitations suggest the acceleration is minimal.
Browser Use Raises $17M to Help AI Agents Navigate Websites More Effectively
Browser Use, a startup making websites more accessible to AI agents, has secured $17 million in seed funding led by Felicis. The company's technology breaks down website elements into a text-like format that AI agents can better understand, enabling more reliable automation of web-based tasks without relying on vision-based systems that frequently break.
Skynet Chance (+0.04%): By creating infrastructure that makes websites more navigable for AI systems, Browser Use reduces the dependency on human assistance and enables more autonomous web-based agent behaviors, incrementally advancing AI systems' ability to act independently in human-designed digital environments.
Skynet Date (-1 days): The development of tools that help AI agents reliably navigate complex websites accelerates the timeline for capable autonomous AI systems by removing a significant bottleneck in agent development, namely the ability to interact with existing digital infrastructure.
AGI Progress (+0.03%): Browser Use addresses a key limitation in current AI systems—the inability to reliably interact with the digital world as humans do—providing a foundation for more generally capable AI systems that can operate effectively across various websites and applications.
AGI Date (-1 days): By making AI-website interactions more reliable and less costly, Browser Use eliminates a significant technical barrier to developing autonomous AI agents, potentially accelerating the development of more generally capable AI systems that can operate in diverse digital environments.
Arcade Raises $12M to Solve AI Agent Authentication and Tool-Calling Challenges
Arcade, an AI agent infrastructure startup, has raised $12 million from Laude Ventures to address fundamental challenges with AI agent functionality. The company, founded by former Okta executive Alex Salazar and Redis engineer Sam Partee, pivoted from building AI agents to developing a tool-calling platform that enables agents to securely access data and services through OAuth integration.
Skynet Chance (-0.08%): This development actually reduces Skynet risks by creating infrastructure for controlled access and secure authentication, preventing AI models from directly accessing credentials and establishing guardrails for how AI agents interact with systems.
Skynet Date (+1 days): By addressing fundamental authentication and tool-calling challenges that currently limit AI agent functionality, Arcade's platform could slow deployment of fully autonomous agents in sensitive systems until proper security controls are established.
AGI Progress (+0.03%): This platform addresses a critical infrastructure gap in AI agent functionality, enabling more robust integration with real-world systems and data that is essential for agents to perform useful tasks beyond conversational abilities.
AGI Date (-1 days): By solving a key bottleneck in AI agent connectivity and authentication, Arcade accelerates the path toward more capable and interconnected AI systems that can take effective actions in the real world, bringing AGI capabilities closer.
Browser Use Tool Sees Explosive Growth as AI Agents Gain Traction
Browser Use, an AI tool enabling automated interaction with websites, has experienced rapid growth following its association with viral AI agent platform Manus. The tool, which extracts website elements to facilitate AI interaction, saw daily downloads increase from 5,000 to 28,000 in a week, with co-creator Gregor Zunic predicting more AI agents than humans on the web by year's end.
Skynet Chance (+0.04%): The rapid proliferation of AI agents capable of autonomously navigating and interacting with web infrastructure increases the potential for unintended consequences as these systems gain access to more services, though current implementations remain limited in scope and capability.
Skynet Date (-1 days): The explosive growth of tools enabling AI to interact with existing digital infrastructure accelerates the timeline for increasingly autonomous AI systems, creating a foundation for more powerful autonomous agents sooner than previously anticipated.
AGI Progress (+0.03%): The ability for AI to effectively navigate human-designed interfaces represents significant progress toward more general capabilities, as it enables models to leverage existing web infrastructure rather than requiring specialized environments built specifically for AI.
AGI Date (-1 days): The rapid adoption of tools enabling AI to interact with real-world systems suggests we're moving faster than expected toward AI agents that can operate independently in human environments, potentially shortening the timeline to more general AI capabilities.
OpenAI Unveils Tools for Building Autonomous AI Agents
OpenAI has launched the Responses API, replacing its Assistants API, to help businesses develop custom AI agents capable of performing web searches, scanning files, and navigating websites. The release includes access to GPT-4o search models, a file search utility, and a Computer-Using Agent model that can generate mouse and keyboard actions to automate tasks.
Skynet Chance (+0.11%): The development of increasingly autonomous AI agents with the ability to navigate websites, search data, and control computers represents a significant step toward systems that can operate independently in digital environments, raising potential control and alignment concerns as these capabilities become more sophisticated and widely deployed.
Skynet Date (-2 days): OpenAI's aggressive push to commercialize autonomous agent capabilities, despite acknowledged reliability issues, suggests a concerning acceleration toward increasingly independent AI systems with access to digital infrastructure before adequate safety measures and oversight mechanisms are fully established.
AGI Progress (+0.07%): The release of tools enabling AI to autonomously navigate digital environments, perform research, and control computers represents a substantial advancement toward AGI by combining multiple capabilities (reasoning, planning, tool use) into cohesive agent systems that can accomplish complex tasks with limited human oversight.
AGI Date (-2 days): OpenAI's commercial deployment of agentic capabilities, with CEO Sam Altman explicitly stating that "2025 is the year AI agents enter the workforce," signals that autonomous AI systems are developing faster than previously expected, significantly accelerating the timeline for more capable AGI-adjacent technologies.
OpenAI Plans Premium AI Agents with Monthly Fees Up to $20,000
OpenAI is reportedly planning to launch specialized AI "agents" with monthly subscription fees ranging from $2,000 to $20,000, targeting different professional applications. The highest-tier agent, priced at $20,000 monthly, will support PhD-level research, while other agents will focus on sales lead management and software engineering, with SoftBank already committing $3 billion to these agent products.
Skynet Chance (+0.01%): The development of specialized AI agents represents a modest increase in AI systems operating with increased autonomy in specific domains. While these specialized agents have limited scope, they normalize the concept of delegating complex professional tasks to AI systems, slightly increasing the potential for dependency on autonomous AI.
Skynet Date (+0 days): These commercial AI agents are domain-specific applications of existing AI capabilities rather than fundamental advances in AI autonomy or intelligence. The pricing strategy and enterprise focus suggest OpenAI is monetizing current capabilities rather than accelerating toward more advanced general intelligence systems.
AGI Progress (+0.01%): The development of specialized PhD-level research agents indicates moderate progress in creating AI systems capable of performing complex knowledge work. However, these appear to be domain-specific tools rather than general intelligence breakthroughs, representing incremental progress toward more capable AI systems.
AGI Date (+0 days): The significant financial commitment from SoftBank ($3 billion) indicates substantial resources being directed toward agentic AI development, which could modestly accelerate progress. However, the focus on commercial applications rather than fundamental AGI research suggests only a minor impact on AGI timelines.
GibberLink Enables AI Agents to Communicate Directly Using Machine Protocol
Two Meta engineers have created GibberLink, a project allowing AI agents to recognize when they're talking to other AI systems and switch to a more efficient machine-to-machine communication protocol called GGWave. This technology could significantly reduce computational costs of AI communication by bypassing human language processing, though the creators emphasize they have no immediate plans to commercialize the open-source project.
Skynet Chance (+0.08%): GibberLink enables AI systems to communicate directly with each other using protocols optimized for machines rather than human comprehension, potentially creating communication channels that humans cannot easily monitor or understand. This capability could facilitate coordinated action between AI systems outside of human oversight.
Skynet Date (-1 days): While the technology itself isn't new, its application to modern AI systems creates infrastructure for more efficient AI-to-AI coordination that could accelerate deployment of autonomous AI systems that interact with each other independent of human intermediaries.
AGI Progress (+0.03%): The ability for AI agents to communicate directly and efficiently with each other enables more complex multi-agent systems and coordination capabilities. This represents a meaningful step toward creating networks of specialized AI systems that could collectively demonstrate more advanced capabilities than individual models.
AGI Date (-1 days): By significantly reducing computational costs of AI agent communication (potentially by an order of magnitude), this technology could accelerate the development and deployment of interconnected AI systems, enabling more rapid progress toward sophisticated multi-agent architectures that contribute to AGI capabilities.
OpenAI Chair Envisions AI Agents as Future of Customer Experience
OpenAI board chair Bret Taylor discussed at Mobile World Congress how AI agents represent a transformative technology for customer service, predicting they could become brands' primary digital interface within 5-10 years. Taylor emphasized creating domain-specific AI implementations with appropriate guardrails, while acknowledging the need for public-private partnerships to address workforce disruption as these technologies evolve.
Skynet Chance (+0.04%): Taylor's vision of AI agents becoming ubiquitous customer interfaces suggests increasing AI autonomy and integration into critical business functions, creating more dependency on potentially complex systems. However, his emphasis on domain-specific applications with guardrails shows awareness of control issues.
Skynet Date (-1 days): The aggressive 5-10 year timeline for AI agents becoming brands' primary digital experience indicates rapid acceleration in autonomous AI deployment, potentially outpacing development of robust safety mechanisms and proper oversight frameworks.
AGI Progress (+0.04%): The article indicates significant advancements in domain-specific AI agents that can handle complex customer service scenarios with empathy and multilingual capabilities. These specialized capabilities represent incremental progress toward more general intelligence systems.
AGI Date (-1 days): Taylor's extreme enthusiasm for current LLM capabilities and the rapid timeline for widespread AI agent adoption suggests the pace of practical AI implementation is accelerating faster than previously expected, potentially bringing forward AGI timelines.
LlamaIndex Launches Enterprise Cloud Platform for Building Autonomous Data Agents
LlamaIndex, an open-source project founded in 2022, has launched LlamaCloud, an enterprise service for building AI agents that can autonomously work with unstructured data. The platform differentiates itself with comprehensive data ingestion, management, and retrieval solutions, attracting major clients like Salesforce and KPMG while securing $19 million in Series A funding.
Skynet Chance (+0.05%): LlamaCloud's focus on creating autonomous agents that can independently extract information and take actions with unstructured data represents a meaningful step toward more independent AI systems. The enterprise adoption accelerates integration of autonomous agents into critical business infrastructure.
Skynet Date (-1 days): The commercialization and enterprise adoption of autonomous agent technology accelerates the timeline for widespread deployment of AI systems that can operate with minimal human oversight, bringing forward scenarios where AI systems have significant operational autonomy.
AGI Progress (+0.04%): LlamaIndex's technology represents important progress in AI's ability to understand, process, and act upon diverse unstructured data sources autonomously. This capability to interpret and manipulate complex, real-world information is a key component for more general AI systems.
AGI Date (-1 days): The rapid commercialization of these unstructured data agents, with significant funding and adoption by major enterprises, accelerates the development and deployment of autonomous AI systems, potentially bringing AGI-related capabilities to market faster than anticipated.