professional automation AI News & Updates

Anthropic's Opus 4.6 Achieves Major Leap in Professional Task Performance with 45% Success Rate

Anthropic's newly released Opus 4.6 model achieved nearly 30% accuracy on professional task benchmarks in one-shot trials and 45% with multiple attempts, representing a significant jump from the previous 18.4% state-of-the-art. The model includes new agentic features such as "agent swarms" that appear to enhance multi-step problem-solving capabilities for complex professional tasks like legal work and corporate analysis.