ClawBench: The Best AI Agent Completed Only 33% of Real Everyday Online Tasks

13 April 2026

ClawBench: The Best AI Agent Completed Only 33% of Real Everyday Online Tasks

ClawBench — a benchmark testing whether AI agents can complete real everyday online tasks: booking a flight, applying for a job, placing an order. Results showed that even the strongest…