ClawBench: The Best AI Agent Completed Only 33% of Real Everyday Online Tasks
13 April 2026
ClawBench: The Best AI Agent Completed Only 33% of Real Everyday Online Tasks
ClawBench — a benchmark testing whether AI agents can complete real everyday online tasks: booking a flight, applying for a job, placing an order. Results showed that even the strongest…
