Are AI agents ready for the workplace? A new benchmark raises doubts.
SMRTR summary
APEX-Agents benchmark tested top AI models on real white-collar tasks from consulting, banking, and law, achieving only 24% accuracy. Models struggled with tracking information across workplace tools, showing they can't yet replace knowledge workers.
SMRTR provides this summary for quick context. The original article belongs to TechCrunch.
Read the original article