Even some of the best AI can’t beat this new benchmark
SMRTR summary
Humanity's Last Exam, a challenging new AI benchmark covering diverse subjects, was released by the Center for AI Safety and Scale AI, with initial trials showing no public AI system scoring above 10%, indicating substantial room for AI improvement.
SMRTR provides this summary for quick context. The original article belongs to Daily.dev.
Read the original article