SMRTR AI• Jan 23, 2025• Daily.dev

Even some of the best AI can’t beat this new benchmark

SMRTR summary

Humanity's Last Exam, a challenging new AI benchmark covering diverse subjects, was released by the Center for AI Safety and Scale AI, with initial trials showing no public AI system scoring above 10%, indicating substantial room for AI improvement.

SMRTR provides this summary for quick context. The original article belongs to Daily.dev.

Read the original article

SMRTR AI

Get the next batch of curated summaries in your inbox.

This archive is built from SMRTR newsletter summaries. Subscribe for hand-picked stories without the extra noise.