SMRTR AIApr 17, 2025Engadget

Wikipedia offers AI developers a training dataset to maybe get scraper bots off its back

SMRTR summary

Wikimedia Foundation is offering AI developers a structured dataset of Wikipedia content to address server strain from AI crawlers. The dataset, available through Kaggle in English and French, includes article abstracts, descriptions, and key data points, but excludes references and non-prose elements.

SMRTR provides this summary for quick context. The original article belongs to Engadget.

Read the original article
SMRTR AI

Get the next batch of curated summaries in your inbox.

This archive is built from SMRTR newsletter summaries. Subscribe for hand-picked stories without the extra noise.