SMRTR AI • Apr 17, 2025 • Engadget

Wikipedia offers AI developers a training dataset to maybe get scraper bots off its back

SMRTR summary

The Wikimedia Foundation is offering AI developers a structured dataset of Wikipedia content to ease the server strain caused by AI crawlers. The dataset, available through Kaggle in English and French, includes article abstracts, descriptions, and key data points, but excludes references and non-prose elements.

SMRTR provides this summary for quick context. The original article belongs to Engadget.
