OpenAI’s models ‘memorized’ copyrighted content, new study suggests
SMRTR summary
Researchers developed a method to identify memorized training data in AI models, suggesting OpenAI may have used copyrighted content like fiction books and news articles to train GPT-4, supporting claims in current lawsuits.
SMRTR provides this summary for quick context. The original article belongs to TechCrunch.
Read the original article