It turns out you can train AI models without copyrighted material
SMRTR summary
Researchers from 14 institutions built an AI model using only public domain and openly licensed material, challenging claims that copyrighted content is essential for training. The 7-billion-parameter model performed comparably to Meta's Llama 2-7B from 2023, though creating it was labor-intensive. This ethical approach could influence future AI regulation debates and legal cases.
SMRTR provides this summary for quick context. The original article belongs to Engadget.