Meta employees have discussed training the company’s AI models using copyrighted works obtained through questionable means, sparking a legal dispute with authors like Sarah Silverman and Ta-Nehisi Coates. Internal chats reveal discussions about buying e-books at retail prices instead of licensing deals with publishers, considering Libgen as a source despite its legal issues, and tuning models to avoid IP risks. Meta’s approach to using copyrighted data for AI training and the potential use of Reddit data are also highlighted in the court documents.
Full Article
Loading PerspectiveSplit analysis...
