Scientists once hoarded pre-nuclear steel; now we’re hoarding pre-AI contentNewly announced catalog collects pre-2022 sources untouched by ChatGPT and AI contamination.#artificial intelligence#digital ethics#future of technology#large language models#AI#generative AI#AI critique#preAI·arstechnica.com·Jun 19, 2025Scientists once hoarded pre-nuclear steel; now we’re hoarding pre-AI content
The Unbelievable Scale of AI’s Pirated-Books ProblemMeta pirated millions of books to train its AI. Search through them here.#ai ethics#artificial intelligence#AI#digital ethics#intellectual property#large language models#copyright infringement#regulation#machine learning#generative AI·theatlantic.com·Apr 23, 2025The Unbelievable Scale of AI’s Pirated-Books Problem