"the secret list of websites" - Chris Coyier
The Washington Post does research to figure out which websites were used to train Google’s AI model: “To look inside this black box, we analyzed Google’s C4 data set, a massive snapshot of the contents of 15 million websites that have been used to instruct some high-profile English-language AIs, called large language models, including Google’s T5 […]”