O’Reilly Media – ChatGPT, Author of The Quixote
TL;DR LLMs and other GenAI models can reproduce significant chunks of training data.Specific prompts seem to “unlock” training data.We have many current and future copyright challenges: training may not infringe copyright, but legal doesn’t mean legitimate—we consider the analogy of MegaFace where surveillance models have been trained on photos of minors, for example, without informed…