I don't see a way that we shut down LLM technology because of copyright concerns. This horse has left the barn - LLM capabilities are too valuable for folks in power to walk away from. Do you really think the US Govt is going to say - OK, fair enough, let's pack this thing up - while China powers on full speed ahead? This is strategically significant technology that is potentially only the beginning of an exponential curve. And now that the technology to do this is open source, and scraping of public web content is free use - do we really want to setup constraints so that the only people with the power of frontier LLMs are those with the power and money to do it in secret? Guess what - the NSA has all the training data they could ever want (
https://nsa.gov1.info/utah-data-center/) - and I for one want to make sure that EVERYONE has access to the productivity increases made possible by generative AI, not just those with power and influence to do what they desire in secret.