Analysis shows that indiscriminately training generative artificial intelligence on real and generated content, usually done by scraping data from the Internet, can lead to a collapse in the ability of the models to generate diverse high-quality output.
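For anyone curious what that collapse can look like, here is a rough toy sketch, not the method from the analysis quoted above. It assumes the "model" is just a Gaussian that is refit each generation on a small sample drawn from the previous generation's fit, so it trains on generated data only, which is a simplification of the mixed real-plus-generated case. The function name `simulate_collapse` and all of its parameters are made up for illustration.

```python
import random
import statistics


def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Toy illustration: refit a Gaussian on its own samples, generation after generation.

    Returns the fitted standard deviation at every generation, starting with
    the original "real" distribution at index 0.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0 stands in for the real data
    history = [sigma]
    for _ in range(generations):
        # "Scrape" a small dataset produced by the current model...
        samples = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        # ...and train the next model on it (maximum-likelihood fit).
        mu = statistics.fmean(samples)
        sigma = statistics.pstdev(samples)
        history.append(sigma)
    return history


history = simulate_collapse()
print(f"std at generation 0: {history[0]:.3f}")
print(f"std at generation {len(history) - 1}: {history[-1]:.3e}")
```

In this setup the fitted spread tends to shrink toward zero, because each refit slightly underestimates the variance and the errors compound. That shrinking spread is the toy analogue of a model losing the ability to produce diverse output.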
To be fair, this doesn’t sound much different from your average human using the internet.
2024, Reverse Turing Test Challenge:
Can an LLM AI differentiate between human input and LLM AI input?