cross-posted from: https://nom.mom/post/121481
OpenAI could be fined up to $150,000 for each piece of infringing content.https://arstechnica.com/tech-policy/2023/08/report-potential-nyt-lawsuit-could-force-openai-to-wipe-chatgpt-and-start-over/#comments
Do you remember quotes in english ascii /s
Tokens are even denser than ascii. simmlar to word “chunking” My guess is it’s like lossy video compression but for text, [Attacked] with [lazers] by [deatheaters] apon [margret];[has flowery language]; word [margret] [comes first] (Theoretical example has 7 “tokens”)
It may have actually impressioned a really good copy of that book as it’s lilely read it lots of times.
If it’s lossy enough then it’s just a high-level conceptual memory, and that’s not copyrightable.
It varries based on how much time its been given with the media.