If the linked article has a paywall, you can access this archived version instead: https://archive.ph/aeQhP
OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.
I wonder when google is going to tap into Gmail data of users (if they do not already). They must have trillions of english messages and they already filtered spam. Additionally, it’s hard to ever prove that they did it.
Maybe it doesn’t make for high quality data though, not sure…
I mean if you want the ai to be able to write and format emails and understand lingo like ‘attachement’, ‘subject’ and ‘bcc’, you probably have to feed it emails