If the linked article has a paywall, you can access this archived version instead: https://archive.ph/aeQhP

OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.

  • 0xvalentin@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 months ago

    I wonder when google is going to tap into Gmail data of users (if they do not already). They must have trillions of english messages and they already filtered spam. Additionally, it’s hard to ever prove that they did it.

    Maybe it doesn’t make for high quality data though, not sure…

    • onion@feddit.de
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      7 months ago

      I mean if you want the ai to be able to write and format emails and understand lingo like ‘attachement’, ‘subject’ and ‘bcc’, you probably have to feed it emails