Some argue that bots should be entitled to ingest any content they see, because people can.

  • Dkarma@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    2 years ago

    What is not a different question. As a creator you don’t get to say what or who can ingest your creation. If you did Google image search wouldn’t exist.

    The thing you’re failing to realize is that this isn’t the first time a computer has been used to ingest info. The rules you assert have never been true to this point. Crawlers have been scanning web pages and images since the dawn of the Internet.

    You act like this just started happening so now you get to put rules on what gets to look at that image. Too late there’s decades of precedent.

    • RickRussell_CA@kbin.socialOP
      link
      fedilink
      arrow-up
      0
      ·
      2 years ago

      But there are absolutely rules on whether Google – or anything else – can use that search index to create a product that competes with the original content creators.

      For example, https://en.wikipedia.org/wiki/Authors_Guild,_Inc._v._Google,_Inc.

      Google indexing of copyrighted works was considered “fair use” only because they only offered a few preview pages associated with each work. Google’s web page excerpts and image thumbnails are widely believed to pass fair use under the same concept.

      Now, let’s say Google wants to integrate the content of multiple copyrighted works into an AI, and then give away or sell access to that AI which can spit out the content (paraphrased, in some capacity) of any copyrighted work it’s ever seen. You’ll even be able to ask it questions, like “What did Jeff Guin say about David Koresh’s religious beliefs in his 2023 book, Waco?” and in all likelihood it will cough up a summary of Mr. Guinn’s uniquely discovered research and journalism.

      I don’t think the legal questions there are settled at all.

      • Dkarma@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        2 years ago

        You just proved my point there is nothing from stopping Google from scanning all those they just have to limit what they show of what they scanned. There it is easy to prove because the content is verbatim.

        In the case of ai it is not verbatim. How do you prove the results are directly derived from say reading harry potter vs ingesting a forums worth of content regarding hp? I don’t think as a plaintiff u can show damages or that your works were even used… The only reason this is even an issue is because chatgpt creators admitted they scanned books etc.