• ubergeek@lemmy.today
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 个月前

    And I’m assuming if the robots.txt state their UserAgent isn’t allowed to crawl, it obeys it, right? :P

    • Kissaki@feddit.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 个月前

      No, as per the article, their argumentation is that they are not web crawlers generating an index, they are user-action-triggered agents working live for the user.

      • ubergeek@lemmy.today
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 个月前

        Except, it’s not a live user hitting 10 sights all the same time, trying to crawl the entire site… Live users cannot do that.

        That said, if my robots.txt forbids them from hitting my site, as a proxy, they obey that, right?