• Preston Maness ☭@lemmygrad.ml
    link
    fedilink
    arrow-up
    21
    ·
    edit-2
    2 years ago

    Edit: alt text below. The OCRbot would (understandably) have trouble with this one. @CannotSleep420@lemmygrad.ml

    Virgin vs Chad meme.

    On the left, virgin, captioned “Virgin API consumer,” wearing a blue long sleeve shirt and black pants and shoes, and holding a mac book. There is a collar and chain around his neck, and a ball and chains attached to his feet. Numerous text bubbles and logos surround him: “JSON”, “Fears HTML,” “Limited to what the API can do,” “Has to identify himself even for read-only APIs,” “A slave to the API provider,” “Json,” “Tokens,” “has to agree to Terms of Service and follow the the rules,” “API keys,” “Thinks that he is making his life easier,” “Has to worry about quota.”

    On the right, chad, captioned “Chad Independent Scraper,” wearing an orange tank-top (with the HTML 5 logo on it), blue pants, and brown shoes, and holding two different types of paint scrapers, one in each (outstretched) hand. Numerous text bubbles and logos surround him: “follows no rules, doesn’t agree to any terms of service,” Selenium logo, “has no limitations can access any data he wants,” cURL logo, “His software is always independent,” “Doesn’t care about changes in policies,” unknown logo, Python Requests library logo, “Websites can do nothing to stop him,” “Can do whatever he wants,” “handles HTML like a real Chad,” “Doesn’t fear HTML,” “can work anonymously,” Python Requests-HTML library logo, “Even Javascript can’t stop him.”

    • Sleepless One@lemmygrad.ml
      link
      fedilink
      arrow-up
      6
      ·
      2 years ago

      Thanks for the edit. Part of the reason I’m spamming that bot so much is to see how it handles different images, and partly so people know that it exists if they ever want a quick and dirty OCR scan of images in a post or comment.

    • loathsome dongeater@lemmygrad.ml
      link
      fedilink
      arrow-up
      5
      ·
      edit-2
      2 years ago

      I’ve been meaning to learn more libreddit, teddit, nitter et al obtain reddit and twitter data. I am 90% sure they don’t use official APIs so it has to be scraping. I tried looking at the libreddit code once but it made no sense to me (skill issue).

      Edit: looks like libreddit uses the api

      • OCRBot@lemmygrad.mlB
        link
        fedilink
        arrow-up
        2
        ·
        2 years ago
        Image 1 text

        Virgin API Consumer {JSON} poso Chad Independent Scraper follows no rules, doesn’t agree to any terms of service Fears HTML Limited to the what the API can do Has to identify himself even for read-only APIs A slave to the API provider son Has to worry about quota Thinks that he is making his life easier API keys has to agree to Terms of Service and follow the the rules Even Javascript can’t stop him Se Rettests- can work anonymously has no limitations can access any data he wants His software is always independent Doesn’t care about changes in policies Doesn’t fear HTML handles HTML like a real Chad h do whatever he wants Websites can do nothing to stop him Tokens

        This action was performed by a bot.