• Arthur Besse@lemmy.ml
    cake
    M
    link
    fedilink
    arrow-up
    2
    ·
    4 年前

    It appears that the captioning model on that website was trained on the MSCOCO dataset which was sourced from from Google and Bing image search, and also from Flickr.