• 2 Posts
  • 434 Comments
Joined 1 year ago
cake
Cake day: March 22nd, 2024

help-circle
    • A: This is the ‘bad’ kind of incentive. My mom worked in a hospital where people would come in pregnant, tons of neglected kids in tow, asking how much wellfare they could get for the next kid. Stuff like vouchers for school, care, healthcare and stuff doesn’t incentive that.

    • B: It’s hilariously inadequate and out-of-touch. $5K for childcare these days is a joke, even as a nice supplement.

    …But that’s the point. This is for show, like Trump’s COVID checks with his signature on them. It’s a brand to tell people “Hey! I’m Trump, and I’m helping you!” directly, a decent idea poorly implemented for PR purposes. It’s also hilariously hypocritical, seeing how much ‘blank check hand-outs’ were criticized for decades.












  • Awesome!

    I wonder if things will organize around a “unofficial” modding API like Harmony for Rimworld, Forge for Minecraft, SMAPI for Stardew Valley, and so on? I guess it depends if some hero dev team does it and there’s enough “demand” to build a bunch of stuff on it. But a “final” patch (so future random patches don’t break the community API) and community enthusiasm from Larian are really good omens.

    Skyrim and some other games stayed more fragmented, others like CP2077 just never hit critical mass I guess. And the format of the modded content just isn’t the same for mega RPGs like this.



  • I mean, “modest” may be too strong a word, but a 2080 TI-ish workstation is not particularly exorbitant in the research space. Especially considering the insane dataset size (years of noisy, raw space telescope data) they’re processing here.

    Also that’s not always true. Some “AI” models, especially oldschool ones, function fine on old CPUs. There are also efforts (like bitnet) to get larger ones fast cheaply.



  • That’s even overkill. A 3090 is pretty standard in the sanely priced ML research space. It’s the same architecture as the A100, so very widely supported.

    5090 is actually a mixed bag because it’s too new, and support for it is hit and miss. And also because it’s ridiculously priced for a 32G card.

    And most CPUs with tons of RAM are fine, depending on the workload, but the constraint is usually “does my dataset fit in RAM” more than core speed (since just waiting 2X or 4X longer is not that big a deal).


  • The model was run (and I think trained?) on very modest hardware:

    The computer used for this paper contains an NVIDIA Quadro RTX 6000 with 22 GB of VRAM, 200 GB of RAM, and a 32-core Xeon CPU, courtesy of Caltech.

    That’s a double VRAM Nvidia RTX 2080 TI + a Skylake Intel CPU, an aging circa-2018 setup. With room for a batch size of 4096, nonetheless! Though they did run into some preprocessing bottleneck in CPU/RAM.

    The primary concern is the clustering step. Given the sheer magnitude of data present in the catalog, without question the task will need to be spatially divided in some way, and parallelized over potentially several machines