Approach hardwires model weights into transistors, and uses older 6nm process. Targetting 70b model sizes (presumably 16 bit) by year end. It should cost much less than a 140gb card. but I don’t know details.

  • humanspiral@lemmy.caOP
    link
    fedilink
    English
    arrow-up
    2
    ·
    24 days ago

    It’s a good point that newer better models are released frequently. Perhaps especially among the open ones.