• hendrik@palaver.p3x.de
    1 day ago

    I think perplexity is still central to evaluating models. It’s notoriously difficult to come up with other ways to measure these things.
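
For anyone who hasn't computed it by hand: perplexity is the exponential of the average negative log-likelihood per token, so lower means the model was less "surprised" by the text. Here's a rough sketch in Python. The function name and the example probabilities are made up for illustration, and it assumes you already have natural-log probabilities for each token from whatever model you're testing.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) for natural-log token probabilities."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Hypothetical example: a model that assigns probability 0.25 to every token
# behaves like a uniform guess over 4 options, so perplexity comes out close to 4.
uniform_guess = [math.log(0.25)] * 4
print(perplexity(uniform_guess))

# A more confident (and correct) model gets a lower score.
confident = [math.log(0.9), math.log(0.8), math.log(0.95)]
print(perplexity(confident))
```

Note that scores are only comparable between models that share a tokenizer and test text, which is part of why it doesn't tell the whole story.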