• brucethemoose@lemmy.world
    20 days ago

    One more thing: you don’t have to buy something shiny and new to speed LLMs up. Even if you have a 4-6GB GPU collecting dust somewhere, you can still use it to partially offload MoE models to great effect.
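
To make the "partial offload" idea concrete, here is a rough back-of-the-envelope sketch in Python. It assumes the usual split for MoE models: the always-used weights (attention, shared layers) and the KV cache go on the GPU first, and whatever VRAM is left takes as many per-layer expert blocks as fit, with the rest staying in system RAM. The function name `plan_moe_offload` and every size in the example are made up for illustration, not measurements of any real model or runtime.

```python
def plan_moe_offload(vram_gb, shared_gb, kv_cache_gb,
                     expert_gb_per_layer, n_layers, reserve_gb=0.5):
    """Estimate how a small GPU could be used for a MoE model.

    Returns (shared_fits, expert_layers_on_gpu). All sizes are in GB and
    are hypothetical inputs supplied by the caller.
    """
    if expert_gb_per_layer <= 0:
        raise ValueError("expert_gb_per_layer must be positive")

    # Keep a little VRAM free for the runtime's own buffers (assumed value).
    budget = vram_gb - reserve_gb

    # Always-active weights and the KV cache get priority on the GPU.
    needed = shared_gb + kv_cache_gb
    if needed > budget:
        return False, 0

    # Any leftover VRAM holds whole per-layer expert blocks; the rest stay on CPU.
    spare = budget - needed
    expert_layers_on_gpu = min(n_layers, int(spare // expert_gb_per_layer))
    return True, expert_layers_on_gpu


if __name__ == "__main__":
    # Hypothetical 6GB card and hypothetical model sizes.
    fits, layers = plan_moe_offload(6.0, 2.0, 1.0, 0.5, 32)
    print(f"shared weights fit: {fits}, expert layers on GPU: {layers}")
```

Whichever inference runtime you use will have its own way of expressing this split, so treat the numbers as a sanity check on whether an old card is worth plugging in, not as a tuning recipe.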