Smaug-72B-v0.1: The New Open-Source LLM Roaring to the Top of the Leaderboard

hexual@lemmy.world · 9 months ago

Smaug-72B-v0.1: The New Open-Source LLM Roaring to the Top of the Leaderboard

kakes@sh.itjust.works · 9 months ago

Afaik you can substitute VRAM with RAM at the cost of speed. Not exactly sure how that speed loss correlates to the sheer size of these models, though. I have to imagine it would run insanely slow on a CPU.

Infiltrated_ad8271@kbin.social · edit-2 9 months ago

I tested it with a 16GB model and barely got 1 token per second. I don’t want to imagine what it would take if I used 16GB of swap instead, let alone 130GB.

Smaug-72B-v0.1: The New Open-Source LLM Roaring to the Top of the Leaderboard

Smaug-72B-v0.1: The New Open-Source LLM Roaring to the Top of the Leaderboard

abacusai/Smaug-72B-v0.1 · Hugging Face