A note that this setup runs a 671B model in Q4 quantization at 3-4 TPS; running it at Q8 would need something beefier. To run a 671B model in the original Q8 at 6-8 TPS you'd need a dual-socket EPYC server motherboard with 768GB of RAM.
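For a rough sense of why, the weights alone scale with bits per parameter. Here's a back-of-envelope sketch (ignoring KV cache, activations, and runtime overhead, so real requirements are higher; quantization formats also carry some per-block metadata this skips):

```python
# Rough weight memory for a 671B-parameter model at different
# quantization widths. Weights only; real usage is higher.

PARAMS = 671e9  # 671B parameters

def weights_gib(bits_per_param: float) -> float:
    """Approximate weight size in GiB for a given quantization width."""
    return PARAMS * bits_per_param / 8 / 2**30

print(f"Q4: ~{weights_gib(4):.0f} GiB")  # ~312 GiB of weights
print(f"Q8: ~{weights_gib(8):.0f} GiB")  # ~625 GiB -> hence the 768GB dual-socket box
```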
Oh yeah, I once tried a small 8B LLM locally too. I can't remember if it was DeepSeek's, but I think it was. It was writing at like one token every 2-3 seconds, and after like 5 minutes, seeing the first message still not done, I realized it was too much to ask of my poor GT 1030. I also heard about the cheap API; many people were delighted, since ChatGPT's cost much, much more than that. Let's hope DeepSeek's API becomes free soon! Even now, I'm assuming you can get days' worth of conversation with just 1€.
I estimated that to translate ProleWiki from English into 5 languages (the API charges per input token and per output token, i.e. what you feed it -> the English content, and what it outputs -> the translated content) it would cost us $50 maximum with the DeepSeek API. ChatGPT is so expensive I didn't even try; it was going to be in the hundreds of dollars lol. DeepSeek's output per 1M tokens is 50 cents in the off-hours (easy, just run your code during the off-hours automatically), and GPT's is $1.60 for their "mini" model, which is still over 3x as expensive.
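For a feel of that math, here's a quick sketch. The token counts are hypothetical placeholders (not ProleWiki's real size), the $0.50/1M off-hours output rate is the one quoted above, and the input rate is an assumption; check DeepSeek's pricing page before relying on any of it:

```python
# Quick cost model: the API bills input tokens (the English source you
# feed it) and output tokens (the translation it returns), each at a
# per-1M-token rate.

def api_cost(input_tokens: float, output_tokens: float,
             in_rate: float, out_rate: float) -> float:
    """Total cost in dollars; rates are per 1M tokens."""
    return (input_tokens * in_rate + output_tokens * out_rate) / 1e6

source_tokens = 15e6  # hypothetical: ~15M tokens of English content
languages = 5
in_total = source_tokens * languages   # the source goes in once per language
out_total = source_tokens * languages  # each translation is roughly the same length

# $0.50/1M output is the off-hours rate quoted above;
# the $0.135/1M input rate is a guess, not a quoted figure.
print(f"DeepSeek off-hours: ~${api_cost(in_total, out_total, 0.135, 0.50):.0f}")
# -> roughly $48, in line with the 'maximum $50' estimate
```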
There are other Chinese models coming along; I think Xiaomi is making one. They're also innovating in image and video generation models, not just text models. One that came out shortly after DeepSeek is the one someone described as too cheap to meter (because it literally uses so few resources to run that it makes no sense to even keep track of usage!), but I haven't heard more about it since.