What are your prompts for code assistants?
How To Run DeepSeek R1 671B Fully Locally On a $2000 EPYC Server
Ollama is confusing people by pretending that the little distillation models are "R1"
Almost a year later, I can finally do this. A small teaser of a project I'm working on
48gb vs 96gb VRAM for fine-tuning
6x AMD Instinct MI60 AI Server vs Llama 405B + vLLM + Open-WebUI - Impressive!
Asking for hardware recommendations for a personal machine capable of running 70B+ models. With cloud options I have to re-download the model every time. Should I bite the bullet and get a Mac Studio M2 Ultra ($7000 after tax), or build a PC? What specs do you recommend?
Now that Phi-4 has been out for a while what do you think?
Phi-4 is just 14B, but better than Llama 3.1 70B for several tasks.
Wow, is this maybe the best open source model?
To understand the Project DIGITS desktop (128 GB for $3k), look at the existing Grace CPU systems
For those who care about how o1 works technically, OpenAI has stated that o1 was built using reinforcement fine-tuning, which was announced by OpenAI on December 6 as day 2 of Shipmas
UwU 7B Instruct
My personal guide for developing software with AI assistance
Llama 3.3 70B Help
How long until an AI agent that interacts with email, calendar, to-do list, etc.?
I've tried Granite 3.1 3b. It was very fast and very bad.
Doing the thing: speculations about the next DBRX release?
Budget, AKA poor man's, local LLM.
Intel preparing Arc (PRO) "Battlemage" GPU with 24GB memory - VideoCardz.com
Marco-o1 posted on ollama
PyTorch just released their own LLM solution - torchchat
Nvidia just dropped its Multimodal model NVLM 72B
Linux Compatibility w/ Intel Ultra Processors
Would anyone be interested in buying pre-built and pre-setup 4x3090/4090 or 8x3090/4090 PCs for training/inference?