DeepSeek-R1: A Primer
💡
This article tries to understand and explain the engineering behind DeepSeek. It is based on the paper available at https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf
In the past week, a Chinese LLM named DeepSeek-R1 has t...
tjgokken.com · 10 min read