[Paper Review] DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
It has not been a long time since DeepSeek was released. It was indeed a shock to those who are in AI industry.
I was not familiar with LLM’s algorithm and the computing resource usage of the LLMs. All I was doing was to utilise the LLM APIs for deve...
ramieeee.me6 min read