Alongside R1 and R1-Zero, DeepSeek today open-sourced a set of less capable but more hardware-efficient models. Those models ...
The company developed DeepSeek-R1 by using pure reinforcement learning on top of DeepSeek-V3-Base, and matched or beat o1 on some benchmarks.