Open Reproduction of DeepSeek-R1 (github.com) AI

Hugging Face has released “open-r1,” an open-source, work-in-progress project aimed at fully reproducing DeepSeek-R1 by rebuilding the missing R1 training pipeline components (distillation, RL training, and evaluation) with scripts and a runnable Makefile. The repo describes a step-by-step plan, including releasing multiple distilled datasets and recipes—such as a 350k verified “Mixture-of-Thoughts” reasoning dataset and an OpenR1-Distill-7B training recipe—along with instructions for supervised fine-tuning (SFT) and GRPO training.

June 11, 2026 19:01 Source: Hacker News