Thoughts
Training Reasoning Models: Notes as I Try to Understand GRPO and DAPO
February 16, 2026
Notes on GRPO, DAPO, and training reasoning capabilities in language models.
February 16, 2026
Notes on GRPO, DAPO, and training reasoning capabilities in language models.