xiwenc1 commited on
Commit
2e2054a
·
verified ·
1 Parent(s): 6ab7d4e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -3,3 +3,6 @@ license: cc-by-4.0
3
  ---
4
 
5
  This model is described in the paper [DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models](https://arxiv.org/abs/2505.09655).
 
 
 
 
3
  ---
4
 
5
  This model is described in the paper [DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models](https://arxiv.org/abs/2505.09655).
6
+
7
+
8
+ Full code is in: https://github.com/xiwenc1/DRA-GRPO