wh-zhu commited on
Commit
dab75ca
·
verified ·
1 Parent(s): 037dc95

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +29 -0
  3. exp1.png +3 -0
.gitattributes CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
 
 
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
+ exp1.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ <h1 align="center">🛠️ ReAligner</h1>
3
+ <p align="center">
4
+ <a href="https://arxiv.org/abs/2503.04346"><img src="https://img.shields.io/badge/arXiv-arXiv%20Preprint-B31B1B?style=flat&logo=arxiv&logoColor=white" alt="arXiv Paper"></a>
5
+ &nbsp;
6
+ <a href="https://github.com/zwhong714/ReAligner"><img src="https://img.shields.io/badge/Homepage-Project%20Page-brightgreen?style=flat&logo=github" alt="Homepage"></a>
7
+ &nbsp;
8
+ <a href="https://huggingface.co/wh-zhu"><img src="https://img.shields.io/badge/Huggingface-Models-yellow?style=flat&logo=huggingface" alt="Models"></a>
9
+ </p>
10
+
11
+
12
+
13
+
14
+ <div>
15
+ A flexible realignment framework is proposed to quantitatively control alignment during training and inference, combining Training-time Realignment (TrRa) and Inference-time Realignment (InRa).
16
+
17
+ - We realign DeepScaleR-1.5B model and reduce token usage without performance loss and even enhance reasoning capabilities.
18
+
19
+
20
+ </div>
21
+
22
+ </div>
23
+
24
+ <div>
25
+ <br>
26
+
27
+
28
+
29
+ ![img](./exp1.png)
exp1.png ADDED

Git LFS Details

  • SHA256: 10e5cb03ac915428010d9a92a5095f7746e5668944c1f275e6df1e74f9f86136
  • Pointer size: 131 Bytes
  • Size of remote file: 257 kB