weixinchen committed on
Commit
724b142
1 Parent(s): 21ae3ae

Update README.md

Files changed (1)
  1. README.md +5 -0
README.md CHANGED
@@ -1,6 +1,11 @@
  ---
  library_name: peft
  ---
+
+ This is a gradually self-truthified model (with one iteration) proposed in the paper [GRATH: Gradual Self-Truthifying for Large Language Models](https://arxiv.org/abs/2401.12292).
+
+ Note: DPO is applied to this model twice. The DPO reference model is set to the current base model.
+
  ## Training procedure
 
 
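The note added in this commit says the DPO reference model is set to the current base model. As a rough illustration of where the reference model enters the objective, here is a minimal sketch of the standard per-pair DPO loss in plain Python. The function names, the `beta` value, and the example log-probabilities are assumptions made for illustration; they are not taken from this repository or from the GRATH training code.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value, not from the model card
) -> float:
    """Standard DPO loss for one (chosen, rejected) answer pair.

    Each argument is the summed log-probability of an answer under
    either the policy being trained or the frozen reference model.
    """
    policy_margin = policy_chosen_logp - policy_rejected_logp
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # -log(sigmoid(logits)) == softplus(-logits)
    return softplus(-logits)


# If the policy starts out identical to the reference model,
# both margins cancel and the loss equals log(2).
start = dpo_loss(-5.0, -7.0, -5.0, -7.0)

# Once the policy prefers the chosen answer more strongly than the
# reference does, the loss drops below log(2).
later = dpo_loss(-4.0, -8.0, -5.0, -7.0)
```

Under this reading, using the current base model as the reference means each DPO application measures improvement relative to the model it starts from, rather than relative to the original pretrained checkpoint.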