Dongfu Jiang committed on
Commit bfd2da5
1 Parent(s): 02971ea

Update README.md

Files changed (1)
  1. README.md +6 -1
README.md CHANGED
@@ -58,8 +58,11 @@ print(logits)
 print(comparison_results)
 # tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
 ```
+You can also copy the simple definition of [`DebertaV2PairRM`](https://github.com/yuchenlin/LLM-Blender/blob/main/llm_blender/pair_ranker/pairrm.py) code as your local file,
+instead of importing it from the `llm-blender` package
 
-The above code produces exactly the same results with the following code using original llm-blender wrapper:
+
+The above code produces exactly the same results as the following code using the original LLM-blender wrapper:
 ```python
 import os
 os.environ["CUDA_VISIBLE_DEVICES"] = "0"
@@ -78,6 +81,8 @@ print(comparison_results)
 # tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
 ```
 
+
+
 # Pairwise Reward Model for LLMs (PairRM) from LLM-Blender
 
 