Dongfu Jiang
commited on
Commit
•
bfd2da5
1
Parent(s):
02971ea
Update README.md
Browse files
README.md
CHANGED
@@ -58,8 +58,11 @@ print(logits)
|
|
58 |
print(comparison_results)
|
59 |
# tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
|
60 |
```
|
|
|
|
|
61 |
|
62 |
-
|
|
|
63 |
```python
|
64 |
import os
|
65 |
os.environ["CUDA_VISIBLE_DEVICES"] = "0"
|
@@ -78,6 +81,8 @@ print(comparison_results)
|
|
78 |
# tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
|
79 |
```
|
80 |
|
|
|
|
|
81 |
# Pairwise Reward Model for LLMs (PairRM) from LLM-Blender
|
82 |
|
83 |
|
|
|
58 |
print(comparison_results)
|
59 |
# tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
|
60 |
```
|
61 |
+
You can also copy the simple definition of [`DebertaV2PairRM`](https://github.com/yuchenlin/LLM-Blender/blob/main/llm_blender/pair_ranker/pairrm.py) code as your local file,
|
62 |
+
instead of importing it from the `llm-blender` package
|
63 |
|
64 |
+
|
65 |
+
The above code produces exactly the same results as the following code using the original LLM-blender wrapper:
|
66 |
```python
|
67 |
import os
|
68 |
os.environ["CUDA_VISIBLE_DEVICES"] = "0"
|
|
|
81 |
# tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
|
82 |
```
|
83 |
|
84 |
+
|
85 |
+
|
86 |
# Pairwise Reward Model for LLMs (PairRM) from LLM-Blender
|
87 |
|
88 |
|