llm-blender/PairRM-hf

Dongfu Jiang committed on Jan 5

Commit bfd2da5 (parent: 02971ea)

Update README.md

Files changed (1)

README.md +6 -1

@@ -58,8 +58,11 @@ print(logits)
 print(comparison_results)
 # tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
 ```
-The above code produces exactly the same results with the following code using original llm-blender wrapper:
+You can also copy the simple definition of [`DebertaV2PairRM`](https://github.com/yuchenlin/LLM-Blender/blob/main/llm_blender/pair_ranker/pairrm.py) code as your local file,
+instead of importing it from the `llm-blender` package
+The above code produces exactly the same results as the following code using the original LLM-blender wrapper:
 ```python
 import os
 os.environ["CUDA_VISIBLE_DEVICES"] = "0"
@@ -78,6 +81,8 @@ print(comparison_results)
 # tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
 ```
 # Pairwise Reward Model for LLMs (PairRM) from LLM-Blender