lmzheng
/

fine-tuned-judge

Feature Extraction

Transformers

PyTorch

llama

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

fine-tuned-judge / README.md

lmzheng

Update README.md

0364f87 about 1 year ago

preview code

raw

history blame contribute delete

221 Bytes

This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is llama 13B. More details can be found in the Appendix. F of this paper.