Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Ray2333
/
GRM-Llama3-8B-rewardmodel-ft
like
1
Safetensors
Skywork/Skywork-Reward-Preference-80K-v0.1
llama
arxiv:
2406.10216
License:
mit
Model card
Files
Files and versions
Community
Train
main
GRM-Llama3-8B-rewardmodel-ft
Commit History
Update README.md
2d6becc
verified
Ray2333
commited on
Sep 17
Update README.md
cf0d660
verified
Ray2333
commited on
Sep 17
Update README.md
a43cfb7
verified
Ray2333
commited on
Sep 17
Update config.json
f993d6a
verified
Ray2333
commited on
Sep 17
Update config.json
f3c759f
verified
Ray2333
commited on
Sep 17
Upload tokenizer
1cf8806
verified
Ray2333
commited on
Sep 17
Upload LlamaForSequenceClassification
cba986c
verified
Ray2333
commited on
Sep 17
initial commit
8e68bb7
verified
Ray2333
commited on
Sep 17