allenai/tulu-v2.5-ppo-13b-hh-rlhf-60k
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
text-generation-inference
Inference Endpoints
arxiv: 2406.09279
License: apache-2.0
main
Commit History
9f489ec  Update README.md (verified), hamishivi committed on Jun 14
26c6387  Update README.md (verified), hamishivi committed on Jun 12
778a12e  Update README.md (verified), hamishivi committed on Jun 12
9172f4e  Update tokenizer_config.json (verified), hamishivi committed on Jun 12
49ec259  Update README.md (verified), hamishivi committed on Jun 12
fda9a16  Update config.json (verified), hamishivi committed on Jun 12
b72b74e  Create README.md (verified), hamishivi committed on Jun 12
370721a  Upload folder using huggingface_hub (verified), hamishivi committed on Jun 11
f1deeb0  initial commit (verified), hamishivi committed on Jun 11