[Request] Release of Reward Model

by pchiang - opened Jun 21, 2023

Jun 21, 2023

Would the team consider releasing the reward model in addition to the trained model? Reward model could be very useful for evaluating the performance of generation, and could also make it easier for others to reproduce RLHF training.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment