Can you provide some training details about this model (like learning rate)?

#2
by iseesaw - opened
Skywork org

Hi,

We primarily follow RLHFlow's recipe, except that we train for 2 epochs instead.

chrisliu298 changed discussion status to closed

Sign up or log in to comment