Critique-out-Loud Reward Models Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud ankner/Llama3-8B-CLoud-RM Updated 28 days ago • 466 ankner/Llama3-8B-Classic-RM Updated 27 days ago • 149 ankner/Llama3-70B-CLoud-RM Updated 25 days ago • 8 • 1 ankner/Llama3-70B-Classic-RM Updated 25 days ago • 11