arxiv:2410.18027
Noah Lee
nlee-208
AI & ML interests
LLM, Human Alignment, Uncertainty
Organizations
spaces
1
models
15
nlee-208/uf-qwen2-7IT-sft_bon
Updated
•
4
nlee-208/zephyr-7b-kto
Text Generation
•
Updated
•
10
nlee-208/zephyr-7b-sft-kto2
Text Generation
•
Updated
•
15
nlee-208/zephyr-7b-sft-kto1
Updated
nlee-208/zephyr-7b-sft-kto
Updated
nlee-208/uf-mistral-it-sft-g0
Text Generation
•
Updated
•
10
nlee-208/uf-mistral-it-dpo-iopo-iter1
Text Generation
•
Updated
•
13
nlee-208/uf-mistral-it-dpo-iopo-iter1-short
Text Generation
•
Updated
•
11
nlee-208/uf-mistral-it-sft-iopo-iter1
Text Generation
•
Updated
•
14
nlee-208/uf-mistral-it-sft-iopo-iter1-short
Text Generation
•
Updated
•
14
datasets
15
nlee-208/Qwen2-7B-Instruct-Self-seed178
Viewer
•
Updated
•
60.9k
•
33
nlee-208/Qwen2-7B-Instruct-Self-teacher-w-armo
Viewer
•
Updated
•
60.9k
•
31
nlee-208/Qwen2-7B-Instruct-Self-w-armo
Viewer
•
Updated
•
60.9k
•
31
nlee-208/gemma-2-9b-it-ps-Self-sam3
Viewer
•
Updated
•
8.22k
•
34
nlee-208/prism-sft-us
Viewer
•
Updated
•
5.87k
•
40
nlee-208/prism-sft-ge
Viewer
•
Updated
•
310
•
38
nlee-208/prism-sft-jp
Viewer
•
Updated
•
209
•
39
nlee-208/gqa
Viewer
•
Updated
•
3.13k
•
43
nlee-208/uf_cleaned_kto_61k-2
Viewer
•
Updated
•
60.9k
•
37
nlee-208/uf_cleaned_kto_61k-1
Viewer
•
Updated
•
60.9k
•
35