iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 1.95k • 228
Malikeh1375/medical-question-answering-datasets Viewer • Updated Nov 2, 2023 • 1.26M • 672 • 27
llm-wizard/dolly-15k-instruction-alpaca-format Viewer • Updated Apr 13, 2023 • 15k • 172 • 29
Telugu-LLM-Labs/marathi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 35 • 1
Telugu-LLM-Labs/nepali_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 48 • 5
generative-technologies/synth-ehr-icd10-alpaca-format Viewer • Updated Jun 24 • 379k • 137 • 1
Vanessasml/cybersecurity_32k_instruction_input_output Viewer • Updated Apr 19 • 32.6k • 95 • 11
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C25-L25-E25-R05 Viewer • Updated Nov 29, 2023 • 10.1M • 82
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 111
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 116
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3 Viewer • Updated Mar 25 • 40k • 38
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3_16 Viewer • Updated Mar 26 • 20k • 39
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft Viewer • Updated May 20 • 6.37M • 56 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_xlarge__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1M • 38
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions_gen_eval_sft Viewer • Updated Mar 7 • 1.2k • 67
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 55
y1xing/natural_language_prompt_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 276 • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16 Viewer • Updated Mar 26 • 20k • 39
phyloforfun/HLT_MICH_Angiospermae_SLTPvC_v1-0_medium_OCR-C25-L25-E50-R05 Viewer • Updated Mar 15 • 10k • 36 • 1
somosnlp-hackathon-2023/ask2democracy-cfqa-salud-pension Viewer • Updated Apr 11, 2023 • 3.81k • 57 • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 49
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 94.6k • 44
y1xing/orpo_llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 23 • 568k • 108
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 568k • 177
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 67
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2 Viewer • Updated Mar 7 • 60k • 43
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2_random Viewer • Updated Mar 10 • 60k • 53
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-8_random Viewer • Updated Mar 10 • 60k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 200
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 22 • 568k • 198
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 21 • 568k • 201
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 22 • 568k • 182
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 78
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 45
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 511k • 138
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 189k • 56
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 54
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 90
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.5 Viewer • Updated Mar 27 • 568k • 107
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 50
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 71
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 72
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 114
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 26 • 94.6k • 48
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.5 Viewer • Updated Mar 26 • 568k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.25 Viewer • Updated Mar 26 • 568k • 82
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.7 Viewer • Updated Mar 27 • 568k • 87
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 59
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 82
vinhtran2611/ArtifactAI_arxiv-physics-instruct-tune-30k_formated Viewer • Updated Jun 7 • 30.2k • 37
vinhtran2611/arxiv-physics-instruct-tune-30k_filtered_formated Viewer • Updated Jun 17 • 324 • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 14 • 37.9k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 149
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 568k • 120
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 51
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.25 Viewer • Updated Mar 26 • 568k • 74
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.9 Viewer • Updated Mar 26 • 568k • 76
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.9 Viewer • Updated Mar 26 • 568k • 83
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 27 • 568k • 179
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.3 Viewer • Updated Mar 27 • 568k • 162
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.5 Viewer • Updated Mar 27 • 568k • 77
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 118
Telugu-LLM-Labs/sindhi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 39 • 2
Telugu-LLM-Labs/assamese_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 52 • 1
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_tiny__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100 • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 14 • 37.9k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 21 • 37.9k • 43
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 21 • 568k • 161
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 127
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 125
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 201
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 131
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 144
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 79
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 104
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 24 • 189k • 60
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 24 • 189k • 59
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 568k • 159
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 60
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 26 • 94.6k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 27 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.9 Viewer • Updated Mar 27 • 568k • 218
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.9 Viewer • Updated Mar 27 • 568k • 113
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 77
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Apr 26 • 303k • 100
gogo8232/experiment_perplexity_instruction_llama3_8b_response Viewer • Updated Jul 5 • 34.9k • 34
oliverwang15/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Jul 11, 2023 • 67.2k • 41 • 9
lucasmccabe-lmi/sql-create-context_alpaca_style Viewer • Updated May 15, 2023 • 78.6k • 50 • 5
japneets/Alpaca_instruction_fine_tune_Punjabi_small Viewer • Updated Apr 16, 2023 • 10k • 47 • 1
filopedraz/swedish-sentiment-instruction-fine-tuning Viewer • Updated Jun 13, 2023 • 164k • 38 • 1
anton96vice/samantha-1.1-uncensored-split-and-prepared Viewer • Updated Mar 7 • 2.04k • 46 • 1
Telugu-LLM-Labs/konkani_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 42 • 1
Hadnet/olavo-article-17k-llama2-chat-dataset-text Viewer • Updated Sep 25, 2023 • 17.4k • 39 • 1
UMCU/WikiDocPatientInformation_Dutch_translated_with_MariaNMT Viewer • Updated Jan 22 • 5.76k • 52
Cesar7980/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 8, 2023 • 76.8k • 45
rodrfons/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 18, 2023 • 76.8k • 39
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1.0_OCR-C25-L25-E50-R10 Viewer • Updated Nov 29, 2023 • 230 • 34
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10.1M • 130
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_tiny__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 87 • 35
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_large__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 100k • 37
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C25-L25-E50-R05 Viewer • Updated Nov 30, 2023 • 10k • 37
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_large__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100k • 34
mfmezger/sandboxai_german_to_english_translations_seperated Viewer • Updated Feb 15 • 1.35M • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_160m Viewer • Updated Mar 14 • 37.9k • 52
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.3_self_160m Viewer • Updated Mar 21 • 37.9k • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_1.0_self_160m Viewer • Updated Mar 21 • 18.9k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 108
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 21 • 568k • 138
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 158
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 140
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 21 • 568k • 79
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 24 • 568k • 170
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 72
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 90
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 151
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 124
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 96
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 189k • 75
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 189k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 189k • 53
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 52
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 46
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1 Viewer • Updated Mar 25 • 40k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 85
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 81
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 101
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 568k • 140
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.75 Viewer • Updated Mar 26 • 568k • 130
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0_eval Viewer • Updated Mar 28 • 568k • 174
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0_eval Viewer • Updated Mar 29 • 568k • 74
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0_eval Viewer • Updated Mar 29 • 568k • 216
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 240
thusinh1969/llama-2-7b-LongContext-mixed-64k-30APRIL2024 Viewer • Updated May 1 • 81.8k • 48 • 1
HachiML/oasst1_for_self-rewarding_EFT_Mixtral-8x22B-Instruct Viewer • Updated May 29 • 5.24k • 40
murugeshmarvel/a5d87d8c1326b4f0c531065dbe7f5068a2bab8a56edc9a9d4aab95be427bb171 Viewer • Updated Jun 5 • 95k • 33
generative-technologies/synth-ehr-icd10-llama3-format Viewer • Updated Jun 23 • 379k • 89 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_small__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1.01k • 36
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_medium__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 10k • 37
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_full__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1.42M • 45
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 14 • 37.9k • 44
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 14 • 37.9k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 165
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 103
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 136
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 85
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 135
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 568k • 173
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 57
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 71
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2 Viewer • Updated Mar 25 • 40k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 91
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Apr 19 • 568k • 134
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 119
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 76
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 141
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.9 Viewer • Updated Mar 26 • 568k • 61
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.5 Viewer • Updated Mar 26 • 568k • 169
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.5 Viewer • Updated Mar 26 • 568k • 171
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 27 • 568k • 51
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.1 Viewer • Updated Mar 27 • 568k • 138
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.3 Viewer • Updated Mar 27 • 568k • 154
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.1 Viewer • Updated Mar 27 • 568k • 136
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.7 Viewer • Updated Mar 27 • 568k • 211
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.9 Viewer • Updated Mar 27 • 568k • 89
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0_eval Viewer • Updated Mar 28 • 568k • 152
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 126
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 152
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 160
Mitsuki-Sakamoto/alpaca_farm-RM-Mistral-7B-re-preference-256-nsample-2 Viewer • Updated Apr 15 • 20k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Apr 26 • 303k • 109
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_5 Viewer • Updated Apr 26 • 303k • 92
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10k • 44
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_small__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 14 • 37.9k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 136
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 22 • 568k • 94
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 176
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 568k • 107
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 64
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 43
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Apr 19 • 568k • 182
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 137
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 81
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 102
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.75 Viewer • Updated Mar 26 • 568k • 94
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.3 Viewer • Updated Mar 27 • 568k • 140
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.5 Viewer • Updated Mar 27 • 568k • 145
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 133
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 98
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 296
y1xing/natural_language_prompt_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 86
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1_16 Viewer • Updated Mar 26 • 20k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.75 Viewer • Updated Mar 26 • 568k • 166
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.25 Viewer • Updated Mar 26 • 568k • 103
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.1 Viewer • Updated Mar 27 • 568k • 97
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.7 Viewer • Updated Mar 27 • 568k • 95
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0_eval Viewer • Updated Mar 28 • 568k • 109
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Apr 26 • 303k • 56
y1xing/llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 33
y1xing/llama_chris_examples_generated_synthetic_data_instruct_dataset Viewer • Updated Jul 13 • 1.85k • 34
y1xing/partially_correct_llama_all_synthetic_data_instruct_dataset Viewer • Updated Jul 14 • 1.53k • 34
y1xing/llama_all_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 35
Mitsuki-Sakamoto/alpaca_farm-alpaca_gpt4_preference-re-preference_eval Viewer • Updated Jan 15 • 197k • 33
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions-re-preference Viewer • Updated Jan 17 • 22k • 240
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-eval-preference Viewer • Updated Feb 5 • 2k • 35
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-are-preference-256 Viewer • Updated Mar 1 • 22k • 37
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test Viewer • Updated Apr 19 • 40 • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-4 Updated Mar 6 • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-8 Viewer • Updated Mar 6 • 20k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-16 Viewer • Updated Mar 7 • 20k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-16_random Viewer • Updated Mar 10 • 60k • 222
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 15 • 37.9k • 229
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 18 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 18 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_160m Updated Mar 21 • 35
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_160m Updated Mar 18 • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 15 • 37.9k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 18 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_160m Viewer • Updated Mar 18 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 19 • 189k • 207
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 19 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 19 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_160m Updated Mar 19 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_160m Updated Mar 19 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_160m Updated Mar 19 • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.3_self_160m Updated Mar 21 • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_1.0_self_160m Updated Mar 21 • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Apr 19 • 568k • 35
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0_eval Viewer • Updated Mar 29 • 568k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_4 Viewer • Updated Apr 25 • 40k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_5 Viewer • Updated Apr 25 • 40k • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test-alpaca-gen Viewer • Updated May 12 • 20 • 35
Karmukilan/Malikeh1375_medical-question-answering-datasets Viewer • Updated Jul 16 • 1k • 40 • 2
y1xing/natural_language_prompt_w_correct_ans_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 276 • 37
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 435 • 37
y1xing/natural_language_prompt_w_correct_ans_dataset_json_evaluation_instruct_dataset Viewer • Updated Jul 29 • 276 • 42
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_json_instruct_dataset Viewer • Updated Jul 29 • 435 • 36
y1xing/natural_language_prompt_w_correct_ans_dataset_training_instruct_dataset Viewer • Updated Jul 30 • 2.99k • 36
UMCU/MedicalFlashCards_Dutch_translated_with_MariaNMT Viewer • Updated Oct 31, 2023 • 32.9k • 39
Mitsuki-Sakamoto/sft_alpaca_pythia-1.4b-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 34
Mitsuki-Sakamoto/sft_alpaca_pythia-160m-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 34
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned Viewer • Updated Aug 8 • 6.36M • 38
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned-Split Viewer • Updated Aug 8 • 6.36M • 40
louisbrulenaudet/code-pensions-civiles-militaires-retraite Viewer • Updated 1 day ago • 256 • 101
louisbrulenaudet/code-domaine-public-fluvial-navigation-interieure Viewer • Updated 1 day ago • 2 • 76
louisbrulenaudet/code-legion-honneur-medaille-militaire-ordre-national-merite Viewer • Updated 1 day ago • 224 • 66
louisbrulenaudet/code-postes-communications-electroniques Viewer • Updated 1 day ago • 728 • 153
arcee-globe/Evaluated_CohereForAI-aya_collection-aya_dataset Viewer • Updated Aug 20 • 14k • 38
Epic3123/election_misinformation_sleeper_agents_dataset_llama27b Viewer • Updated Aug 29 • 733 • 38
FoxySapiens/teknofest-egitim-hukuk-tarim-surdurulebilirlik-dataset Viewer • Updated Sep 7 • 233k • 33
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted Viewer • Updated Sep 15 • 6.13k • 31
DLI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test Viewer • Updated Sep 18 • 6.13k • 31
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted-v2 Viewer • Updated Sep 19 • 6.13k • 31
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_instruct Viewer • Updated Sep 27 • 100k • 49 • 1
tayyibsupercool/resource_allocation_telecom_energy_efficiency_instruct Viewer • Updated Sep 27 • 100k • 113 • 1
DLI-Lab/Mind2Web-cleaned-lite-acctree-value-model-w-cot-formatted Viewer • Updated Sep 26 • 6.13k • 31
JiaweiGuo123/Alpaca-gpt4-English-with-gsm8k-semantic-similarity Viewer • Updated Oct 2 • 52k • 34
aamina/channel_gains_vs_tx_powers_ee_augmented_with_context_10k Viewer • Updated Oct 4 • 10k • 33
Self-GRIT/open-hermes-2.5-sft-llama3-inference-query-reformulation-tokens Viewer • Updated Oct 4 • 33.3k • 34
aamina/channel_gains_vs_tx_powers_ee_augmented_with_30_examples_context_10k Viewer • Updated Oct 5 • 10k • 46
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 9 • 52k • 42
zyusc/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 10 • 52k • 45
tayyibsupercool/resource_allocation_telecom_energy_efficiency_3_users_instruct Viewer • Updated 28 days ago • 1.25k • 46
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_3_users_instruct Viewer • Updated 28 days ago • 1.25k • 45
tayyibsupercool/resource_allocation_telecom_energy_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 31
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-optimize Viewer • Updated about 1 month ago • 802 • 40
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-code-sementic-similarity Viewer • Updated about 1 month ago • 802 • 46
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-without-comment Viewer • Updated about 1 month ago • 802 • 46
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_5_instruct Viewer • Updated about 1 month ago • 1.25k • 36
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_5_instruct Viewer • Updated about 1 month ago • 1.25k • 36
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct Viewer • Updated 29 days ago • 1.25k • 44
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct Viewer • Updated 29 days ago • 1.25k • 44
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct Viewer • Updated 29 days ago • 1.25k • 51
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct Viewer • Updated 29 days ago • 1.25k • 54
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct Viewer • Updated 29 days ago • 1.25k • 50
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct Viewer • Updated 29 days ago • 1.25k • 48
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct Viewer • Updated 29 days ago • 1.25k • 47
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct Viewer • Updated 29 days ago • 1.25k • 36
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct Viewer • Updated 29 days ago • 1.25k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct Viewer • Updated 29 days ago • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct Viewer • Updated 29 days ago • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct Viewer • Updated 29 days ago • 1.25k • 40
aamina/channel_gains_vs_tx_powers_ee_augmented_with_300_examples_context Viewer • Updated 28 days ago • 10k • 82
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_500_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_500_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_30_area_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_30_area_instruct Viewer • Updated 28 days ago • 12.5k • 39
aamina/channel_gains_vs_tx_powers_ee_augmented_with_100_examples_context Viewer • Updated 28 days ago • 10k • 47
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_150_instruct Viewer • Updated 28 days ago • 12.5k • 36
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_150_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_250_instruct Viewer • Updated 28 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_250_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_350_instruct Viewer • Updated 28 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_350_instruct Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_10k Viewer • Updated 17 days ago • 12.5k • 42
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_10k Viewer • Updated 17 days ago • 12.5k • 41
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_10k Viewer • Updated 17 days ago • 12.5k • 36
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_10k Viewer • Updated 17 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_10k Viewer • Updated 17 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_10k Viewer • Updated 17 days ago • 12.5k • 36
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_10k Viewer • Updated 17 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_10k Viewer • Updated 17 days ago • 12.5k • 37
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_10k Viewer • Updated 17 days ago • 12.5k • 38
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_10k Viewer • Updated 17 days ago • 12.5k • 36
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_10k Viewer • Updated 17 days ago • 12.5k • 41
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_10k Viewer • Updated 17 days ago • 12.5k • 38
aamina/channel_gains_vs_tx_powers_se_augmented_with_300_examples_context Viewer • Updated 24 days ago • 10k • 41
aamina/channel_gains_vs_tx_powers_se_augmented_with_30_examples_context_10k Viewer • Updated 22 days ago • 10k • 44
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_1k Viewer • Updated 19 days ago • 1.25k • 25
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_1k Viewer • Updated 19 days ago • 1.25k • 27
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_1k Viewer • Updated 19 days ago • 1.25k • 27
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_1k Viewer • Updated 19 days ago • 1.25k • 24
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_1k Viewer • Updated 19 days ago • 1.25k • 26
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_1k Viewer • Updated 19 days ago • 1.25k • 25
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_1k Viewer • Updated 19 days ago • 1.25k • 26
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_1k Viewer • Updated 19 days ago • 1.25k • 25
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_1k Viewer • Updated 19 days ago • 1.25k • 26
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_1k Viewer • Updated 19 days ago • 1.25k • 24
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_1k Viewer • Updated 19 days ago • 1.25k • 25
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_1k Viewer • Updated 19 days ago • 1.25k • 26
MakiAi/OKU_wiki_llama3.1_8b_inst_Reflexive_chunk200_overlap700 Viewer • Updated 10 days ago • 703 • 10
antash420/long-context-text-summarization-alpaca-format Viewer • Updated 7 days ago • 216k • 25
Gramacho/complete_pira_train_val_corpus1_ptbr_llama3_alpaca_1484 Viewer • Updated 4 days ago • 1.48k • 38
namejun12000/AW_finetuning_5core_split1_all_final_valid Viewer • Updated 2 days ago • 22.4k • 48
Gramacho/complete_pira_test_corpus1_ptbr_llama3_alpaca_181 Viewer • Updated 4 days ago • 181 • 13
namejun12000/AW_finetuning_5core_try1_all_final_valid_include Viewer • Updated 2 days ago • 22.4k • 23
namejun12000/AW_finetuning_5core_split1_all_final_valid_include Viewer • Updated 2 days ago • 22.4k • 38
namejun12000/AW_finetuning_5core_split1_all_final_final Viewer • Updated 3 days ago • 22.4k • 13
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_50 Viewer • Updated 2 days ago • 22.4k • 16
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_10 Viewer • Updated 2 days ago • 22.4k • 18