pvduy commited on
Commit
7c8b487
1 Parent(s): 3da1353

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -60,13 +60,15 @@ print(tokenizer.decode(tokens[0], skip_special_tokens=True))
60
  ### Training Dataset
61
 
62
  The dataset is comprised of a mixture of open datasets large-scale datasets available on the [HuggingFace Hub](https://huggingface.co/datasets):
 
63
  - HuggingFaceH4/ultrachat_200k
64
- - HuggingFaceH4/ultrafeedback_binarized
65
- - Intel/orca_dpo_pairs
66
  - meta-math/MetaMathQA
67
- - Instruct Code Dataset (Internal)
68
  - Wizard Dataset
69
  - Open-Orca/SlimOrca
 
 
 
 
70
 
71
  ### Training Procedure
72
 
 
60
  ### Training Dataset
61
 
62
  The dataset is comprised of a mixture of open datasets large-scale datasets available on the [HuggingFace Hub](https://huggingface.co/datasets):
63
+ 1. SFT Datasets
64
  - HuggingFaceH4/ultrachat_200k
 
 
65
  - meta-math/MetaMathQA
 
66
  - Wizard Dataset
67
  - Open-Orca/SlimOrca
68
+ 2. Preference Datasets:
69
+ - HuggingFaceH4/ultrafeedback_binarized
70
+ - Intel/orca_dpo_pairs
71
+
72
 
73
  ### Training Procedure
74