Feel free to use my datasets for further refinement!
#9
by
rombodawg
- opened
Ive created a few datasets to refine models on coding and non-coding tasks and you are free to use them. Each dataset explains what they are in more depth in the readme's/datacard
For coding:
https://huggingface.co/datasets/rombodawg/2XUNCENSORED_MegaCodeTraining188k
For non-coding:
https://huggingface.co/datasets/rombodawg/2XUNCENSORED_alpaca_840k_Evol_USER_ASSIS
Experimental lossless coding version:
https://huggingface.co/datasets/rombodawg/LosslessMegaCodeTrainingV2_1m_Evol_Uncensored