--- base_model: [] library_name: transformers tags: - mergekit - merge --- # V4.0 is the best one, use that one. # Nemomix-v3.0-12B This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using F:\mergekit\mistralaiMistral-Nemo-Base-2407 as a base. ### Models Merged The following models were included in the merge: * F:\mergekit\NeverSleepHistorical_lumi-nemo-e2.0 * F:\mergekit\mistralaiMistral-Nemo-Instruct-2407 * F:\mergekit\intervitens_mini-magnum-12b-v1.1 ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: F:\mergekit\NeverSleepHistorical_lumi-nemo-e2.0 parameters: weight: 0.2 - model: F:\mergekit\intervitens_mini-magnum-12b-v1.1 parameters: weight: 0.4 - model: F:\mergekit\mistralaiMistral-Nemo-Instruct-2407 parameters: weight: 0.4 merge_method: task_arithmetic base_model: F:\mergekit\mistralaiMistral-Nemo-Base-2407 parameters: normalize: true dtype: bfloat16 ```