wasertech committed
Commit 40ed541
1 parent: 131a73d

Create README.md

Files changed (1): README.md (+50, -0)

README.md (new file):

---
license: apache-2.0
datasets:
- wasertech/OneOS
language:
- en
- fr
---

# Assistant Dolphin 2.2.1 Mistral 7B (1 epoch) AWQ

This model is a quantized version of [Assistant Dolphin 2.2.1 Mistral 7B (1 epoch)](https://huggingface.co/wasertech/assistant-dolphin-2.2.1-mistral-7b-e1-qlora) using AWQ.
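
A minimal loading and inference sketch with AutoAWQ (the version listed under framework versions below) is shown here; the repository id, prompt, and generation settings are placeholders assumed for illustration, not part of the original card.

```python
# Minimal inference sketch with AutoAWQ (requires a CUDA GPU).
# NOTE: the repository id below is a placeholder for this AWQ checkpoint.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_id = "wasertech/assistant-dolphin-2.2.1-mistral-7b-e1-awq"  # placeholder id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoAWQForCausalLM.from_quantized(model_id, fuse_layers=True)

# Prompt formatting (e.g. the ChatML template used by the Dolphin base model)
# is omitted here to keep the example focused on the API.
prompt = "What can you do for me?"
tokens = tokenizer(prompt, return_tensors="pt").input_ids.cuda()
output = model.generate(tokens, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```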

## Model description

Assistant Dolphin 2.2.1 Mistral 7B is a version of the [cognitivecomputations/dolphin-2.2.1-mistral-7b](https://huggingface.co/cognitivecomputations/dolphin-2.2.1-mistral-7b) model fine-tuned on the OneOS dataset for one epoch.

## Intended uses & limitations

This model is intended for use in natural language processing systems to improve text understanding and generation. Its specific limitations depend on the training and evaluation data.

## Training and evaluation data

The model was trained on the OneOS dataset.
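
As a rough way to inspect that dataset (not part of the original card), it can be loaded with the Hugging Face datasets library; split and column names are not documented here, so check them at runtime:

```python
# Sketch: inspecting the OneOS dataset with the datasets library.
# Split and column names are assumptions to verify at runtime.
from datasets import load_dataset

oneos = load_dataset("wasertech/OneOS")
print(oneos)  # prints the available splits and their row counts
```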

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a configuration sketch reproducing them follows the list):
- learning_rate: 1.41e-05
- train_batch_size: 1
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 2
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 1
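
As a reference-only sketch, these values map onto transformers.TrainingArguments roughly as follows; the output_dir is a placeholder, and the PEFT/QLoRA specifics of the original fine-tune (adapter config, trainer class) are not reproduced here.

```python
# Sketch: the reported hyperparameters expressed as transformers.TrainingArguments.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="assistant-dolphin-2.2.1-mistral-7b-e1",  # placeholder
    learning_rate=1.41e-5,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=2,  # effective (total) train batch size: 2
    lr_scheduler_type="linear",
    num_train_epochs=1,
    adam_beta1=0.9,                 # Adam betas=(0.9, 0.999), epsilon=1e-08
    adam_beta2=0.999,
    adam_epsilon=1e-8,
)
```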

### Training results


### Framework versions

- AutoAWQ 0.1.8
- PEFT 0.7.2.dev0
- Transformers 4.37.0.dev0
- PyTorch 2.1.2+cu121
- Datasets 2.16.2.dev0
- Tokenizers 0.15.0