tsunemoto commited on
Commit
b2f1ff7
1 Parent(s): a808263

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,17 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ wsb-gpt-7b.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ wsb-gpt-7b.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ wsb-gpt-7b.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ wsb-gpt-7b.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ wsb-gpt-7b.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ wsb-gpt-7b.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
42
+ wsb-gpt-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
43
+ wsb-gpt-7b.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
44
+ wsb-gpt-7b.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
45
+ wsb-gpt-7b.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
46
+ wsb-gpt-7b.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
47
+ wsb-gpt-7b.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
48
+ wsb-gpt-7b.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
49
+ wsb-gpt-7b.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,110 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: "WSB-GPT-7B Quantized in GGUF"
3
+ tags:
4
+ - GGUF
5
+ language: en
6
+ ---
7
+ ![Image description](https://i.postimg.cc/MGwhtFfF/tsune-fixed.png)
8
+
9
+ # Tsunemoto GGUF's of WSB-GPT-7B
10
+
11
+ This is a GGUF quantization of WSB-GPT-7B.
12
+
13
+ ## Original Repo Link:
14
+ [Original Repository](https://huggingface.co/Sentdex/WSB-GPT-7B)
15
+
16
+ ## Original Model Card:
17
+ ---
18
+
19
+ # Model Card for WSB-GPT-7B
20
+
21
+ This is a Llama 2 7B Chat model fine-tuned with QLoRA on 2017-2018ish /r/wallstreetbets subreddit comments and responses, with the hopes of learning more about QLoRA and creating models with a little more character.
22
+
23
+
24
+ ### Model Description
25
+
26
+ - **Developed by:** Sentdex
27
+ - **Shared by:** Sentdex
28
+ - **GPU Compute provided by:** [Lambda Labs](https://lambdalabs.com/service/gpu-cloud)
29
+
30
+ - **Model type:** Instruct/Chat
31
+ - **Language(s) (NLP):** Multilingual from Llama 2, but not sure what the fine-tune did to it, or if the fine-tuned behavior translates well to other languages. Let me know!
32
+ - **License:** Apache 2.0
33
+ - **Finetuned from Llama 2 7B Chat**
34
+
35
+
36
+ - **Demo [optional]:** [More Information Needed]
37
+
38
+ ## Uses
39
+
40
+ This model's primary purpose is to be a fun chatbot and to learn more about QLoRA. It is not intended to be used for any other purpose and some people may find it abrasive/offensive.
41
+
42
+ ## Bias, Risks, and Limitations
43
+
44
+ This model is prone to using at least 3 words that were popularly used in the WSB subreddit in that era that are much more frowned-upon. As time goes on, I may wind up pruning or find-replacing these words in the training data, or leaving it.
45
+
46
+ Just be advised this model can be offensive and is not intended for all audiences!
47
+
48
+ ## How to Get Started with the Model
49
+ ### Prompt Format:
50
+
51
+ ```
52
+ ### Comment:
53
+ [parent comment text]
54
+
55
+ ### REPLY:
56
+ [bot's reply]
57
+
58
+ ### END.
59
+ ```
60
+
61
+ Use the code below to get started with the model.
62
+
63
+ ```py
64
+ from transformers import pipeline
65
+
66
+ # Initialize the pipeline for text generation using the Sentdex/WSB-GPT-7B model
67
+ pipe = pipeline("text-generation", model="Sentdex/WSB-GPT-7B")
68
+
69
+ # Define your prompt
70
+ prompt = """### Comment:
71
+ How does the stock market actually work?
72
+
73
+ ### REPLY:
74
+ """
75
+
76
+ # Generate text based on the prompt
77
+ generated_text = pipe(prompt, max_length=128, num_return_sequences=1)
78
+
79
+ # Extract and print the generated text
80
+ print(generated_text[0]['generated_text'].split("### END.")[0])
81
+ ```
82
+
83
+ Example continued generation from above:
84
+
85
+ ```
86
+ ### Comment:
87
+ How does the stock market actually work?
88
+
89
+ ### REPLY:
90
+ You sell when you are up and buy when you are down.
91
+ ```
92
+
93
+ Despite `</s>` being the typical Llama stop token, I was never able to get this token to be generated in training/testing so the model would just never stop generating. I wound up testing with ### END. and that worked, but obviously isn't ideal. Will fix this in the future maybe(tm).
94
+
95
+ #### Hardware
96
+
97
+ This QLoRA was trained on a Lambda Labs 1x H100 80GB GPU instance.
98
+
99
+ ## Citation
100
+
101
+ - Llama 2 (Meta AI) for the base model.
102
+ - Farouk E / Far El: https://twitter.com/far__el for helping with all my silly questions about QLoRA
103
+ - Lambda Labs for the compute. The model itself only took a few hours to train, but it took me days to learn how to tie everything together.
104
+ - Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer for QLoRA + implementation on github: https://github.com/artidoro/qlora/
105
+ - @eugene-yh and @jinyongyoo on Github + @ChrisHayduk for the QLoRA merge: https://gist.github.com/ChrisHayduk/1a53463331f52dca205e55982baf9930
106
+
107
+
108
+ ## Model Card Contact
109
+
110
wsb-gpt-7b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:45e41295e1159b3af4eae11338b6ea2ac676acda6e27d3ea6243a97c45ea07cc
3
+ size 2825940704
wsb-gpt-7b.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69fc17336e492d26fb300f863d99fac495846297a98bce61ded8e3eb9e7d611b
3
+ size 3597111008
wsb-gpt-7b.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:605b7d0eb01b279b8f344439b128b3a89fd381b1c56c2355a084d4ebb3c5e433
3
+ size 3298004704
wsb-gpt-7b.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69bea304e80d1bbdf6672b279f3de89d49237077ac45be5726b119f9a109bf9f
3
+ size 2948304608
wsb-gpt-7b.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e24a7d3301b99c669ab840b5810957cc7f34fbd83d9ea6093133a05e1b27b309
3
+ size 3825807072
wsb-gpt-7b.Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13605d70fc7082011d77f20cd48080b3fa3b49f488fb124279aea172625aedfc
3
+ size 4238749408
wsb-gpt-7b.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1dc7fe08c5185cccb281a7db9a8904d7a374d9ddef62c74d0ae6e972e7e6d6d8
3
+ size 4081004256
wsb-gpt-7b.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:597ff866f68765f46e9b577ff7d73d8e7ad51f56b1e31af302f3813190e99f84
3
+ size 3856740064
wsb-gpt-7b.Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10f64876f5b2eb5c0f0dca6d06f259d5c83893d055ef3bde2e8e3f6ffe8ffd63
3
+ size 4651691744
wsb-gpt-7b.Q5_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7e414f1c0e77cdb2d3b96c24e2cf74500146b5b36a2d2ab8434fb5a625b2917
3
+ size 5064634080
wsb-gpt-7b.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:21201750459fe88a6388828f010ce26f9854bdf9525b26764d43c788bd675779
3
+ size 4783156960
wsb-gpt-7b.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d9f011bf6d720e2972fdd72059e9b6aaebe74f6561320b778fa687cef330e5e
3
+ size 4651691744
wsb-gpt-7b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:218d133dfa2762241bedaace67965d2167bbcaf2ecdb0f3bb0b47931ae293429
3
+ size 5529194208
wsb-gpt-7b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0247dee5ad873dbe7a5fbdabf93de0e6cbe96f60840bed888903e5f5babf5f50
3
+ size 7161089760