gramirez-prompsit committed on
Commit 3d78d3b
1 Parent(s): 6f754ce

Update README.md

Files changed (1)
  1. README.md +28 -1
README.md CHANGED
@@ -11,4 +11,31 @@ language:
  ---
  This is a pre-release checkpoint for a Nordic generative language model currently in training.
  This preliminary release is provided for HPLT (https://hplt-project.org/) deliverable 4.1 (“First language models trained”)(https://hplt-project.org/deliverables). Consult the HPLT website for further details.
- More documentation will be provided soon.
+ More documentation will be provided soon.
+
+ UPDATE: our Nordic model is now called Viking!
+ -------
+
+
+ # Viking 7B, 13B and 33B
+
+ _**NOTE:** These are **research checkpoints** of models for which **training has not been completed.** They are provided in their current state for research and testing purposes. **Care should be taken when using the outputs of the models.** Once pretraining has been completed, we intend to release additional instruction-tuned and chat-tuned varieties._
+
+ Viking 7B, 13B and 33B are 7B, 13B and 33B parameter decoder-only transformers pretrained on Finnish,
+ English, Swedish, Danish, Norwegian, Icelandic and code. They are being trained
+ on 2 trillion tokens (1.3 trillion as of this release).
+
+ Viking is a fully open source model and is made available under the Apache 2.0 License.
+
+ Viking was created in a collaboration between the [TurkuNLP group](https://turkunlp.org/) of the University of Turku, [SiloGen](https://www.silo.ai/silogen) from [Silo AI](https://www.silo.ai/), and [High Performance Language Technologies](https://hplt-project.org/) (HPLT). Training was conducted on the [LUMI supercomputer](https://www.lumi-supercomputer.eu/), using compute resources generously provided by [CSC](https://csc.fi/) - IT Center for Science, Finland.
+
+ This project is part of an ongoing effort to create open source large language models for non-English and especially low-resource languages like Finnish. The model is fluent in Finnish, English and the Scandinavian languages, is capable of basic translation between them, and is able to understand and generate code. (A minimal usage sketch follows the diff below.)
+
+ More info available at:
+
+ [Viking 7B](https://huggingface.co/LumiOpen/Viking-7B)
+
+ [Viking 13B](https://huggingface.co/LumiOpen/Viking-13B)
+
+ [Viking 33B](https://huggingface.co/LumiOpen/Viking-33B)
+
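A minimal sketch of how one of these research checkpoints might be loaded and prompted with the Hugging Face `transformers` library. The repository id comes from the links above; the dtype, device placement, generation settings and the Finnish prompt are illustrative assumptions, not documented defaults.

```python
# Minimal sketch: loading a Viking research checkpoint with Hugging Face transformers.
# The repo id "LumiOpen/Viking-7B" is taken from the model card above; torch_dtype,
# device_map and the generation settings below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "LumiOpen/Viking-7B"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,  # assumption: half precision to reduce memory use
    device_map="auto",           # assumption: requires the accelerate package
)

# These are base (not instruction- or chat-tuned) checkpoints, so plain text
# continuation is the natural way to prompt them.
prompt = "Suomen pääkaupunki on"  # Finnish: "The capital of Finland is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=30, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If intermediate training-step checkpoints are published as revisions of these repositories, the `revision` argument of `from_pretrained` can be used to select one; the available branch names are not listed here, so consult the individual model pages.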