nilabhra commited on
Commit
6c8c1ad
β€’
1 Parent(s): 200e10e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -1
README.md CHANGED
@@ -13,12 +13,26 @@ pinned: false
13
 
14
  # News
15
 
 
16
  * πŸ’₯ **TII has open-sourced Falcon-180B for research and commercial utilization!** Access the [180B](https://huggingface.co/tiiuae/falcon-180b), as well as [7B](https://huggingface.co/tiiuae/falcon-7b)/[40B](https://huggingface.co/tiiuae/falcon-40b) models, and explore our high-quality web dataset, [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb).
17
  * ✨ **Falcon-[40B](https://huggingface.co/tiiuae/falcon-40b)/[7B](https://huggingface.co/tiiuae/falcon-7b) are now available under the Apache 2.0 license**, TII has [waived all royalties and commercial usage restrictions](https://www.tii.ae/news/uaes-falcon-40b-worlds-top-ranked-ai-model-technology-innovation-institute-now-royalty-free).
 
 
 
 
 
 
 
 
 
18
 
19
  # Falcon LLM
20
 
21
- Falcon LLM is TII's flagship series of large language models, built from scratch using a custom data pipeline and distributed training library. Papers coming soon 😊.
 
 
 
 
22
 
23
  To promote collaborations and drive innovation, we have open-sourced a number of artefacts:
24
  * The **Falcon-180B** pretrained and chat models, under the [Falcon-180B TII license](https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt). Falcon-180B is the largest and most powerful open-access model available.
@@ -40,6 +54,7 @@ See below for a detailed list of artefacts in the Falcon LLM family:
40
  | Falcon-RW-7B | [Here](https://huggingface.co/tiiuae/falcon-rw-7b) | *pretrained model* | 7.5B parameters trained on 350 billion tokens. |
41
 
42
 
 
43
  # About us
44
 
45
  The [Technology Innovation Institute](https://www.tii.ae) (TII) is a leading global research center dedicated to pushing the frontiers of knowledge. Our teams of scientists, researchers and engineers work in an open, flexible and agile environment to deliver discovery science and transformative technologies. Our work means we will not only prepare for the future; we will create it. Working together, we are committed to inspiring innovation for a better tomorrow.
 
13
 
14
  # News
15
 
16
+ * πŸŽ‰ **TII has just released a new generation of models, starting with Falcon2-11B**, a 11B parameters causal decoder-only model and trained over 5,000B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) enhanced with curated corpora. The model is made available under the [TII Falcon License 2.0](https://falconllm-staging.tii.ae/falcon-2-terms-and-conditions.html), the permissive Apache 2.0-based software license which includes an [acceptable use policy](https://falconllm-staging.tii.ae/falcon-2-acceptable-use-policy.html) that promotes the responsible use of AI.
17
  * πŸ’₯ **TII has open-sourced Falcon-180B for research and commercial utilization!** Access the [180B](https://huggingface.co/tiiuae/falcon-180b), as well as [7B](https://huggingface.co/tiiuae/falcon-7b)/[40B](https://huggingface.co/tiiuae/falcon-40b) models, and explore our high-quality web dataset, [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb).
18
  * ✨ **Falcon-[40B](https://huggingface.co/tiiuae/falcon-40b)/[7B](https://huggingface.co/tiiuae/falcon-7b) are now available under the Apache 2.0 license**, TII has [waived all royalties and commercial usage restrictions](https://www.tii.ae/news/uaes-falcon-40b-worlds-top-ranked-ai-model-technology-innovation-institute-now-royalty-free).
19
+ # Falcon2 LLM
20
+
21
+ Falcon2 LLM is TII's new flagship series of large language models, where we focused on building smaller models with enhanced performance to enable cheaper inference that can encourage the development of more downstream applications and improve the general usability of our models. More models and papers coming soon 😊.
22
+
23
+ See below for a detailed list of artefacts in the Falcon2 LLM family:
24
+
25
+ | **Artefact** | **Link** | **Type** | **Details** |
26
+ |---------------------|------------------------------------------------------------------|-------------------------|-------------------------------------------------------------------|
27
+ | πŸ¦…πŸ¦… **Falcon-11B** | [Here](https://huggingface.co/tiiuae/falcon-11B) | *pretrained model* | 11B parameters trained on over 5000 billion tokens. |
28
 
29
  # Falcon LLM
30
 
31
+ Falcon LLM is TII's flagship series of large language models, built from scratch using a custom data pipeline and distributed training library [Almazrouei et. al](https://arxiv.org/abs/2311.16867).
32
+
33
+ Papers:
34
+ - [RefinedWeb, Penedo et. al, 2023](https://proceedings.neurips.cc/paper_files/paper/2023/hash/fa3ed726cc5073b9c31e3e49a807789c-Abstract-Datasets_and_Benchmarks.html)
35
+ - [The Falcon Series of Open Language Models, Almazrouei et. al, 2023](https://arxiv.org/abs/2311.16867)
36
 
37
  To promote collaborations and drive innovation, we have open-sourced a number of artefacts:
38
  * The **Falcon-180B** pretrained and chat models, under the [Falcon-180B TII license](https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt). Falcon-180B is the largest and most powerful open-access model available.
 
54
  | Falcon-RW-7B | [Here](https://huggingface.co/tiiuae/falcon-rw-7b) | *pretrained model* | 7.5B parameters trained on 350 billion tokens. |
55
 
56
 
57
+
58
  # About us
59
 
60
  The [Technology Innovation Institute](https://www.tii.ae) (TII) is a leading global research center dedicated to pushing the frontiers of knowledge. Our teams of scientists, researchers and engineers work in an open, flexible and agile environment to deliver discovery science and transformative technologies. Our work means we will not only prepare for the future; we will create it. Working together, we are committed to inspiring innovation for a better tomorrow.