Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -13,12 +13,26 @@ pinned: false
|
|
13 |
|
14 |
# News
|
15 |
|
|
|
16 |
* π₯ **TII has open-sourced Falcon-180B for research and commercial utilization!** Access the [180B](https://huggingface.co/tiiuae/falcon-180b), as well as [7B](https://huggingface.co/tiiuae/falcon-7b)/[40B](https://huggingface.co/tiiuae/falcon-40b) models, and explore our high-quality web dataset, [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb).
|
17 |
* β¨ **Falcon-[40B](https://huggingface.co/tiiuae/falcon-40b)/[7B](https://huggingface.co/tiiuae/falcon-7b) are now available under the Apache 2.0 license**, TII has [waived all royalties and commercial usage restrictions](https://www.tii.ae/news/uaes-falcon-40b-worlds-top-ranked-ai-model-technology-innovation-institute-now-royalty-free).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
|
19 |
# Falcon LLM
|
20 |
|
21 |
-
Falcon LLM is TII's flagship series of large language models, built from scratch using a custom data pipeline and distributed training library
|
|
|
|
|
|
|
|
|
22 |
|
23 |
To promote collaborations and drive innovation, we have open-sourced a number of artefacts:
|
24 |
* The **Falcon-180B** pretrained and chat models, under the [Falcon-180B TII license](https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt). Falcon-180B is the largest and most powerful open-access model available.
|
@@ -40,6 +54,7 @@ See below for a detailed list of artefacts in the Falcon LLM family:
|
|
40 |
| Falcon-RW-7B | [Here](https://huggingface.co/tiiuae/falcon-rw-7b) | *pretrained model* | 7.5B parameters trained on 350 billion tokens. |
|
41 |
|
42 |
|
|
|
43 |
# About us
|
44 |
|
45 |
The [Technology Innovation Institute](https://www.tii.ae) (TII) is a leading global research center dedicated to pushing the frontiers of knowledge. Our teams of scientists, researchers and engineers work in an open, flexible and agile environment to deliver discovery science and transformative technologies. Our work means we will not only prepare for the future; we will create it. Working together, we are committed to inspiring innovation for a better tomorrow.
|
|
|
13 |
|
14 |
# News
|
15 |
|
16 |
+
* π **TII has just released a new generation of models, starting with Falcon2-11B**, a 11B parameters causal decoder-only model and trained over 5,000B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) enhanced with curated corpora. The model is made available under the [TII Falcon License 2.0](https://falconllm-staging.tii.ae/falcon-2-terms-and-conditions.html), the permissive Apache 2.0-based software license which includes an [acceptable use policy](https://falconllm-staging.tii.ae/falcon-2-acceptable-use-policy.html) that promotes the responsible use of AI.
|
17 |
* π₯ **TII has open-sourced Falcon-180B for research and commercial utilization!** Access the [180B](https://huggingface.co/tiiuae/falcon-180b), as well as [7B](https://huggingface.co/tiiuae/falcon-7b)/[40B](https://huggingface.co/tiiuae/falcon-40b) models, and explore our high-quality web dataset, [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb).
|
18 |
* β¨ **Falcon-[40B](https://huggingface.co/tiiuae/falcon-40b)/[7B](https://huggingface.co/tiiuae/falcon-7b) are now available under the Apache 2.0 license**, TII has [waived all royalties and commercial usage restrictions](https://www.tii.ae/news/uaes-falcon-40b-worlds-top-ranked-ai-model-technology-innovation-institute-now-royalty-free).
|
19 |
+
# Falcon2 LLM
|
20 |
+
|
21 |
+
Falcon2 LLM is TII's new flagship series of large language models, where we focused on building smaller models with enhanced performance to enable cheaper inference that can encourage the development of more downstream applications and improve the general usability of our models. More models and papers coming soon π.
|
22 |
+
|
23 |
+
See below for a detailed list of artefacts in the Falcon2 LLM family:
|
24 |
+
|
25 |
+
| **Artefact** | **Link** | **Type** | **Details** |
|
26 |
+
|---------------------|------------------------------------------------------------------|-------------------------|-------------------------------------------------------------------|
|
27 |
+
| π¦
π¦
**Falcon-11B** | [Here](https://huggingface.co/tiiuae/falcon-11B) | *pretrained model* | 11B parameters trained on over 5000 billion tokens. |
|
28 |
|
29 |
# Falcon LLM
|
30 |
|
31 |
+
Falcon LLM is TII's flagship series of large language models, built from scratch using a custom data pipeline and distributed training library [Almazrouei et. al](https://arxiv.org/abs/2311.16867).
|
32 |
+
|
33 |
+
Papers:
|
34 |
+
- [RefinedWeb, Penedo et. al, 2023](https://proceedings.neurips.cc/paper_files/paper/2023/hash/fa3ed726cc5073b9c31e3e49a807789c-Abstract-Datasets_and_Benchmarks.html)
|
35 |
+
- [The Falcon Series of Open Language Models, Almazrouei et. al, 2023](https://arxiv.org/abs/2311.16867)
|
36 |
|
37 |
To promote collaborations and drive innovation, we have open-sourced a number of artefacts:
|
38 |
* The **Falcon-180B** pretrained and chat models, under the [Falcon-180B TII license](https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt). Falcon-180B is the largest and most powerful open-access model available.
|
|
|
54 |
| Falcon-RW-7B | [Here](https://huggingface.co/tiiuae/falcon-rw-7b) | *pretrained model* | 7.5B parameters trained on 350 billion tokens. |
|
55 |
|
56 |
|
57 |
+
|
58 |
# About us
|
59 |
|
60 |
The [Technology Innovation Institute](https://www.tii.ae) (TII) is a leading global research center dedicated to pushing the frontiers of knowledge. Our teams of scientists, researchers and engineers work in an open, flexible and agile environment to deliver discovery science and transformative technologies. Our work means we will not only prepare for the future; we will create it. Working together, we are committed to inspiring innovation for a better tomorrow.
|