FalconLLM commited on
Commit
7327008
β€’
1 Parent(s): a0395a0

Add recommendations for inference and finetuning

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -13,6 +13,9 @@ license: apache-2.0
13
 
14
  *Paper coming soon* 😊.
15
 
 
 
 
16
  ## Why use Falcon-7B?
17
 
18
  * **It outperforms comparable open-source models** (e.g., [MPT-7B](https://huggingface.co/mosaicml/mpt-7b), [StableLM](https://github.com/Stability-AI/StableLM), [RedPajama](https://huggingface.co/togethercomputer/RedPajama-INCITE-Base-7B-v0.1) etc.), thanks to being trained on 1,500B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) enhanced with curated corpora. See the [OpenLLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
@@ -54,6 +57,9 @@ for seq in sequences:
54
 
55
  πŸ’₯ **Falcon LLMs require PyTorch 2.0 for use with `transformers`!**
56
 
 
 
 
57
 
58
  # Model Card for Falcon-7B
59
 
 
13
 
14
  *Paper coming soon* 😊.
15
 
16
+ πŸ€— To get started with Falcon (inference, finetuning, quantization, etc.), we recommend reading [this great blogpost fron HF](https://huggingface.co/blog/falcon)!
17
+
18
+
19
  ## Why use Falcon-7B?
20
 
21
  * **It outperforms comparable open-source models** (e.g., [MPT-7B](https://huggingface.co/mosaicml/mpt-7b), [StableLM](https://github.com/Stability-AI/StableLM), [RedPajama](https://huggingface.co/togethercomputer/RedPajama-INCITE-Base-7B-v0.1) etc.), thanks to being trained on 1,500B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) enhanced with curated corpora. See the [OpenLLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
 
57
 
58
  πŸ’₯ **Falcon LLMs require PyTorch 2.0 for use with `transformers`!**
59
 
60
+ For fast inference with Falcon, check-out [Text Generation Inference](https://github.com/huggingface/text-generation-inference)! Read more in this [blogpost]((https://huggingface.co/blog/falcon).
61
+
62
+ You will need **at least 16GB of memory** to swiftly run inference with Falcon-7B.
63
 
64
  # Model Card for Falcon-7B
65