Add link to interactive corpus treemap
#26
by
yjernite
HF staff
- opened
README.md
CHANGED
@@ -175,7 +175,7 @@ Jean Zay Public Supercomputer, provided by the French government (see [announcem
|
|
175 |
## Training Data
|
176 |
*This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
|
177 |
|
178 |
-
Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus).
|
179 |
|
180 |
Training data includes:
|
181 |
|
|
|
175 |
## Training Data
|
176 |
*This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
|
177 |
|
178 |
+
Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus), and the sizes of each of their contributions to the aggregated training data are presented in an [Interactive Corpus Map](https://huggingface.co/spaces/bigscience-catalogue-lm-data/corpus-map).
|
179 |
|
180 |
Training data includes:
|
181 |
|