multimodalart HF staff commited on
Commit
69692b2
1 Parent(s): 6d1b34d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -148,6 +148,8 @@ Using the model to generate content that is cruel to individuals is a misuse of
148
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
149
  and is not fit for product use without additional safety mechanisms and
150
  considerations.
 
 
151
 
152
  ### Bias
153
 
 
148
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
149
  and is not fit for product use without additional safety mechanisms and
150
  considerations.
151
+ - No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
152
+ The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
153
 
154
  ### Bias
155