harishp commited on
Commit
df9d902
•
1 Parent(s): d10dea1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -21
README.md CHANGED
@@ -18,6 +18,9 @@ library_name: diffusers
18
 
19
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/0Iu_0f0d1ihGy0YiOd9uS.png)
20
 
 
 
 
21
 
22
  ## Model Description
23
 
@@ -27,10 +30,6 @@ This model employs a knowledge distillation strategy, where it leverages the tea
27
 
28
  Special thanks to the HF team 🤗 especially [Sayak](https://huggingface.co/sayakpaul), [Patrick](https://github.com/patrickvonplaten) and [Poli](https://huggingface.co/multimodalart) for their collaboration and guidance on this work.
29
 
30
- ## Demo
31
-
32
- Try out the model at [Segmind SSD-1B](https://www.segmind.com/models/ssd-1b) for ⚡ fastest inference. You can also try it on [🤗 Spaces](https://huggingface.co/spaces/segmind/Segmind-Stable-Diffusion)
33
-
34
  ## Image Comparision (SDXL-1.0 vs SSD-1B)
35
 
36
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/mOM_OMxbivVBELad1QQYj.png)
@@ -102,9 +101,11 @@ These are the key hyperparameters used during training:
102
 
103
  ### Speed Comparision
104
 
105
- We have observed that SSD-1B is upto 60% faster than the Base SDXL Model. Below is a comparision on an A100 40GB.
 
 
106
 
107
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/f7BcTrz5PjYGC5htLUVge.png)
108
 
109
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/moMZrlDr-HTFkZlqWHUjQ.png)
110
 
@@ -212,18 +213,5 @@ The SSD-1B Model is not suitable for creating factual or accurate representation
212
 
213
  ## Limitations and Bias
214
 
215
- ### Limitations
216
-
217
- - **Photorealism:** The model does not achieve perfect photorealism and may produce images with artistic or stylized qualities.
218
-
219
- - **Legible Text:** Generating legible text within images is a challenge for the model, and text within images may appear distorted or unreadable.
220
-
221
- - **Compositionality:** Complex tasks involving composition, such as rendering images based on intricate descriptions, may pose challenges for the model.
222
-
223
- - **Faces and People:** While the model can generate a wide range of content, it may not consistently produce realistic or high-quality images of faces and people.
224
-
225
- - **Lossy Autoencoding:** The autoencoding aspect of the model is lossy, which means that some details in the input text may not be perfectly retained in the generated images.
226
-
227
- ### Bias
228
-
229
- The SSD-1B Model is trained on a diverse dataset, but like all generative models, it may exhibit biases present in the training data. Users are encouraged to be mindful of potential biases in the model's outputs and take appropriate steps to mitigate them.
 
18
 
19
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/0Iu_0f0d1ihGy0YiOd9uS.png)
20
 
21
+ ## Demo
22
+
23
+ Try out the model at [Segmind SSD-1B](https://www.segmind.com/models/ssd-1b) for ⚡ fastest inference. You can also try it on [🤗 Spaces](https://huggingface.co/spaces/segmind/Segmind-Stable-Diffusion)
24
 
25
  ## Model Description
26
 
 
30
 
31
  Special thanks to the HF team 🤗 especially [Sayak](https://huggingface.co/sayakpaul), [Patrick](https://github.com/patrickvonplaten) and [Poli](https://huggingface.co/multimodalart) for their collaboration and guidance on this work.
32
 
 
 
 
 
33
  ## Image Comparision (SDXL-1.0 vs SSD-1B)
34
 
35
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/mOM_OMxbivVBELad1QQYj.png)
 
101
 
102
  ### Speed Comparision
103
 
104
+ We have observed that SSD-1B is upto 60% faster than the Base SDXL Model. Below is a comparision on an A100 80GB.
105
+
106
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/TyymF1OkUjXLrHUp1XF0t.png)
107
 
108
+ Below are the speed up metrics on a RTX 4090 GPU.
109
 
110
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62039c2d91d53938a643317d/moMZrlDr-HTFkZlqWHUjQ.png)
111
 
 
213
 
214
  ## Limitations and Bias
215
 
216
+ Limitations & Bias
217
+ The SSD-1B Model has some challenges in embodying absolute photorealism, especially in human depictions. While it grapples with incorporating clear text and maintaining the fidelity of complex compositions due to its autoencoding approach, these hurdles pave the way for future enhancements. Importantly, the model's exposure to a diverse dataset, though not a panacea for ingrained societal and digital biases, represents a foundational step towards more equitable technology. Users are encouraged to interact with this pioneering tool with an understanding of its current limitations, fostering an environment of conscious engagement and anticipation for its continued evolution.