FLUX.1-dev / README.md
Guido1181's picture
Add generated example
3cb64ec verified
|
raw
history blame
5.65 kB
metadata
language:
  - en
license: other
license_name: flux-1-dev-non-commercial-license
license_link: LICENSE.md
extra_gated_prompt: >-
  By clicking "Agree", you agree to the [FluxDev Non-Commercial License
  Agreement](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md)
  and acknowledge the [Acceptable Use
  Policy](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/POLICY.md).
tags:
  - text-to-image
  - image-generation
  - flux
widget:
  - text: >-
      Create a painting in the style of 16th or 17th century art, reminiscent of
      the works of Caravaggio or Rembrandt. The scene should have dramatic
      chiaroscuro, with deep shadows and bright highlights. Brushstrokes should
      be fluid and delicate, blending colors subtly to achieve a realistic
      texture. Use a warm, earthy color palette, emphasizing deep reds, browns,
      and ochres, with golden light casting soft highlights on the subject. The
      composition should be balanced and harmonious, with attention to classical
      proportions and depth, evoking the grandeur and realism of Renaissance and
      Baroque masterpieces. The image to be depicted is that of a Templar
      dressed in a white tunic with a painted red cross and a red sheer organza
      cape. He is kneeling in a dark place while contemplating the golden Holy
      Grail. The scene is divided into two parts: one hellish and one heavenly
      to highlight the contrast between the two dimensions in which he finds
      himself.  On one side elements such as Crows dry branches and thorns and
      on the other elements such as Roses Lilies and Dahlias. On one side a dove
      on the other a black crow
    output:
      url: images/example_a313spmjg.png

FLUX.1 [dev] Grid

FLUX.1 [dev] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions. For more information, please read our blog post.

Key Features

  1. Cutting-edge output quality, second only to our state-of-the-art model FLUX.1 [pro].
  2. Competitive prompt following, matching the performance of closed source alternatives .
  3. Trained using guidance distillation, making FLUX.1 [dev] more efficient.
  4. Open weights to drive new scientific research, and empower artists to develop innovative workflows.
  5. Generated outputs can be used for personal, scientific, and commercial purposes as described in the FLUX.1 [dev] Non-Commercial License.

Usage

We provide a reference implementation of FLUX.1 [dev], as well as sampling code, in a dedicated github repository. Developers and creatives looking to build on top of FLUX.1 [dev] are encouraged to use this as a starting point.

API Endpoints

The FLUX.1 models are also available via API from the following sources

ComfyUI

FLUX.1 [dev] is also available in Comfy UI for local inference with a node-based workflow.

Diffusers

To use FLUX.1 [dev] with the 🧨 diffusers python library, first install or upgrade diffusers

pip install -U diffusers

Then you can use FluxPipeline to run the model

import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power

prompt = "A cat holding a sign that says hello world"
image = pipe(
    prompt,
    height=1024,
    width=1024,
    guidance_scale=3.5,
    num_inference_steps=50,
    max_sequence_length=512,
    generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("flux-dev.png")

To learn more check out the diffusers documentation


Limitations

  • This model is not intended or able to provide factual information.
  • As a statistical model this checkpoint might amplify existing societal biases.
  • The model may fail to generate output that matches the prompts.
  • Prompt following is heavily influenced by the prompting-style.

Out-of-Scope Use

The model and its derivatives may not be used

  • In any way that violates any applicable national, federal, state, local or international law or regulation.
  • For the purpose of exploiting, harming or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
  • To generate or disseminate verifiably false information and/or content with the purpose of harming others.
  • To generate or disseminate personal identifiable information that can be used to harm an individual.
  • To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
  • To create non-consensual nudity or illegal pornographic content.
  • For fully automated decision making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
  • Generating or facilitating large-scale disinformation campaigns.

License

This model falls under the FLUX.1 [dev] Non-Commercial License.