Papers
arxiv:2408.16766

CSGO: Content-Style Composition in Text-to-Image Generation

Published on Aug 29
ยท Submitted by akhaliq on Aug 30
Authors:
,
Hao Ai ,

Abstract

The diffusion model has shown exceptional capabilities in controlled image generation, which has further fueled interest in image style transfer. Existing works mainly focus on training free-based methods (e.g., image inversion) due to the scarcity of specific data. In this study, we present a data construction pipeline for content-style-stylized image triplets that generates and automatically cleanses stylized data triplets. Based on this pipeline, we construct a dataset IMAGStyle, the first large-scale style transfer dataset containing 210k image triplets, available for the community to explore and research. Equipped with IMAGStyle, we propose CSGO, a style transfer model based on end-to-end training, which explicitly decouples content and style features employing independent feature injection. The unified CSGO implements image-driven style transfer, text-driven stylized synthesis, and text editing-driven stylized synthesis. Extensive experiments demonstrate the effectiveness of our approach in enhancing style control capabilities in image generation. Additional visualization and access to the source code can be located on the project page: https://csgo-gen.github.io/.

Community

Paper submitter

EDIT: Old wrong space, waiting for the new one
ORIGINAL COMMENT:
Space: https://huggingface.co/spaces/InstantX/InstantStyle

ยท
Paper author

This online demo is not CSGO, it's InstantStyle, and we're actively working on a new demo.

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Paper author

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2408.16766 in a dataset README.md to link it from this page.

Spaces citing this paper 4

Collections including this paper 7