To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Paper • 2311.07574 • Published Nov 13, 2023 • 14
CSGO: Content-Style Composition in Text-to-Image Generation Paper • 2408.16766 • Published Aug 29 • 17
CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published Aug 29 • 56