Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 84
Inject Semantic Concepts into Image Tagging for Open-Set Recognition Paper • 2310.15200 • Published Oct 23, 2023 • 5
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23, 2023 • 22