Awesome Visual Embedding - a RhapsodyAI Collection

RhapsodyAI 's Collections

Awesome Visual Embedding

Awesome Visual Embedding

updated Jul 23

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20 • 500 • 43
vidore/colidefics

Updated Jul 11 • 2
vidore/colpali

Updated Sep 27 • 33k • 370
Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17 • 9
ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27 • 41
Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published May 30 • 32
Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4 • 6
Synthetic Multimodal Question Generation

Paper • 2407.02233 • Published Jul 2 • 1
RankCLIP: Ranking-Consistent Language-Image Pretraining

Paper • 2404.09387 • Published Apr 15