Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
AutoTrain Compatible
4-bit precision
8-bit precision
Eval Results
Merge
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
5,554
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
microsoft/OmniParser
Image-Text-to-Text
•
Updated
9 days ago
•
6.11k
•
1.13k
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
Sep 30
•
2.34M
•
•
893
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
1.17M
•
•
802
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Sep 18
•
334k
•
1.17k
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Aug 26
•
196k
•
690
rhymes-ai/Aria
Image-Text-to-Text
•
Updated
3 days ago
•
28.1k
•
580
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
Sep 27
•
101k
•
332
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
2.27M
•
•
831
OS-Copilot/OS-Atlas-Base-7B
Image-Text-to-Text
•
Updated
5 days ago
•
692
•
13
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
6 days ago
•
1.65M
•
1.2k
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
418k
•
256
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
25 days ago
•
132k
•
800
jadechoghari/Ferret-UI-Gemma2b
Image-Text-to-Text
•
Updated
23 days ago
•
1.87k
•
43
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
Updated
23 days ago
•
744
•
41
BAAI/Aquila-VL-2B-llava-qwen
Image-Text-to-Text
•
Updated
12 days ago
•
1.4k
•
41
Salesforce/blip-image-captioning-large
Image-to-Text
•
Updated
Dec 7, 2023
•
2.45M
•
•
1.14k
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
63.8k
•
156
OpenFace-CQUPT/Human_LLaVA
Visual Question Answering
•
Updated
4 days ago
•
551
•
23
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
23 days ago
•
18.7k
•
733
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental
Image-Text-to-Text
•
Updated
7 days ago
•
200
•
9
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
•
Updated
Sep 30
•
213k
•
250
microsoft/trocr-base-handwritten
Image-to-Text
•
Updated
May 27
•
514k
•
•
327
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8
•
802k
•
359
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
•
Updated
Sep 27
•
5.37k
•
96
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
about 1 month ago
•
74k
•
422
mistral-community/pixtral-12b
Image-Text-to-Text
•
Updated
23 days ago
•
28.1k
•
65
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
27 days ago
•
28.8k
•
583
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
Updated
Sep 26
•
690k
•
554
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
Sep 25
•
20.3k
•
39
AIDC-AI/Ovis1.6-Gemma2-9B-GPTQ-Int4
Image-Text-to-Text
•
Updated
2 days ago
•
174
•
5
Previous
1
2
3
...
100
Next