Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
SakanaAI
/
Llama-3-EvoVLM-JP-v2
like
19
Follow
Sakana AI
152
Image-to-Text
Transformers
Safetensors
Japanese
llava
image-text-to-text
multimodal
vision-language
mantis
llama3
siglip
Inference Endpoints
arxiv:
2403.13187
License:
llama3
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
b34a669
Llama-3-EvoVLM-JP-v2
Commit History
Delete the repository
b34a669
verified
Inoichan
commited on
Aug 1
Update license
97c3229
verified
Inoichan
commited on
Aug 1
Update usage
0f8098c
verified
Inoichan
commited on
Aug 1
Fix device
75d77d8
verified
Inoichan
commited on
Aug 1
update the links to the blog
2be6ebf
verified
Inoichan
commited on
Jul 31
Update README.md
0469061
verified
Inoichan
commited on
Jul 31
Update README.md
9b4dcdb
verified
Inoichan
commited on
Jul 30
Update README.md
a768111
verified
Inoichan
commited on
Jul 29
Upload LlavaForConditionalGeneration
9cf9a20
verified
Inoichan
commited on
Jul 29
initial commit
f35dc74
verified
Inoichan
commited on
Jul 29