About:
This is a llava model using tinyllama as its language model and openai/clip-vit-l-14-336 as its vision tower. Multi-modal projection layers are untrained as of now.
- Downloads last month
- 4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.