meta-llama/Llama-3.2-11B-Vision-Instruct
meta-llamaimage-text-to-text
Meta-llama/Llama-3.2-11B-Vision-Instruct is image-text-to-text model published by meta-llama in 2024. It is available through the transformers library and is released under the llama3.2 license, and has 48.5K downloads and 1.6K likes.
About meta-llama/Llama-3.2-11B-Vision-Instruct
meta-llama/Llama-3.2-11B-Vision-Instruct — a image-text-to-text model on the Hugging Face Hub.
LLM pricing & performance
Full LLM page →meta-llama/Llama-3.2-11B-Vision-Instruct is available via API — live cost, context, and benchmark data:
Input / 1M
$0.35
Output / 1M
$0.35
Context
131K
Tokens/sec
—
Details
- Provider
- meta-llama
- Task
- image-text-to-text
- Library
- transformers
- License
- llama3.2
- Released
- 2024-09-18
- Downloads
- 48471
- Likes
- 1610