Llama 3.2 11B Vision Instruct

llama-3.2-11b

Instruction-tuned generative model for image reasoning (text and images in, text out), optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.

Available at 1 Provider

Provider  Source  Input Price ($/1M)  Output Price ($/1M)  Free
vercel    vercel  $0.16               $0.16                (not listed)
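
As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. It assumes the "$/1M" unit means USD per one million tokens, which the listing does not state explicitly. The function name `estimate_cost` and the token counts are hypothetical, and the listing does not say how image inputs are counted, so that is left out.

```python
# Prices copied from the provider table, in USD per 1M units.
# Assumption: the unit is tokens; the listing only says "$/1M".
INPUT_PRICE_PER_M = 0.16
OUTPUT_PRICE_PER_M = 0.16


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative token counts, not measured values.
    cost = estimate_cost(input_tokens=2_000, output_tokens=500)
    print(f"Estimated cost: ${cost:.6f}")
```

Because the input and output prices are the same here, the estimate depends only on the total count, but keeping the two rates separate makes the sketch easy to adapt if they diverge.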