← Back to ModelsCompare Models →
MetaOfficial
Meta: Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Pricing
Input
$0.245/1M tokens
Output
$0.245/1M tokens
Capabilities
Context Window131K tokens
Max Output16K tokens
SpeedMedium
Release2024-09
Vision
Tool Use
API Access
Local / Open
API Access
Official Links
Compare Meta: Llama 3.2 11B Vision Instruct
See how it stacks up side by side.