TokenCenter
← Back to Models
MetaOfficial

Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Pricing

Input
$0.245/1M tokens
Output
$0.245/1M tokens

Capabilities

Context Window131K tokens
Max Output16K tokens
SpeedMedium
Release2024-09
Vision
Tool Use
API Access
Local / Open

API Access

Official Links

Compare Meta: Llama 3.2 11B Vision Instruct

See how it stacks up side by side.

Compare Models →