← Back to all models

Llama 4 Maverick 17B 128E Instruct

llama-4-maverick

Llama 4 Maverick 17B-128E is Llama 4's largest and most capable model. It uses the Mixture-of-Experts (MoE) architecture and early fusion to provide coding, reasoning, and image capabilities.

Available at 5 Providers

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description	Free
vercel	vercel	Input: $0.15	Output: $0.60	Llama 4 Maverick 17B-128E is Llama 4's largest and most capable model. It uses the Mixture-of-Experts (MoE) architecture and early fusion to provide coding, reasoning, and image capabilities.
together	together	Input: $0.27	Output: $0.85	-
helicone	models-dev	Input: $0.15	Output: $0.60	Provider: Helicone, Context: 131072, Output Limit: 8192
nanogpt	models-dev	Input: $1.00	Output: $2.00	Provider: NanoGPT, Context: 128000, Output Limit: 8192
openrouter	openrouter	Input: $0.15	Output: $0.60	Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction. Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput. Context: 1048576