Grok 2 Vision

grok-2-vision

The Grok 2 vision model excels at vision-based tasks, delivering state-of-the-art performance in visual math reasoning (MathVista) and document-based question answering (DocVQA). It can process a wide variety of visual inputs, including documents, diagrams, charts, screenshots, and photographs.

Available at 2 Providers

Provider	Source	Input Price ($/1M tokens)	Output Price ($/1M tokens)	Description	Free
vercel	vercel	$2.00	$10.00	See the model description above.
xai	models-dev	$2.00	$10.00	Provider: xAI, Context: 8,192 tokens, Output Limit: 4,096 tokens
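
As a rough illustration of how the listed prices translate into per-request cost, the sketch below applies the table's rates ($2.00 per 1M input tokens, $10.00 per 1M output tokens) and the limits from the xai row. The function name `estimate_cost` is hypothetical, and the check that input plus output must fit in the 8,192-token context is an assumption, not something the listing states. How images are converted into input tokens is also not covered here, so the token counts are taken as given.

```python
# Rates and limits copied from the provider table (USD per 1M tokens).
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 10.00
CONTEXT_WINDOW = 8192   # tokens, from the xai row
OUTPUT_LIMIT = 4096     # tokens, from the xai row


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > OUTPUT_LIMIT:
        raise ValueError("output exceeds the listed output limit")
    # Assumption: the context window covers input and output together.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 3,000 input tokens and 500 output tokens:
    # (3000 * 2.00 + 500 * 10.00) / 1,000,000 = 0.011 USD
    print(f"${estimate_cost(3000, 500):.4f}")
```

Because output tokens are priced at five times the input rate, a short answer to a large document can still cost about as much as the prompt itself, as in the example above.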