← Back to all models

Gemini 2.5 Flash

gemini-2.5-flash

Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window.

Available at 15 Providers

Provider Source Input Price ($/1M) Output Price ($/1M) Description Free
vercel vercel Input: $0.30 Output: $2.50 Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window.
poe poe Input: $0.21 Output: $1.80 Gemini 2.5 Flash builds upon the popular foundation of Google's 2.0 Flash, this new version delivers a major upgrade in reasoning capabilities, search capabilities, and image/video understanding while still prioritizing speed and cost. Supports 1M tokens of input context. Serves the latest `gemini-2.5-flash-preview-09-2025` snapshot. To instruct the bot to use more thinking effort, add --thinking_budget and a number ranging from 0 to 24,576 to the end of your message. To use web search and real-time information access, add `--web_search true` to enable and add `--web_search false` to disable (default setting).
abacus models-dev Input: $0.30 Output: $2.50 Provider: Abacus, Context: 1048576, Output Limit: 65536
helicone models-dev Input: $0.30 Output: $2.50 Provider: Helicone, Context: 1048576, Output Limit: 65535
fastrouter models-dev Input: $0.30 Output: $2.50 Provider: FastRouter, Context: 1048576, Output Limit: 65536
google models-dev Input: $0.30 Output: $2.50 Provider: Google, Context: 1048576, Output Limit: 65536
googlevertex models-dev Input: $0.30 Output: $2.50 Provider: Vertex, Context: 1048576, Output Limit: 65536
zenmux models-dev Input: $0.30 Output: $2.50 Provider: ZenMux, Context: 1048576, Output Limit: 64000
requesty models-dev Input: $0.30 Output: $2.50 Provider: Requesty, Context: 1048576, Output Limit: 65536
sapaicore models-dev Input: $0.30 Output: $2.50 Provider: SAP AI Core, Context: 1048576, Output Limit: 65536
aihubmix models-dev Input: $0.08 Output: $0.30 Provider: AIHubMix, Context: 1000000, Output Limit: 65000
deepinfra litellm Input: $0.30 Output: $2.50 Source: deepinfra, Context: 1000000
vertex litellm Input: $0.30 Output: $2.50 Source: vertex, Context: 1048576
gemini litellm Input: $0.30 Output: $2.50 Source: gemini, Context: 1048576
openrouter openrouter Input: $0.30 Output: $2.50 Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning). Context: 1048576