gemini-2.5-flash
Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window.
| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
|---|---|---|---|---|---|
| vercel | vercel | Input: $0.30 | Output: $2.50 | Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window. | |
| poe | poe | Input: $0.21 | Output: $1.80 | Gemini 2.5 Flash builds upon the popular foundation of Google's 2.0 Flash, this new version delivers a major upgrade in reasoning capabilities, search capabilities, and image/video understanding while still prioritizing speed and cost. Supports 1M tokens of input context. Serves the latest `gemini-2.5-flash-preview-09-2025` snapshot. To instruct the bot to use more thinking effort, add --thinking_budget and a number ranging from 0 to 24,576 to the end of your message. To use web search and real-time information access, add `--web_search true` to enable and add `--web_search false` to disable (default setting). | |
| abacus | models-dev | Input: $0.30 | Output: $2.50 | Provider: Abacus, Context: 1048576, Output Limit: 65536 | |
| helicone | models-dev | Input: $0.30 | Output: $2.50 | Provider: Helicone, Context: 1048576, Output Limit: 65535 | |
| fastrouter | models-dev | Input: $0.30 | Output: $2.50 | Provider: FastRouter, Context: 1048576, Output Limit: 65536 | |
| models-dev | Input: $0.30 | Output: $2.50 | Provider: Google, Context: 1048576, Output Limit: 65536 | ||
| googlevertex | models-dev | Input: $0.30 | Output: $2.50 | Provider: Vertex, Context: 1048576, Output Limit: 65536 | |
| zenmux | models-dev | Input: $0.30 | Output: $2.50 | Provider: ZenMux, Context: 1048576, Output Limit: 64000 | |
| requesty | models-dev | Input: $0.30 | Output: $2.50 | Provider: Requesty, Context: 1048576, Output Limit: 65536 | |
| sapaicore | models-dev | Input: $0.30 | Output: $2.50 | Provider: SAP AI Core, Context: 1048576, Output Limit: 65536 | |
| aihubmix | models-dev | Input: $0.08 | Output: $0.30 | Provider: AIHubMix, Context: 1000000, Output Limit: 65000 | |
| deepinfra | litellm | Input: $0.30 | Output: $2.50 | Source: deepinfra, Context: 1000000 | |
| vertex | litellm | Input: $0.30 | Output: $2.50 | Source: vertex, Context: 1048576 | |
| gemini | litellm | Input: $0.30 | Output: $2.50 | Source: gemini, Context: 1048576 | |
| openrouter | openrouter | Input: $0.30 | Output: $2.50 | Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning). Context: 1048576 |