| Kimi K2 | kimi-k2-instruct | 0.50 | 2.00 | Provider: Deep Infra, Context: 131072, Output Limit: 32768 | |
| Kimi K2 Thinking | kimi-k2-thinking | 0.47 | 2.00 | Provider: Deep Infra, Context: 131072, Output Limit: 32768 | |
| MiniMax M2 | minimax-m2 | 0.25 | 1.02 | Provider: Deep Infra, Context: 262144, Output Limit: 32768 | |
| GPT OSS 20B | gpt-oss-20b | 0.03 | 0.14 | Provider: Deep Infra, Context: 131072, Output Limit: 16384 | |
| GPT OSS 120B | gpt-oss-120b | 0.05 | 0.24 | Provider: Deep Infra, Context: 131072, Output Limit: 16384 | |
| Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 0.40 | 1.60 | Provider: Deep Infra, Context: 262144, Output Limit: 66536 | |
| Qwen3 Coder 480B A35B Instruct Turbo | qwen3-coder-480b-a35b-instruct-turbo | 0.30 | 1.20 | Provider: Deep Infra, Context: 262144, Output Limit: 66536 | |
| GLM-4.5 | glm-4.5 | 0.60 | 2.20 | Provider: Deep Infra, Context: 131072, Output Limit: 98304 | |
| GLM-4.7 | glm-4.7 | 0.43 | 1.75 | Provider: Deep Infra, Context: 202752, Output Limit: 16384 | |
| MythoMax-L2-13b | mythomax-l2-13b | 0.08 | 0.09 | Source: deepinfra, Context: 4096 | |
| Hermes-3-Llama-3.1-405B | hermes-3-llama-3.1-405b | 1.00 | 1.00 | Source: deepinfra, Context: 131072 | |
| Hermes-3-Llama-3.1-70B | hermes-3-llama-3.1-70b | 0.30 | 0.30 | Source: deepinfra, Context: 131072 | |
| QwQ-32B | qwq-32b | 0.15 | 0.40 | Source: deepinfra, Context: 131072 | |
| Qwen2.5-72B-Instruct | qwen2.5-72b-instruct | 0.12 | 0.39 | Source: deepinfra, Context: 32768 | |
| Qwen2.5-7B-Instruct | qwen2.5-7b-instruct | 0.04 | 0.10 | Source: deepinfra, Context: 32768 | |
| Qwen2.5-VL-32B-Instruct | qwen2.5-vl-32b-instruct | 0.20 | 0.60 | Source: deepinfra, Context: 128000 | |
| Qwen3-14B | qwen3-14b | 0.06 | 0.24 | Source: deepinfra, Context: 40960 | |
| Qwen3-235B-A22B | qwen3-235b-a22b | 0.18 | 0.54 | Source: deepinfra, Context: 40960 | |
| Qwen3-235B-A22B-Instruct-2507 | qwen3-235b-a22b-instruct-2507 | 0.09 | 0.60 | Source: deepinfra, Context: 262144 | |
| Qwen3-235B-A22B-Thinking-2507 | qwen3-235b-a22b-thinking-2507 | 0.30 | 2.90 | Source: deepinfra, Context: 262144 | |
| Qwen3-30B-A3B | qwen3-30b-a3b | 0.08 | 0.29 | Source: deepinfra, Context: 40960 | |
| Qwen3-32B | qwen3-32b | 0.10 | 0.28 | Source: deepinfra, Context: 40960 | |
| Qwen3-Next-80B-A3B-Instruct | qwen3-next-80b-a3b-instruct | 0.14 | 1.40 | Source: deepinfra, Context: 262144 | |
| Qwen3-Next-80B-A3B-Thinking | qwen3-next-80b-a3b-thinking | 0.14 | 1.40 | Source: deepinfra, Context: 262144 | |
| L3-8B-Lunaris-v1-Turbo | l3-8b-lunaris-v1-turbo | 0.04 | 0.05 | Source: deepinfra, Context: 8192 | |
| L3.1-70B-Euryale-v2.2 | l3.1-70b-euryale-v2.2 | 0.65 | 0.75 | Source: deepinfra, Context: 131072 | |
| L3.3-70B-Euryale-v2.3 | l3.3-70b-euryale-v2.3 | 0.65 | 0.75 | Source: deepinfra, Context: 131072 | |
| olmOCR-7B-0725-FP8 | olmocr-7b-0725-fp8 | 0.27 | 1.50 | Source: deepinfra, Context: 16384 | |
| claude-3-7-sonnet-latest | claude-3-7-sonnet-latest | 3.30 | 16.50 | Source: deepinfra, Context: 200000 | |
| claude-4-opus | claude-4-opus | 16.50 | 82.50 | Source: deepinfra, Context: 200000 | |
| claude-4-sonnet | claude-4-sonnet | 3.30 | 16.50 | Source: deepinfra, Context: 200000 | |
| DeepSeek-R1 | deepseek-r1 | 0.70 | 2.40 | Source: deepinfra, Context: 163840 | |
| DeepSeek-R1-0528 | deepseek-r1-0528 | 0.50 | 2.15 | Source: deepinfra, Context: 163840 | |
| DeepSeek-R1-0528-Turbo | deepseek-r1-0528-turbo | 1.00 | 3.00 | Source: deepinfra, Context: 32768 | |
| DeepSeek-R1-Distill-Llama-70B | deepseek-r1-distill-llama-70b | 0.20 | 0.60 | Source: deepinfra, Context: 131072 | |
| DeepSeek-R1-Distill-Qwen-32B | deepseek-r1-distill-qwen-32b | 0.27 | 0.27 | Source: deepinfra, Context: 131072 | |
| DeepSeek-R1-Turbo | deepseek-r1-turbo | 1.00 | 3.00 | Source: deepinfra, Context: 40960 | |
| DeepSeek-V3 | deepseek-v3 | 0.38 | 0.89 | Source: deepinfra, Context: 163840 | |
| DeepSeek-V3-0324 | deepseek-v3-0324 | 0.25 | 0.88 | Source: deepinfra, Context: 163840 | |
| DeepSeek-V3.1 | deepseek-v3.1 | 0.27 | 1.00 | Source: deepinfra, Context: 163840 | |
| DeepSeek-V3.1-Terminus | deepseek-v3.1-terminus | 0.27 | 1.00 | Source: deepinfra, Context: 163840 | |
| gemini-2.0-flash-001 | gemini-2.0-flash-001 | 0.10 | 0.40 | Source: deepinfra, Context: 1000000 | |
| gemini-2.5-flash | gemini-2.5-flash | 0.30 | 2.50 | Source: deepinfra, Context: 1000000 | |
| gemini-2.5-pro | gemini-2.5-pro | 1.25 | 10.00 | Source: deepinfra, Context: 1000000 | |
| gemma-3-12b-it | gemma-3-12b-it | 0.05 | 0.10 | Source: deepinfra, Context: 131072 | |
| gemma-3-27b-it | gemma-3-27b-it | 0.09 | 0.16 | Source: deepinfra, Context: 131072 | |
| gemma-3-4b-it | gemma-3-4b-it | 0.04 | 0.08 | Source: deepinfra, Context: 131072 | |
| Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.05 | 0.05 | Source: deepinfra, Context: 131072 | |
| Llama-3.2-3B-Instruct | llama-3.2-3b-instruct | 0.02 | 0.02 | Source: deepinfra, Context: 131072 | |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.23 | 0.40 | Source: deepinfra, Context: 131072 | |
| Llama-3.3-70B-Instruct-Turbo | llama-3.3-70b-instruct-turbo | 0.13 | 0.39 | Source: deepinfra, Context: 131072 | |
| Llama-4-Maverick-17B-128E-Instruct-FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.15 | 0.60 | Source: deepinfra, Context: 1048576 | |
| Llama-4-Scout-17B-16E-Instruct | llama-4-scout-17b-16e-instruct | 0.08 | 0.30 | Source: deepinfra, Context: 327680 | |
| Llama-Guard-3-8B | llama-guard-3-8b | 0.06 | 0.06 | Source: deepinfra, Context: 131072 | |
| Llama-Guard-4-12B | llama-guard-4-12b | 0.18 | 0.18 | Source: deepinfra, Context: 163840 | |
| Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.03 | 0.06 | Source: deepinfra, Context: 8192 | |
| Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 0.40 | 0.40 | Source: deepinfra, Context: 131072 | |
| Meta-Llama-3.1-70B-Instruct-Turbo | meta-llama-3.1-70b-instruct-turbo | 0.10 | 0.28 | Source: deepinfra, Context: 131072 | |
| Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.03 | 0.05 | Source: deepinfra, Context: 131072 | |
| Meta-Llama-3.1-8B-Instruct-Turbo | meta-llama-3.1-8b-instruct-turbo | 0.02 | 0.03 | Source: deepinfra, Context: 131072 | |
| WizardLM-2-8x22B | wizardlm-2-8x22b | 0.48 | 0.48 | Source: deepinfra, Context: 65536 | |
| phi-4 | phi-4 | 0.07 | 0.14 | Source: deepinfra, Context: 16384 | |
| Mistral-Nemo-Instruct-2407 | mistral-nemo-instruct-2407 | 0.02 | 0.04 | Source: deepinfra, Context: 131072 | |
| Mistral-Small-24B-Instruct-2501 | mistral-small-24b-instruct-2501 | 0.05 | 0.08 | Source: deepinfra, Context: 32768 | |
| Mistral-Small-3.2-24B-Instruct-2506 | mistral-small-3.2-24b-instruct-2506 | 0.08 | 0.20 | Source: deepinfra, Context: 128000 | |
| Mixtral-8x7B-Instruct-v0.1 | mixtral-8x7b-instruct-v0.1 | 0.40 | 0.40 | Source: deepinfra, Context: 32768 | |
| Kimi-K2-Instruct-0905 | kimi-k2-instruct-0905 | 0.50 | 2.00 | Source: deepinfra, Context: 262144 | |
| Llama-3.1-Nemotron-70B-Instruct | llama-3.1-nemotron-70b-instruct | 0.60 | 0.60 | Source: deepinfra, Context: 131072 | |
| Llama-3.3-Nemotron-Super-49B-v1.5 | llama-3.3-nemotron-super-49b-v1.5 | 0.10 | 0.40 | Source: deepinfra, Context: 131072 | |
| NVIDIA-Nemotron-Nano-9B-v2 | nvidia-nemotron-nano-9b-v2 | 0.04 | 0.16 | Source: deepinfra, Context: 131072 | |