← Back to all models

Deepinfra Models

70 Models
Name Model ID Input Price ($/1M) Output Price ($/1M) Description Free
Kimi K2 kimi-k2-instruct 0.50 2.00 Provider: Deep Infra, Context: 131072, Output Limit: 32768
Kimi K2 Thinking kimi-k2-thinking 0.47 2.00 Provider: Deep Infra, Context: 131072, Output Limit: 32768
MiniMax M2 minimax-m2 0.25 1.02 Provider: Deep Infra, Context: 262144, Output Limit: 32768
GPT OSS 20B gpt-oss-20b 0.03 0.14 Provider: Deep Infra, Context: 131072, Output Limit: 16384
GPT OSS 120B gpt-oss-120b 0.05 0.24 Provider: Deep Infra, Context: 131072, Output Limit: 16384
Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct 0.40 1.60 Provider: Deep Infra, Context: 262144, Output Limit: 66536
Qwen3 Coder 480B A35B Instruct Turbo qwen3-coder-480b-a35b-instruct-turbo 0.30 1.20 Provider: Deep Infra, Context: 262144, Output Limit: 66536
GLM-4.5 glm-4.5 0.60 2.20 Provider: Deep Infra, Context: 131072, Output Limit: 98304
GLM-4.7 glm-4.7 0.43 1.75 Provider: Deep Infra, Context: 202752, Output Limit: 16384
MythoMax-L2-13b mythomax-l2-13b 0.08 0.09 Source: deepinfra, Context: 4096
Hermes-3-Llama-3.1-405B hermes-3-llama-3.1-405b 1.00 1.00 Source: deepinfra, Context: 131072
Hermes-3-Llama-3.1-70B hermes-3-llama-3.1-70b 0.30 0.30 Source: deepinfra, Context: 131072
QwQ-32B qwq-32b 0.15 0.40 Source: deepinfra, Context: 131072
Qwen2.5-72B-Instruct qwen2.5-72b-instruct 0.12 0.39 Source: deepinfra, Context: 32768
Qwen2.5-7B-Instruct qwen2.5-7b-instruct 0.04 0.10 Source: deepinfra, Context: 32768
Qwen2.5-VL-32B-Instruct qwen2.5-vl-32b-instruct 0.20 0.60 Source: deepinfra, Context: 128000
Qwen3-14B qwen3-14b 0.06 0.24 Source: deepinfra, Context: 40960
Qwen3-235B-A22B qwen3-235b-a22b 0.18 0.54 Source: deepinfra, Context: 40960
Qwen3-235B-A22B-Instruct-2507 qwen3-235b-a22b-instruct-2507 0.09 0.60 Source: deepinfra, Context: 262144
Qwen3-235B-A22B-Thinking-2507 qwen3-235b-a22b-thinking-2507 0.30 2.90 Source: deepinfra, Context: 262144
Qwen3-30B-A3B qwen3-30b-a3b 0.08 0.29 Source: deepinfra, Context: 40960
Qwen3-32B qwen3-32b 0.10 0.28 Source: deepinfra, Context: 40960
Qwen3-Next-80B-A3B-Instruct qwen3-next-80b-a3b-instruct 0.14 1.40 Source: deepinfra, Context: 262144
Qwen3-Next-80B-A3B-Thinking qwen3-next-80b-a3b-thinking 0.14 1.40 Source: deepinfra, Context: 262144
L3-8B-Lunaris-v1-Turbo l3-8b-lunaris-v1-turbo 0.04 0.05 Source: deepinfra, Context: 8192
L3.1-70B-Euryale-v2.2 l3.1-70b-euryale-v2.2 0.65 0.75 Source: deepinfra, Context: 131072
L3.3-70B-Euryale-v2.3 l3.3-70b-euryale-v2.3 0.65 0.75 Source: deepinfra, Context: 131072
olmOCR-7B-0725-FP8 olmocr-7b-0725-fp8 0.27 1.50 Source: deepinfra, Context: 16384
claude-3-7-sonnet-latest claude-3-7-sonnet-latest 3.30 16.50 Source: deepinfra, Context: 200000
claude-4-opus claude-4-opus 16.50 82.50 Source: deepinfra, Context: 200000
claude-4-sonnet claude-4-sonnet 3.30 16.50 Source: deepinfra, Context: 200000
DeepSeek-R1 deepseek-r1 0.70 2.40 Source: deepinfra, Context: 163840
DeepSeek-R1-0528 deepseek-r1-0528 0.50 2.15 Source: deepinfra, Context: 163840
DeepSeek-R1-0528-Turbo deepseek-r1-0528-turbo 1.00 3.00 Source: deepinfra, Context: 32768
DeepSeek-R1-Distill-Llama-70B deepseek-r1-distill-llama-70b 0.20 0.60 Source: deepinfra, Context: 131072
DeepSeek-R1-Distill-Qwen-32B deepseek-r1-distill-qwen-32b 0.27 0.27 Source: deepinfra, Context: 131072
DeepSeek-R1-Turbo deepseek-r1-turbo 1.00 3.00 Source: deepinfra, Context: 40960
DeepSeek-V3 deepseek-v3 0.38 0.89 Source: deepinfra, Context: 163840
DeepSeek-V3-0324 deepseek-v3-0324 0.25 0.88 Source: deepinfra, Context: 163840
DeepSeek-V3.1 deepseek-v3.1 0.27 1.00 Source: deepinfra, Context: 163840
DeepSeek-V3.1-Terminus deepseek-v3.1-terminus 0.27 1.00 Source: deepinfra, Context: 163840
gemini-2.0-flash-001 gemini-2.0-flash-001 0.10 0.40 Source: deepinfra, Context: 1000000
gemini-2.5-flash gemini-2.5-flash 0.30 2.50 Source: deepinfra, Context: 1000000
gemini-2.5-pro gemini-2.5-pro 1.25 10.00 Source: deepinfra, Context: 1000000
gemma-3-12b-it gemma-3-12b-it 0.05 0.10 Source: deepinfra, Context: 131072
gemma-3-27b-it gemma-3-27b-it 0.09 0.16 Source: deepinfra, Context: 131072
gemma-3-4b-it gemma-3-4b-it 0.04 0.08 Source: deepinfra, Context: 131072
Llama-3.2-11B-Vision-Instruct llama-3.2-11b-vision-instruct 0.05 0.05 Source: deepinfra, Context: 131072
Llama-3.2-3B-Instruct llama-3.2-3b-instruct 0.02 0.02 Source: deepinfra, Context: 131072
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 0.23 0.40 Source: deepinfra, Context: 131072
Llama-3.3-70B-Instruct-Turbo llama-3.3-70b-instruct-turbo 0.13 0.39 Source: deepinfra, Context: 131072
Llama-4-Maverick-17B-128E-Instruct-FP8 llama-4-maverick-17b-128e-instruct-fp8 0.15 0.60 Source: deepinfra, Context: 1048576
Llama-4-Scout-17B-16E-Instruct llama-4-scout-17b-16e-instruct 0.08 0.30 Source: deepinfra, Context: 327680
Llama-Guard-3-8B llama-guard-3-8b 0.06 0.06 Source: deepinfra, Context: 131072
Llama-Guard-4-12B llama-guard-4-12b 0.18 0.18 Source: deepinfra, Context: 163840
Meta-Llama-3-8B-Instruct meta-llama-3-8b-instruct 0.03 0.06 Source: deepinfra, Context: 8192
Meta-Llama-3.1-70B-Instruct meta-llama-3.1-70b-instruct 0.40 0.40 Source: deepinfra, Context: 131072
Meta-Llama-3.1-70B-Instruct-Turbo meta-llama-3.1-70b-instruct-turbo 0.10 0.28 Source: deepinfra, Context: 131072
Meta-Llama-3.1-8B-Instruct meta-llama-3.1-8b-instruct 0.03 0.05 Source: deepinfra, Context: 131072
Meta-Llama-3.1-8B-Instruct-Turbo meta-llama-3.1-8b-instruct-turbo 0.02 0.03 Source: deepinfra, Context: 131072
WizardLM-2-8x22B wizardlm-2-8x22b 0.48 0.48 Source: deepinfra, Context: 65536
phi-4 phi-4 0.07 0.14 Source: deepinfra, Context: 16384
Mistral-Nemo-Instruct-2407 mistral-nemo-instruct-2407 0.02 0.04 Source: deepinfra, Context: 131072
Mistral-Small-24B-Instruct-2501 mistral-small-24b-instruct-2501 0.05 0.08 Source: deepinfra, Context: 32768
Mistral-Small-3.2-24B-Instruct-2506 mistral-small-3.2-24b-instruct-2506 0.08 0.20 Source: deepinfra, Context: 128000
Mixtral-8x7B-Instruct-v0.1 mixtral-8x7b-instruct-v0.1 0.40 0.40 Source: deepinfra, Context: 32768
Kimi-K2-Instruct-0905 kimi-k2-instruct-0905 0.50 2.00 Source: deepinfra, Context: 262144
Llama-3.1-Nemotron-70B-Instruct llama-3.1-nemotron-70b-instruct 0.60 0.60 Source: deepinfra, Context: 131072
Llama-3.3-Nemotron-Super-49B-v1.5 llama-3.3-nemotron-super-49b-v1.5 0.10 0.40 Source: deepinfra, Context: 131072
NVIDIA-Nemotron-Nano-9B-v2 nvidia-nemotron-nano-9b-v2 0.04 0.16 Source: deepinfra, Context: 131072
Sources
models-dev: 9 models
litellm: 61 models