| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 1.00 | 2.00 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| Mistral Small 3.1 | mistral-small-2503 | 0.10 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Codestral 25.01 | codestral-2501 | 0.30 | 0.90 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 256000 | |
| Mistral Large 24.11 | mistral-large-2411 | 2.00 | 6.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| GPT-5 Pro | gpt-5-pro | 15.00 | 120.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 272000 | |
| DeepSeek-V3.2 | deepseek-v3.2 | 0.28 | 0.42 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| MAI-DS-R1 | mai-ds-r1 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5 | gpt-5 | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| GPT-4o mini | gpt-4o-mini | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Phi-4-reasoning-plus | phi-4-reasoning-plus | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 32000, Output Limit: 4096 | |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 10.00 | 30.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Phi-4-reasoning | phi-4-reasoning | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 32000, Output Limit: 4096 | |
| Phi-3-medium-instruct (4k) | phi-3-medium-4k-instruct | 0.17 | 0.68 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 1024 | |
| Codex Mini | codex-mini | 1.50 | 6.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| o3 | o3 | 2.00 | 8.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Mistral Nemo | mistral-nemo | 0.15 | 0.15 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 1.50 | 2.00 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 4096 | |
| Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.30 | 0.61 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| text-embedding-ada-002 | text-embedding-ada-002 | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 1536 | |
| Embed v3 English | cohere-embed-v3-english | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 512, Output Limit: 1024 | |
| Llama 4 Scout 17B 16E Instruct | llama-4-scout-17b-16e-instruct | 0.20 | 0.78 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| o1-mini | o1-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 65536 | |
| GPT-5 Mini | gpt-5-mini | 0.25 | 2.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| Phi-3.5-MoE-instruct | phi-3.5-moe-instruct | 0.16 | 0.64 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| GPT-5.1 Chat | gpt-5.1-chat | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Grok 3 Mini | grok-3-mini | 0.30 | 0.50 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 8192 | |
| o1 | o1 | 15.00 | 60.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.30 | 0.61 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| Phi-4-multimodal | phi-4-multimodal | 0.08 | 0.32 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| o4-mini | o4-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| GPT-4.1 | gpt-4.1 | 2.00 | 8.00 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |
| Ministral 3B | ministral-3b | 0.04 | 0.04 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 1.50 | 2.00 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 4096 | |
| GPT-4o | gpt-4o | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Phi-3-mini-instruct (128k) | phi-3-mini-128k-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Llama-3.2-90B-Vision-Instruct | llama-3.2-90b-vision-instruct | 2.04 | 2.04 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5-Codex | gpt-5-codex | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| GPT-5 Nano | gpt-5-nano | 0.05 | 0.40 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| GPT-5.1 | gpt-5.1 | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| o3-mini | o3-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Model Router | model-router | 0.14 | 0.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Kimi K2 Thinking | kimi-k2-thinking | 0.60 | 2.50 | Provider: Azure Cognitive Services, Context: 262144, Output Limit: 262144 | |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 0.25 | 2.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.71 | 0.71 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| o1-preview | o1-preview | 16.50 | 66.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Phi-3.5-mini-instruct | phi-3.5-mini-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 3.00 | 4.00 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| GPT-4 Turbo | gpt-4-turbo | 10.00 | 30.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 2.68 | 3.54 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Phi-3-small-instruct (8k) | phi-3-small-8k-instruct | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| DeepSeek-V3-0324 | deepseek-v3-0324 | 1.14 | 4.56 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 131072 | |
| Meta-Llama-3-70B-Instruct | meta-llama-3-70b-instruct | 2.68 | 3.54 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| text-embedding-3-large | text-embedding-3-large | 0.13 | 0.00 | Provider: Azure Cognitive Services, Context: 8191, Output Limit: 3072 | |
| Grok 3 | grok-3 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 8192 | |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 0.50 | 1.50 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Phi-4-mini-reasoning | phi-4-mini-reasoning | 0.08 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Phi-4 | phi-4 | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| DeepSeek-V3.1 | deepseek-v3.1 | 0.56 | 1.68 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 131072 | |
| GPT-5 Chat | gpt-5-chat | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| GPT-4.1 mini | gpt-4.1-mini | 0.40 | 1.60 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |
| Llama 4 Maverick 17B 128E Instruct FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.25 | 1.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| Command R+ | cohere-command-r-plus-08-2024 | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4000 | |
| Command A | cohere-command-a | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 8000 | |
| Phi-3-small-instruct (128k) | phi-3-small-128k-instruct | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Claude Opus 4.5 | claude-opus-4-5 | 5.00 | 25.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Mistral Medium 3 | mistral-medium-2505 | 0.40 | 2.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | 0.28 | 0.42 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| Claude Haiku 4.5 | claude-haiku-4-5 | 1.00 | 5.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Phi-3-mini-instruct (4k) | phi-3-mini-4k-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 1024 | |
| GPT-5.1 Codex | gpt-5.1-codex | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| Grok Code Fast 1 | grok-code-fast-1 | 0.20 | 1.50 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 10000 | |
| DeepSeek-R1 | deepseek-r1 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 163840, Output Limit: 163840 | |
| Meta-Llama-3.1-405B-Instruct | meta-llama-3.1-405b-instruct | 5.33 | 16.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| GPT-4 32K | gpt-4-32k | 60.00 | 120.00 | Provider: Azure Cognitive Services, Context: 32768, Output Limit: 32768 | |
| Phi-4-mini | phi-4-mini | 0.08 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Embed v3 Multilingual | cohere-embed-v3-multilingual | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 512, Output Limit: 1024 | |
| Grok 4 | grok-4 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 64000 | |
| Command R | cohere-command-r-08-2024 | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4000 | |
| Embed v4 | cohere-embed-v-4-0 | 0.12 | 0.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 1536 | |
| Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.37 | 0.37 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5.2 Chat | gpt-5.2-chat | 1.75 | 14.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Claude Opus 4.1 | claude-opus-4-1 | 15.00 | 75.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 32000 | |
| GPT-4 | gpt-4 | 60.00 | 120.00 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 8192 | |
| Phi-3-medium-instruct (128k) | phi-3-medium-128k-instruct | 0.17 | 0.68 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Grok 4 Fast (Reasoning) | grok-4-fast-reasoning | 0.20 | 0.50 | Provider: Azure Cognitive Services, Context: 2000000, Output Limit: 30000 | |
| DeepSeek-R1-0528 | deepseek-r1-0528 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 163840, Output Limit: 163840 | |
| Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 0.20 | 0.50 | Provider: Azure Cognitive Services, Context: 2000000, Output Limit: 30000 | |
| text-embedding-3-small | text-embedding-3-small | 0.02 | 0.00 | Provider: Azure Cognitive Services, Context: 8191, Output Limit: 1536 | |
| GPT-4.1 nano | gpt-4.1-nano | 0.10 | 0.40 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |