Azure Models

168 Models
| Name | Model ID | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Description | Free |
|---|---|---|---|---|---|
| GPT-4.1 nano | gpt-4.1-nano | 0.10 | 0.40 | Provider: Azure, Context: 1047576, Output Limit: 32768 | |
| text-embedding-3-small | text-embedding-3-small | 0.02 | 0.00 | Provider: Azure, Context: 8191, Output Limit: 1536 | |
| Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 0.20 | 0.50 | Provider: Azure, Context: 2000000, Output Limit: 30000 | |
| DeepSeek-R1-0528 | deepseek-r1-0528 | 1.35 | 5.40 | Provider: Azure, Context: 163840, Output Limit: 163840 | |
| Grok 4 Fast (Reasoning) | grok-4-fast-reasoning | 0.20 | 0.50 | Provider: Azure, Context: 2000000, Output Limit: 30000 | |
| Phi-3-medium-instruct (128k) | phi-3-medium-128k-instruct | 0.17 | 0.68 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| GPT-4 | gpt-4 | 60.00 | 120.00 | Provider: Azure, Context: 8192, Output Limit: 8192 | |
| Claude Opus 4.1 | claude-opus-4-1 | 15.00 | 75.00 | Provider: Azure, Context: 200000, Output Limit: 32000 | |
| GPT-5.2 Chat | gpt-5.2-chat | 1.75 | 14.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.37 | 0.37 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| Embed v4 | cohere-embed-v-4-0 | 0.12 | 0.00 | Provider: Azure, Context: 128000, Output Limit: 1536 | |
| Command R | cohere-command-r-08-2024 | 0.15 | 0.60 | Provider: Azure, Context: 128000, Output Limit: 4000 | |
| Grok 4 | grok-4 | 3.00 | 15.00 | Provider: Azure, Context: 256000, Output Limit: 64000 | |
| Embed v3 Multilingual | cohere-embed-v3-multilingual | 0.10 | 0.00 | Provider: Azure, Context: 512, Output Limit: 1024 | |
| Phi-4-mini | phi-4-mini | 0.08 | 0.30 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| GPT-4 32K | gpt-4-32k | 60.00 | 120.00 | Provider: Azure, Context: 32768, Output Limit: 32768 | |
| Meta-Llama-3.1-405B-Instruct | meta-llama-3.1-405b-instruct | 5.33 | 16.00 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| DeepSeek-R1 | deepseek-r1 | 1.35 | 5.40 | Provider: Azure, Context: 163840, Output Limit: 163840 | |
| Grok Code Fast 1 | grok-code-fast-1 | 0.20 | 1.50 | Provider: Azure, Context: 256000, Output Limit: 10000 | |
| GPT-5.1 Codex | gpt-5.1-codex | 1.25 | 10.00 | Provider: Azure, Context: 400000, Output Limit: 128000 | |
| Phi-3-mini-instruct (4k) | phi-3-mini-4k-instruct | 0.13 | 0.52 | Provider: Azure, Context: 4096, Output Limit: 1024 | |
| Claude Haiku 4.5 | claude-haiku-4-5 | 1.00 | 5.00 | Provider: Azure, Context: 200000, Output Limit: 64000 | |
| DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | 0.28 | 0.42 | Provider: Azure, Context: 128000, Output Limit: 128000 | |
| Mistral Medium 3 | mistral-medium-2505 | 0.40 | 2.00 | Provider: Azure, Context: 128000, Output Limit: 128000 | |
| Claude Opus 4.5 | claude-opus-4-5 | 5.00 | 25.00 | Provider: Azure, Context: 200000, Output Limit: 64000 | |
| Phi-3-small-instruct (128k) | phi-3-small-128k-instruct | 0.15 | 0.60 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| Command A | cohere-command-a | 2.50 | 10.00 | Provider: Azure, Context: 256000, Output Limit: 8000 | |
| Command R+ | cohere-command-r-plus-08-2024 | 2.50 | 10.00 | Provider: Azure, Context: 128000, Output Limit: 4000 | |
| Llama 4 Maverick 17B 128E Instruct FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.25 | 1.00 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| GPT-4.1 mini | gpt-4.1-mini | 0.40 | 1.60 | Provider: Azure, Context: 1047576, Output Limit: 32768 | |
| GPT-5 Chat | gpt-5-chat | 1.25 | 10.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| DeepSeek-V3.1 | deepseek-v3.1 | 0.56 | 1.68 | Provider: Azure, Context: 131072, Output Limit: 131072 | |
| Phi-4 | phi-4 | 0.13 | 0.50 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| Phi-4-mini-reasoning | phi-4-mini-reasoning | 0.08 | 0.30 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 3.00 | 15.00 | Provider: Azure, Context: 200000, Output Limit: 64000 | |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 0.50 | 1.50 | Provider: Azure, Context: 16384, Output Limit: 16384 | |
| Grok 3 | grok-3 | 3.00 | 15.00 | Provider: Azure, Context: 131072, Output Limit: 8192 | |
| text-embedding-3-large | text-embedding-3-large | 0.13 | 0.00 | Provider: Azure, Context: 8191, Output Limit: 3072 | |
| Meta-Llama-3-70B-Instruct | meta-llama-3-70b-instruct | 2.68 | 3.54 | Provider: Azure, Context: 8192, Output Limit: 2048 | |
| DeepSeek-V3-0324 | deepseek-v3-0324 | 1.14 | 4.56 | Provider: Azure, Context: 131072, Output Limit: 131072 | |
| Phi-3-small-instruct (8k) | phi-3-small-8k-instruct | 0.15 | 0.60 | Provider: Azure, Context: 8192, Output Limit: 2048 | |
| Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 2.68 | 3.54 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| GPT-4 Turbo | gpt-4-turbo | 10.00 | 30.00 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 3.00 | 4.00 | Provider: Azure, Context: 16384, Output Limit: 16384 | |
| Phi-3.5-mini-instruct | phi-3.5-mini-instruct | 0.13 | 0.52 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| o1-preview | o1-preview | 16.50 | 66.00 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.71 | 0.71 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 0.25 | 2.00 | Provider: Azure, Context: 400000, Output Limit: 128000 | |
| Kimi K2 Thinking | kimi-k2-thinking | 0.60 | 2.50 | Provider: Azure, Context: 262144, Output Limit: 262144 | |
| Model Router | model-router | 0.14 | 0.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| o3-mini | o3-mini | 1.10 | 4.40 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| GPT-5.1 | gpt-5.1 | 1.25 | 10.00 | Provider: Azure, Context: 272000, Output Limit: 128000 | |
| GPT-5 Nano | gpt-5-nano | 0.05 | 0.40 | Provider: Azure, Context: 272000, Output Limit: 128000 | |
| GPT-5-Codex | gpt-5-codex | 1.25 | 10.00 | Provider: Azure, Context: 400000, Output Limit: 128000 | |
| Llama-3.2-90B-Vision-Instruct | llama-3.2-90b-vision-instruct | 2.04 | 2.04 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| Phi-3-mini-instruct (128k) | phi-3-mini-128k-instruct | 0.13 | 0.52 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| GPT-4o | gpt-4o | 2.50 | 10.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 1.50 | 2.00 | Provider: Azure, Context: 4096, Output Limit: 4096 | |
| Ministral 3B | ministral-3b | 0.04 | 0.04 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| GPT-4.1 | gpt-4.1 | 2.00 | 8.00 | Provider: Azure, Context: 1047576, Output Limit: 32768 | |
| o4-mini | o4-mini | 1.10 | 4.40 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| Phi-4-multimodal | phi-4-multimodal | 0.08 | 0.32 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.30 | 0.61 | Provider: Azure, Context: 8192, Output Limit: 2048 | |
| o1 | o1 | 15.00 | 60.00 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| Grok 3 Mini | grok-3-mini | 0.30 | 0.50 | Provider: Azure, Context: 131072, Output Limit: 8192 | |
| GPT-5.1 Chat | gpt-5.1-chat | 1.25 | 10.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| Phi-3.5-MoE-instruct | phi-3.5-moe-instruct | 0.16 | 0.64 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| GPT-5 Mini | gpt-5-mini | 0.25 | 2.00 | Provider: Azure, Context: 272000, Output Limit: 128000 | |
| o1-mini | o1-mini | 1.10 | 4.40 | Provider: Azure, Context: 128000, Output Limit: 65536 | |
| Llama 4 Scout 17B 16E Instruct | llama-4-scout-17b-16e-instruct | 0.20 | 0.78 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| Embed v3 English | cohere-embed-v3-english | 0.10 | 0.00 | Provider: Azure, Context: 512, Output Limit: 1024 | |
| text-embedding-ada-002 | text-embedding-ada-002 | 0.10 | 0.00 | Provider: Azure, Context: 8192, Output Limit: 1536 | |
| Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.30 | 0.61 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| GPT-5.1 Codex Max | gpt-5.1-codex-max | 1.25 | 10.00 | Provider: Azure, Context: 400000, Output Limit: 128000 | |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 1.50 | 2.00 | Provider: Azure, Context: 4096, Output Limit: 4096 | |
| Mistral Nemo | mistral-nemo | 0.15 | 0.15 | Provider: Azure, Context: 128000, Output Limit: 128000 | |
| o3 | o3 | 2.00 | 8.00 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| Codex Mini | codex-mini | 1.50 | 6.00 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| Phi-3-medium-instruct (4k) | phi-3-medium-4k-instruct | 0.17 | 0.68 | Provider: Azure, Context: 4096, Output Limit: 1024 | |
| Phi-4-reasoning | phi-4-reasoning | 0.13 | 0.50 | Provider: Azure, Context: 32000, Output Limit: 4096 | |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 10.00 | 30.00 | Provider: Azure, Context: 128000, Output Limit: 4096 | |
| Phi-4-reasoning-plus | phi-4-reasoning-plus | 0.13 | 0.50 | Provider: Azure, Context: 32000, Output Limit: 4096 | |
| GPT-4o mini | gpt-4o-mini | 0.15 | 0.60 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| GPT-5 | gpt-5 | 1.25 | 10.00 | Provider: Azure, Context: 272000, Output Limit: 128000 | |
| MAI-DS-R1 | mai-ds-r1 | 1.35 | 5.40 | Provider: Azure, Context: 128000, Output Limit: 8192 | |
| DeepSeek-V3.2 | deepseek-v3.2 | 0.28 | 0.42 | Provider: Azure, Context: 128000, Output Limit: 128000 | |
| GPT-5 Pro | gpt-5-pro | 15.00 | 120.00 | Provider: Azure, Context: 400000, Output Limit: 272000 | |
| Mistral Large 24.11 | mistral-large-2411 | 2.00 | 6.00 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| GPT-5.2 | gpt-5.2 | 1.75 | 14.00 | Provider: Azure, Context: 400000, Output Limit: 128000 | |
| Codestral 25.01 | codestral-2501 | 0.30 | 0.90 | Provider: Azure, Context: 256000, Output Limit: 256000 | |
| Mistral Small 3.1 | mistral-small-2503 | 0.10 | 0.30 | Provider: Azure, Context: 128000, Output Limit: 32768 | |
| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 1.00 | 2.00 | Provider: Azure, Context: 16384, Output Limit: 16384 | |
| ada | ada | 0.10 | 0.00 | Source: azure, Context: 8191 | |
| command-r-plus | command-r-plus | 3.00 | 15.00 | Source: azure, Context: 128000 | |
| computer-use-preview | computer-use-preview | 3.00 | 12.00 | Source: azure, Context: 8192 | |
| container | container | 0.00 | 0.00 | Source: azure, Context: N/A | |
| gpt-4o-2024-08-06 | gpt-4o-2024-08-06 | 2.75 | 11.00 | Source: azure, Context: 128000 | |
| gpt-4o-2024-11-20 | gpt-4o-2024-11-20 | 2.75 | 11.00 | Source: azure, Context: 128000 | |
| gpt-4o-mini-2024-07-18 | gpt-4o-mini-2024-07-18 | 0.17 | 0.66 | Source: azure, Context: 128000 | |
| gpt-4o-mini-realtime-preview-2024-12-17 | gpt-4o-mini-realtime-preview-2024-12-17 | 0.66 | 2.64 | Source: azure, Context: 128000 | |
| gpt-4o-realtime-preview-2024-10-01 | gpt-4o-realtime-preview-2024-10-01 | 5.50 | 22.00 | Source: azure, Context: 128000 | |
| gpt-4o-realtime-preview-2024-12-17 | gpt-4o-realtime-preview-2024-12-17 | 5.50 | 22.00 | Source: azure, Context: 128000 | |
| gpt-5-2025-08-07 | gpt-5-2025-08-07 | 1.38 | 11.00 | Source: azure, Context: 272000 | |
| gpt-5-mini-2025-08-07 | gpt-5-mini-2025-08-07 | 0.28 | 2.20 | Source: azure, Context: 272000 | |
| gpt-5-nano-2025-08-07 | gpt-5-nano-2025-08-07 | 0.06 | 0.44 | Source: azure, Context: 272000 | |
| o1-2024-12-17 | o1-2024-12-17 | 16.50 | 66.00 | Source: azure, Context: 200000 | |
| o1-mini-2024-09-12 | o1-mini-2024-09-12 | 1.21 | 4.84 | Source: azure, Context: 128000 | |
| o1-preview-2024-09-12 | o1-preview-2024-09-12 | 16.50 | 66.00 | Source: azure, Context: 128000 | |
| o3-mini-2025-01-31 | o3-mini-2025-01-31 | 1.21 | 4.84 | Source: azure, Context: 200000 | |
| gpt-3.5-turbo | gpt-3.5-turbo | 0.50 | 1.50 | Source: azure, Context: 4097 | |
| gpt-35-turbo | gpt-35-turbo | 0.50 | 1.50 | Source: azure, Context: 4097 | |
| gpt-35-turbo-0125 | gpt-35-turbo-0125 | 0.50 | 1.50 | Source: azure, Context: 16384 | |
| gpt-35-turbo-0301 | gpt-35-turbo-0301 | 0.20 | 2.00 | Source: azure, Context: 4097 | |
| gpt-35-turbo-0613 | gpt-35-turbo-0613 | 1.50 | 2.00 | Source: azure, Context: 4097 | |
| gpt-35-turbo-1106 | gpt-35-turbo-1106 | 1.00 | 2.00 | Source: azure, Context: 16384 | |
| gpt-35-turbo-16k | gpt-35-turbo-16k | 3.00 | 4.00 | Source: azure, Context: 16385 | |
| gpt-35-turbo-16k-0613 | gpt-35-turbo-16k-0613 | 3.00 | 4.00 | Source: azure, Context: 16385 | |
| gpt-4-0125-preview | gpt-4-0125-preview | 10.00 | 30.00 | Source: azure, Context: 128000 | |
| gpt-4-0613 | gpt-4-0613 | 30.00 | 60.00 | Source: azure, Context: 8192 | |
| gpt-4-1106-preview | gpt-4-1106-preview | 10.00 | 30.00 | Source: azure, Context: 128000 | |
| gpt-4-32k-0613 | gpt-4-32k-0613 | 60.00 | 120.00 | Source: azure, Context: 32768 | |
| gpt-4-turbo-2024-04-09 | gpt-4-turbo-2024-04-09 | 10.00 | 30.00 | Source: azure, Context: 128000 | |
| gpt-4-turbo-vision-preview | gpt-4-turbo-vision-preview | 10.00 | 30.00 | Source: azure, Context: 128000 | |
| gpt-4.1-2025-04-14 | gpt-4.1-2025-04-14 | 2.00 | 8.00 | Source: azure, Context: 1047576 | |
| gpt-4.1-mini-2025-04-14 | gpt-4.1-mini-2025-04-14 | 0.40 | 1.60 | Source: azure, Context: 1047576 | |
| gpt-4.1-nano-2025-04-14 | gpt-4.1-nano-2025-04-14 | 0.10 | 0.40 | Source: azure, Context: 1047576 | |
| gpt-4.5-preview | gpt-4.5-preview | 75.00 | 150.00 | Source: azure, Context: 128000 | |
| gpt-4o-2024-05-13 | gpt-4o-2024-05-13 | 5.00 | 15.00 | Source: azure, Context: 128000 | |
| gpt-audio-2025-08-28 | gpt-audio-2025-08-28 | 2.50 | 10.00 | Source: azure, Context: 128000 | |
| gpt-audio-mini-2025-10-06 | gpt-audio-mini-2025-10-06 | 0.60 | 2.40 | Source: azure, Context: 128000 | |
| gpt-4o-audio-preview-2024-12-17 | gpt-4o-audio-preview-2024-12-17 | 2.50 | 10.00 | Source: azure, Context: 128000 | |
| gpt-4o-mini-audio-preview-2024-12-17 | gpt-4o-mini-audio-preview-2024-12-17 | 2.50 | 10.00 | Source: azure, Context: 128000 | |
| gpt-realtime-2025-08-28 | gpt-realtime-2025-08-28 | 4.00 | 16.00 | Source: azure, Context: 32000 | |
| gpt-realtime-mini-2025-10-06 | gpt-realtime-mini-2025-10-06 | 0.60 | 2.40 | Source: azure, Context: 32000 | |
| gpt-4o-mini-transcribe | gpt-4o-mini-transcribe | 1.25 | 5.00 | Source: azure, Context: 16000 | |
| gpt-4o-mini-tts | gpt-4o-mini-tts | 2.50 | 10.00 | Source: azure, Context: N/A | |
| gpt-4o-transcribe | gpt-4o-transcribe | 2.50 | 10.00 | Source: azure, Context: 16000 | |
| gpt-4o-transcribe-diarize | gpt-4o-transcribe-diarize | 2.50 | 10.00 | Source: azure, Context: 16000 | |
| gpt-5.1-2025-11-13 | gpt-5.1-2025-11-13 | 1.25 | 10.00 | Source: azure, Context: 272000 | |
| gpt-5.1-chat-2025-11-13 | gpt-5.1-chat-2025-11-13 | 1.25 | 10.00 | Source: azure, Context: 128000 | |
| gpt-5.1-codex-2025-11-13 | gpt-5.1-codex-2025-11-13 | 1.25 | 10.00 | Source: azure, Context: 272000 | |
| gpt-5.1-codex-mini-2025-11-13 | gpt-5.1-codex-mini-2025-11-13 | 0.25 | 2.00 | Source: azure, Context: 272000 | |
| gpt-5-chat-latest | gpt-5-chat-latest | 1.25 | 10.00 | Source: azure, Context: 128000 | |
| gpt-5.2-2025-12-11 | gpt-5.2-2025-12-11 | 1.75 | 14.00 | Source: azure, Context: 400000 | |
| gpt-5.2-chat-2025-12-11 | gpt-5.2-chat-2025-12-11 | 1.75 | 14.00 | Source: azure, Context: 128000 | |
| gpt-5.2-pro | gpt-5.2-pro | 21.00 | 168.00 | Source: azure, Context: 400000 | |
| gpt-5.2-pro-2025-12-11 | gpt-5.2-pro-2025-12-11 | 21.00 | 168.00 | Source: azure, Context: 400000 | |
| gpt-image-1 | gpt-image-1 | 5.00 | 0.00 | Source: azure, Context: N/A | |
| dall-e-3 | dall-e-3 | 0.00 | 0.00 | Source: azure, Context: N/A | |
| gpt-image-1-mini | gpt-image-1-mini | 2.00 | 0.00 | Source: azure, Context: N/A | |
| gpt-image-1.5 | gpt-image-1.5 | 5.00 | 0.00 | Source: azure, Context: N/A | |
| gpt-image-1.5-2025-12-16 | gpt-image-1.5-2025-12-16 | 5.00 | 0.00 | Source: azure, Context: N/A | |
| mistral-large-2402 | mistral-large-2402 | 8.00 | 24.00 | Source: azure, Context: 32000 | |
| mistral-large-latest | mistral-large-latest | 8.00 | 24.00 | Source: azure, Context: 32000 | |
| o3-2025-04-16 | o3-2025-04-16 | 2.00 | 8.00 | Source: azure, Context: 200000 | |
| o3-deep-research | o3-deep-research | 10.00 | 40.00 | Source: azure, Context: 200000 | |
| o3-pro | o3-pro | 20.00 | 80.00 | Source: azure, Context: 200000 | |
| o3-pro-2025-06-10 | o3-pro-2025-06-10 | 20.00 | 80.00 | Source: azure, Context: 200000 | |
| o4-mini-2025-04-16 | o4-mini-2025-04-16 | 1.10 | 4.40 | Source: azure, Context: 200000 | |
| dall-e-2 | dall-e-2 | 0.00 | 0.00 | Source: azure, Context: N/A | |
| azure-tts | azure-tts | 0.00 | 0.00 | Source: azure, Context: N/A | |
| azure-tts-hd | azure-tts-hd | 0.00 | 0.00 | Source: azure, Context: N/A | |
| tts-1 | tts-1 | 0.00 | 0.00 | Source: azure, Context: N/A | |
| tts-1-hd | tts-1-hd | 0.00 | 0.00 | Source: azure, Context: N/A | |
| whisper-1 | whisper-1 | 0.00 | 0.00 | Source: azure, Context: N/A | |
| sora-2 | sora-2 | 0.00 | 0.00 | Source: azure, Context: N/A | |
| sora-2-pro | sora-2-pro | 0.00 | 0.00 | Source: azure, Context: N/A | |
| sora-2-pro-high-res | sora-2-pro-high-res | 0.00 | 0.00 | Source: azure, Context: N/A | |

Sources
models-dev: 92 models
litellm: 76 models
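
Because the prices in the listing are quoted per 1M tokens, a per-request estimate needs a small conversion. The following is a minimal sketch, assuming cost scales linearly with token counts and ignoring anything the listing does not show (cached-input discounts, batch pricing, regional differences). The `PRICES` dictionary and the `estimate_cost` function are hypothetical names used for illustration; the three price pairs are copied from the listing.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

# (input price, output price) in USD per 1M tokens, copied from the listing.
# Only a few rows are included here; extend as needed.
PRICES = {
    "gpt-4.1-nano": (Decimal("0.10"), Decimal("0.40")),
    "gpt-4o": (Decimal("2.50"), Decimal("10.00")),
    "claude-sonnet-4-5": (Decimal("3.00"), Decimal("15.00")),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear per-token pricing."""
    input_price, output_price = PRICES[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens on gpt-4o.
    cost = estimate_cost("gpt-4o", 10_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

For the request in the usage example, the estimate is 2.50 × 0.01 + 10.00 × 0.002 = 0.045 USD. `Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.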