
Azure Cognitive Services Models

90 Models
| Name | Model ID | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Description | Free |
|---|---|---|---|---|---|
| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 1.00 | 2.00 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| Mistral Small 3.1 | mistral-small-2503 | 0.10 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Codestral 25.01 | codestral-2501 | 0.30 | 0.90 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 256000 | |
| Mistral Large 24.11 | mistral-large-2411 | 2.00 | 6.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| GPT-5 Pro | gpt-5-pro | 15.00 | 120.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 272000 | |
| DeepSeek-V3.2 | deepseek-v3.2 | 0.28 | 0.42 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| MAI-DS-R1 | mai-ds-r1 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5 | gpt-5 | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| GPT-4o mini | gpt-4o-mini | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Phi-4-reasoning-plus | phi-4-reasoning-plus | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 32000, Output Limit: 4096 | |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 10.00 | 30.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Phi-4-reasoning | phi-4-reasoning | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 32000, Output Limit: 4096 | |
| Phi-3-medium-instruct (4k) | phi-3-medium-4k-instruct | 0.17 | 0.68 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 1024 | |
| Codex Mini | codex-mini | 1.50 | 6.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| o3 | o3 | 2.00 | 8.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Mistral Nemo | mistral-nemo | 0.15 | 0.15 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 1.50 | 2.00 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 4096 | |
| Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.30 | 0.61 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| text-embedding-ada-002 | text-embedding-ada-002 | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 1536 | |
| Embed v3 English | cohere-embed-v3-english | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 512, Output Limit: 1024 | |
| Llama 4 Scout 17B 16E Instruct | llama-4-scout-17b-16e-instruct | 0.20 | 0.78 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| o1-mini | o1-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 65536 | |
| GPT-5 Mini | gpt-5-mini | 0.25 | 2.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| Phi-3.5-MoE-instruct | phi-3.5-moe-instruct | 0.16 | 0.64 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| GPT-5.1 Chat | gpt-5.1-chat | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Grok 3 Mini | grok-3-mini | 0.30 | 0.50 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 8192 | |
| o1 | o1 | 15.00 | 60.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.30 | 0.61 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| Phi-4-multimodal | phi-4-multimodal | 0.08 | 0.32 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| o4-mini | o4-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| GPT-4.1 | gpt-4.1 | 2.00 | 8.00 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |
| Ministral 3B | ministral-3b | 0.04 | 0.04 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 1.50 | 2.00 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 4096 | |
| GPT-4o | gpt-4o | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Phi-3-mini-instruct (128k) | phi-3-mini-128k-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Llama-3.2-90B-Vision-Instruct | llama-3.2-90b-vision-instruct | 2.04 | 2.04 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5-Codex | gpt-5-codex | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| GPT-5 Nano | gpt-5-nano | 0.05 | 0.40 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| GPT-5.1 | gpt-5.1 | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000 | |
| o3-mini | o3-mini | 1.10 | 4.40 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| Model Router | model-router | 0.14 | 0.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Kimi K2 Thinking | kimi-k2-thinking | 0.60 | 2.50 | Provider: Azure Cognitive Services, Context: 262144, Output Limit: 262144 | |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 0.25 | 2.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.71 | 0.71 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| o1-preview | o1-preview | 16.50 | 66.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Phi-3.5-mini-instruct | phi-3.5-mini-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 3.00 | 4.00 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| GPT-4 Turbo | gpt-4-turbo | 10.00 | 30.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 2.68 | 3.54 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| Phi-3-small-instruct (8k) | phi-3-small-8k-instruct | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| DeepSeek-V3-0324 | deepseek-v3-0324 | 1.14 | 4.56 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 131072 | |
| Meta-Llama-3-70B-Instruct | meta-llama-3-70b-instruct | 2.68 | 3.54 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 2048 | |
| text-embedding-3-large | text-embedding-3-large | 0.13 | 0.00 | Provider: Azure Cognitive Services, Context: 8191, Output Limit: 3072 | |
| Grok 3 | grok-3 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 8192 | |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 0.50 | 1.50 | Provider: Azure Cognitive Services, Context: 16384, Output Limit: 16384 | |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Phi-4-mini-reasoning | phi-4-mini-reasoning | 0.08 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Phi-4 | phi-4 | 0.13 | 0.50 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| DeepSeek-V3.1 | deepseek-v3.1 | 0.56 | 1.68 | Provider: Azure Cognitive Services, Context: 131072, Output Limit: 131072 | |
| GPT-5 Chat | gpt-5-chat | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| GPT-4.1 mini | gpt-4.1-mini | 0.40 | 1.60 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |
| Llama 4 Maverick 17B 128E Instruct FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.25 | 1.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| Command R+ | cohere-command-r-plus-08-2024 | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4000 | |
| Command A | cohere-command-a | 2.50 | 10.00 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 8000 | |
| Phi-3-small-instruct (128k) | phi-3-small-128k-instruct | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Claude Opus 4.5 | claude-opus-4-5 | 5.00 | 25.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Mistral Medium 3 | mistral-medium-2505 | 0.40 | 2.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | 0.28 | 0.42 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 128000 | |
| Claude Haiku 4.5 | claude-haiku-4-5 | 1.00 | 5.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 64000 | |
| Phi-3-mini-instruct (4k) | phi-3-mini-4k-instruct | 0.13 | 0.52 | Provider: Azure Cognitive Services, Context: 4096, Output Limit: 1024 | |
| GPT-5.1 Codex | gpt-5.1-codex | 1.25 | 10.00 | Provider: Azure Cognitive Services, Context: 400000, Output Limit: 128000 | |
| Grok Code Fast 1 | grok-code-fast-1 | 0.20 | 1.50 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 10000 | |
| DeepSeek-R1 | deepseek-r1 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 163840, Output Limit: 163840 | |
| Meta-Llama-3.1-405B-Instruct | meta-llama-3.1-405b-instruct | 5.33 | 16.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 32768 | |
| GPT-4 32K | gpt-4-32k | 60.00 | 120.00 | Provider: Azure Cognitive Services, Context: 32768, Output Limit: 32768 | |
| Phi-4-mini | phi-4-mini | 0.08 | 0.30 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Embed v3 Multilingual | cohere-embed-v3-multilingual | 0.10 | 0.00 | Provider: Azure Cognitive Services, Context: 512, Output Limit: 1024 | |
| Grok 4 | grok-4 | 3.00 | 15.00 | Provider: Azure Cognitive Services, Context: 256000, Output Limit: 64000 | |
| Command R | cohere-command-r-08-2024 | 0.15 | 0.60 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4000 | |
| Embed v4 | cohere-embed-v-4-0 | 0.12 | 0.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 1536 | |
| Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.37 | 0.37 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 8192 | |
| GPT-5.2 Chat | gpt-5.2-chat | 1.75 | 14.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| Claude Opus 4.1 | claude-opus-4-1 | 15.00 | 75.00 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 32000 | |
| GPT-4 | gpt-4 | 60.00 | 120.00 | Provider: Azure Cognitive Services, Context: 8192, Output Limit: 8192 | |
| Phi-3-medium-instruct (128k) | phi-3-medium-128k-instruct | 0.17 | 0.68 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 4096 | |
| Grok 4 Fast (Reasoning) | grok-4-fast-reasoning | 0.20 | 0.50 | Provider: Azure Cognitive Services, Context: 2000000, Output Limit: 30000 | |
| DeepSeek-R1-0528 | deepseek-r1-0528 | 1.35 | 5.40 | Provider: Azure Cognitive Services, Context: 163840, Output Limit: 163840 | |
| Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 0.20 | 0.50 | Provider: Azure Cognitive Services, Context: 2000000, Output Limit: 30000 | |
| text-embedding-3-small | text-embedding-3-small | 0.02 | 0.00 | Provider: Azure Cognitive Services, Context: 8191, Output Limit: 1536 | |
| GPT-4.1 nano | gpt-4.1-nano | 0.10 | 0.40 | Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768 | |
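
Because the prices above are quoted per 1M tokens, turning them into a per-request figure takes a small conversion. The sketch below is one way to do that arithmetic, assuming billing is strictly linear in token count (real invoices may also reflect caching discounts, tiers, or rounding that this page does not describe). The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration; the three price pairs are copied from the rows for `gpt-4o-mini`, `gpt-5`, and `text-embedding-3-small`.

```python
from decimal import Decimal

# (input price, output price) in USD per 1M tokens, copied from the list above.
# Only three rows are included here; extend the dict as needed.
PRICES = {
    "gpt-4o-mini": (Decimal("0.15"), Decimal("0.60")),
    "gpt-5": (Decimal("1.25"), Decimal("10.00")),
    "text-embedding-3-small": (Decimal("0.02"), Decimal("0.00")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumes cost scales linearly with tokens, which is a simplification.
    Raises KeyError if model_id is not in PRICES.
    """
    input_price, output_price = PRICES[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 2,000 prompt tokens and 500 completion tokens on gpt-5.
    print(estimate_cost("gpt-5", 2_000, 500))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when many requests are summed.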