| IBM Granite 4.0 H Micro |
granite-4.0-h-micro
|
0.02 |
0.11 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BART Large CNN |
bart-large-cnn
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Mistral 7B Instruct v0.1 |
mistral-7b-instruct-v0.1
|
0.11 |
0.19 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| DistilBERT SST-2 INT8 |
distilbert-sst-2-int8
|
0.03 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| MyShell MeloTTS |
melotts
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Gemma 3 12B IT |
gemma-3-12b-it
|
0.35 |
0.56 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| PLaMo Embedding 1B |
plamo-embedding-1b
|
0.02 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| GPT OSS 20B |
gpt-oss-20b
|
0.20 |
0.30 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| GPT OSS 120B |
gpt-oss-120b
|
0.35 |
0.75 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| IndicTrans2 EN-Indic 1B |
indictrans2-en-indic-1b
|
0.34 |
0.34 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Pipecat Smart Turn v2 |
smart-turn-v2
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Qwen 2.5 Coder 32B Instruct |
qwen2.5-coder-32b-instruct
|
0.66 |
1.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Qwen3 30B A3B FP8 |
qwen3-30b-a3b-fp8
|
0.05 |
0.34 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Qwen3 Embedding 0.6B |
qwen3-embedding-0.6b
|
0.01 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| QwQ 32B |
qwq-32b
|
0.66 |
1.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Mistral Small 3.1 24B Instruct |
mistral-small-3.1-24b-instruct
|
0.35 |
0.56 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Deepgram Aura 2 (ES) |
aura-2-es
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Deepgram Aura 2 (EN) |
aura-2-en
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Deepgram Nova 3 |
nova-3
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Gemma SEA-LION v4 27B IT |
gemma-sea-lion-v4-27b-it
|
0.35 |
0.56 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.2 11B Vision Instruct |
llama-3.2-11b-vision-instruct
|
0.05 |
0.68 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.1 8B Instruct FP8 |
llama-3.1-8b-instruct-fp8
|
0.15 |
0.29 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 2 7B Chat FP16 |
llama-2-7b-chat-fp16
|
0.56 |
6.67 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3 8B Instruct |
llama-3-8b-instruct
|
0.28 |
0.83 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.1 8B Instruct |
llama-3.1-8b-instruct
|
0.28 |
0.83 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| M2M100 1.2B |
m2m100-1.2b
|
0.34 |
0.34 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.2 3B Instruct |
llama-3.2-3b-instruct
|
0.05 |
0.34 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.3 70B Instruct FP8 Fast |
llama-3.3-70b-instruct-fp8-fast
|
0.29 |
2.25 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3 8B Instruct AWQ |
llama-3-8b-instruct-awq
|
0.12 |
0.27 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.2 1B Instruct |
llama-3.2-1b-instruct
|
0.03 |
0.20 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 4 Scout 17B 16E Instruct |
llama-4-scout-17b-16e-instruct
|
0.27 |
0.85 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama Guard 3 8B |
llama-guard-3-8b
|
0.48 |
0.03 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| Llama 3.1 8B Instruct AWQ |
llama-3.1-8b-instruct-awq
|
0.12 |
0.27 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BGE M3 |
bge-m3
|
0.01 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BGE Base EN v1.5 |
bge-base-en-v1.5
|
0.07 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BGE Large EN v1.5 |
bge-large-en-v1.5
|
0.20 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BGE Reranker Base |
bge-reranker-base
|
0.00 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| BGE Small EN v1.5 |
bge-small-en-v1.5
|
0.02 |
0.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| DeepSeek R1 Distill Qwen 32B |
deepseek-r1-distill-qwen-32b
|
0.50 |
4.88 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| GPT-4 |
gpt-4
|
30.00 |
60.00 |
Provider: Cloudflare AI Gateway, Context: 8192, Output Limit: 8192
|
|
| GPT-5.1 Codex |
gpt-5.1-codex
|
1.25 |
10.00 |
Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
|
|
| GPT-3.5-turbo |
gpt-3.5-turbo
|
0.50 |
1.50 |
Provider: Cloudflare AI Gateway, Context: 16385, Output Limit: 4096
|
|
| GPT-4 Turbo |
gpt-4-turbo
|
10.00 |
30.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 4096
|
|
| o3-mini |
o3-mini
|
1.10 |
4.40 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
|
|
| GPT-5.1 |
gpt-5.1
|
1.25 |
10.00 |
Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
|
|
| GPT-4o |
gpt-4o
|
2.50 |
10.00 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| o4-mini |
o4-mini
|
1.10 |
4.40 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
|
|
| o1 |
o1
|
15.00 |
60.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
|
|
| o3-pro |
o3-pro
|
20.00 |
80.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
|
|
| o3 |
o3
|
2.00 |
8.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
|
|
| GPT-4o mini |
gpt-4o-mini
|
0.15 |
0.60 |
Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
|
|
| GPT-5.2 |
gpt-5.2
|
1.75 |
14.00 |
Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
|
|
| Claude Opus 4 (latest) |
claude-opus-4
|
15.00 |
75.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 32000
|
|
| Claude Opus 4.1 (latest) |
claude-opus-4-1
|
15.00 |
75.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 32000
|
|
| Claude Haiku 4.5 (latest) |
claude-haiku-4-5
|
1.00 |
5.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
|
|
| Claude Haiku 3 |
claude-3-haiku
|
0.25 |
1.25 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
|
|
| Claude Opus 4.5 (latest) |
claude-opus-4-5
|
5.00 |
25.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
|
|
| Claude Opus 3 |
claude-3-opus
|
15.00 |
75.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
|
|
| Claude Sonnet 4.5 (latest) |
claude-sonnet-4-5
|
3.00 |
15.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
|
|
| Claude Sonnet 3.5 v2 |
claude-3.5-sonnet
|
3.00 |
15.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
|
|
| Claude Sonnet 3 |
claude-3-sonnet
|
3.00 |
15.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
|
|
| Claude Haiku 3.5 (latest) |
claude-3-5-haiku
|
0.80 |
4.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
|
|
| Claude Haiku 3.5 (latest) |
claude-3.5-haiku
|
0.80 |
4.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
|
|
| Claude Sonnet 4 (latest) |
claude-sonnet-4
|
3.00 |
15.00 |
Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
|
|