← Back to all models

Cloudflareaigateway Models

64 Models
Name Model ID Input Price ($/1M) Output Price ($/1M) Description Free
IBM Granite 4.0 H Micro granite-4.0-h-micro 0.02 0.11 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BART Large CNN bart-large-cnn 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Mistral 7B Instruct v0.1 mistral-7b-instruct-v0.1 0.11 0.19 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
DistilBERT SST-2 INT8 distilbert-sst-2-int8 0.03 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
MyShell MeloTTS melotts 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Gemma 3 12B IT gemma-3-12b-it 0.35 0.56 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
PLaMo Embedding 1B plamo-embedding-1b 0.02 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
GPT OSS 20B gpt-oss-20b 0.20 0.30 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
GPT OSS 120B gpt-oss-120b 0.35 0.75 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
IndicTrans2 EN-Indic 1B indictrans2-en-indic-1b 0.34 0.34 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Pipecat Smart Turn v2 smart-turn-v2 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Qwen 2.5 Coder 32B Instruct qwen2.5-coder-32b-instruct 0.66 1.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Qwen3 30B A3B FP8 qwen3-30b-a3b-fp8 0.05 0.34 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Qwen3 Embedding 0.6B qwen3-embedding-0.6b 0.01 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
QwQ 32B qwq-32b 0.66 1.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Mistral Small 3.1 24B Instruct mistral-small-3.1-24b-instruct 0.35 0.56 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Deepgram Aura 2 (ES) aura-2-es 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Deepgram Aura 2 (EN) aura-2-en 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Deepgram Nova 3 nova-3 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Gemma SEA-LION v4 27B IT gemma-sea-lion-v4-27b-it 0.35 0.56 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.2 11B Vision Instruct llama-3.2-11b-vision-instruct 0.05 0.68 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.1 8B Instruct FP8 llama-3.1-8b-instruct-fp8 0.15 0.29 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 2 7B Chat FP16 llama-2-7b-chat-fp16 0.56 6.67 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3 8B Instruct llama-3-8b-instruct 0.28 0.83 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.1 8B Instruct llama-3.1-8b-instruct 0.28 0.83 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
M2M100 1.2B m2m100-1.2b 0.34 0.34 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.2 3B Instruct llama-3.2-3b-instruct 0.05 0.34 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.3 70B Instruct FP8 Fast llama-3.3-70b-instruct-fp8-fast 0.29 2.25 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3 8B Instruct AWQ llama-3-8b-instruct-awq 0.12 0.27 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.2 1B Instruct llama-3.2-1b-instruct 0.03 0.20 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 4 Scout 17B 16E Instruct llama-4-scout-17b-16e-instruct 0.27 0.85 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama Guard 3 8B llama-guard-3-8b 0.48 0.03 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
Llama 3.1 8B Instruct AWQ llama-3.1-8b-instruct-awq 0.12 0.27 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BGE M3 bge-m3 0.01 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BGE Base EN v1.5 bge-base-en-v1.5 0.07 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BGE Large EN v1.5 bge-large-en-v1.5 0.20 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BGE Reranker Base bge-reranker-base 0.00 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
BGE Small EN v1.5 bge-small-en-v1.5 0.02 0.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
DeepSeek R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b 0.50 4.88 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
GPT-4 gpt-4 30.00 60.00 Provider: Cloudflare AI Gateway, Context: 8192, Output Limit: 8192
GPT-5.1 Codex gpt-5.1-codex 1.25 10.00 Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
GPT-3.5-turbo gpt-3.5-turbo 0.50 1.50 Provider: Cloudflare AI Gateway, Context: 16385, Output Limit: 4096
GPT-4 Turbo gpt-4-turbo 10.00 30.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 4096
o3-mini o3-mini 1.10 4.40 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
GPT-5.1 gpt-5.1 1.25 10.00 Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
GPT-4o gpt-4o 2.50 10.00 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
o4-mini o4-mini 1.10 4.40 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
o1 o1 15.00 60.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
o3-pro o3-pro 20.00 80.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
o3 o3 2.00 8.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
GPT-4o mini gpt-4o-mini 0.15 0.60 Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
GPT-5.2 gpt-5.2 1.75 14.00 Provider: Cloudflare AI Gateway, Context: 400000, Output Limit: 128000
Claude Opus 4 (latest) claude-opus-4 15.00 75.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 32000
Claude Opus 4.1 (latest) claude-opus-4-1 15.00 75.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 32000
Claude Haiku 4.5 (latest) claude-haiku-4-5 1.00 5.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
Claude Haiku 3 claude-3-haiku 0.25 1.25 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
Claude Opus 4.5 (latest) claude-opus-4-5 5.00 25.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
Claude Opus 3 claude-3-opus 15.00 75.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 3.00 15.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000
Claude Sonnet 3.5 v2 claude-3.5-sonnet 3.00 15.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
Claude Sonnet 3 claude-3-sonnet 3.00 15.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 4096
Claude Haiku 3.5 (latest) claude-3-5-haiku 0.80 4.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
Claude Haiku 3.5 (latest) claude-3.5-haiku 0.80 4.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 8192
Claude Sonnet 4 (latest) claude-sonnet-4 3.00 15.00 Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 64000