Nebius Models

15 Models
| Name | Model ID | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Description | Free |
|---|---|---|---|---|---|
| Hermes 4 70B | hermes-4-70b | 0.13 | 0.40 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Hermes-4 405B | hermes-4-405b | 1.00 | 3.00 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Kimi K2 Instruct | kimi-k2-instruct | 0.50 | 2.40 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Llama 3.1 Nemotron Ultra 253B v1 | llama-3_1-nemotron-ultra-253b-v1 | 0.60 | 1.80 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| GPT OSS 20B | gpt-oss-20b | 0.05 | 0.20 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| GPT OSS 120B | gpt-oss-120b | 0.15 | 0.60 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Qwen3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | 0.20 | 0.60 | Provider: Nebius Token Factory, Context: 262144, Output Limit: 8192 | |
| Qwen3 235B A22B Thinking 2507 | qwen3-235b-a22b-thinking-2507 | 0.20 | 0.80 | Provider: Nebius Token Factory, Context: 262144, Output Limit: 8192 | |
| Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 0.40 | 1.80 | Provider: Nebius Token Factory, Context: 262144, Output Limit: 66536 | |
| Llama 3.1 405B Instruct | llama-3_1-405b-instruct | 1.00 | 3.00 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Llama-3.3-70B-Instruct (Fast) | llama-3.3-70b-instruct-fast | 0.25 | 0.75 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| Llama-3.3-70B-Instruct (Base) | llama-3.3-70b-instruct-base | 0.13 | 0.40 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| GLM 4.5 | glm-4.5 | 0.60 | 2.20 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| GLM 4.5 Air | glm-4.5-air | 0.20 | 1.20 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
| DeepSeek V3 | deepseek-v3 | 0.50 | 1.50 | Provider: Nebius Token Factory, Context: 131072, Output Limit: 8192 | |
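
Since the prices above are quoted per 1M tokens, the cost of a single request can be estimated with simple arithmetic. The sketch below is illustrative only: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, not part of any Nebius API, and the figures are copied from three rows of the listing. It also assumes input and output tokens are billed linearly at the listed rates, which the listing does not state explicitly.

```python
# Prices in USD per 1M tokens as (input, output), copied from the listing.
# PRICES and estimate_cost are hypothetical helpers, not a provider API.
PRICES = {
    "gpt-oss-20b": (0.05, 0.20),
    "qwen3-235b-a22b-instruct-2507": (0.20, 0.60),
    "llama-3_1-405b-instruct": (1.00, 3.00),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD, assuming linear per-token billing."""
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a full 8,192-token completion.
    cost = estimate_cost("llama-3_1-405b-instruct", 100_000, 8_192)
    print(f"Estimated cost: ${cost:.4f}")
```

The same prompt sent to `gpt-oss-20b` would cost a small fraction of this under the listed rates, so comparing a few model IDs this way can help when choosing between them.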