@cf/meta/llama-3.2-3b-instruct

llama-3.2-3b-instruct

Provider: Cloudflare Workers AI, Context: 128000, Output Limit: 128000

Available at 6 Providers

| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
| --- | --- | --- | --- | --- | --- |
| cloudflareworkersai | models-dev | $0.05 | $0.34 | Provider: Cloudflare Workers AI, Context: 128000, Output Limit: 128000 | |
| cloudflareaigateway | models-dev | $0.05 | $0.34 | Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384 | |
| inference | models-dev | $0.02 | $0.02 | Provider: Inference, Context: 16000, Output Limit: 4096 | |
| deepinfra | litellm | $0.02 | $0.02 | Source: deepinfra, Context: 131072 | |
| hyperbolic | litellm | $0.12 | $0.30 | Source: hyperbolic, Context: 32768 | |
| openrouter | openrouter | $0.02 | $0.02 | Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it supports eight languages, including English, Spanish, and Hindi, and is adaptable for additional languages. Trained on 9 trillion tokens, the Llama 3.2 3B model excels in instruction-following, complex reasoning, and tool use. Its balanced performance makes it ideal for applications needing accuracy and efficiency in text generation across multilingual settings. Click here for the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md). Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/). Context: 131072 | |
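
The prices above are quoted per one million tokens, so the cost of a single request is the token count times the price, divided by 1,000,000. The Python sketch below illustrates that arithmetic with the listed prices. The function name `estimate_cost_usd`, the `PRICES` dictionary, and the example token counts are hypothetical, introduced only for illustration. The sketch also assumes billing is strictly linear in tokens, and it ignores context limits, output limits, and any free tier.

```python
# Listed prices in USD per 1M tokens, copied from the table above:
# provider -> (input price, output price)
PRICES = {
    "cloudflareworkersai": (0.05, 0.34),
    "cloudflareaigateway": (0.05, 0.34),
    "inference": (0.02, 0.02),
    "deepinfra": (0.02, 0.02),
    "hyperbolic": (0.12, 0.30),
    "openrouter": (0.02, 0.02),
}


def estimate_cost_usd(input_tokens, output_tokens, input_price, output_price):
    """Estimate request cost in USD from per-1M-token prices (assumes linear billing)."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for provider, (in_price, out_price) in PRICES.items():
        cost = estimate_cost_usd(10_000, 2_000, in_price, out_price)
        print(f"{provider}: ${cost:.6f}")
```

Because input and output tokens are priced separately at some providers, the cheapest option can depend on how output-heavy a workload is, so it may be worth running the estimate with token counts that reflect real traffic.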