@cf/meta/llama-3.2-3b-instruct

llama-3.2-3b-instruct

Provider: Cloudflare Workers AI, Context: 128000, Output Limit: 128000

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description
cloudflareworkersai	models-dev	Input: $0.05	Output: $0.34	Provider: Cloudflare Workers AI, Context: 128000, Output Limit: 128000
cloudflareaigateway	models-dev	Input: $0.05	Output: $0.34	Provider: Cloudflare AI Gateway, Context: 128000, Output Limit: 16384
inference	models-dev	Input: $0.02	Output: $0.02	Provider: Inference, Context: 16000, Output Limit: 4096
deepinfra	litellm	Input: $0.02	Output: $0.02	Source: deepinfra, Context: 131072
hyperbolic	litellm	Input: $0.12	Output: $0.30	Source: hyperbolic, Context: 32768
openrouter	openrouter	Input: $0.02	Output: $0.02	Llama 3.2 3B is a 3-billion-parameter multilingual large language model optimized for natural language processing tasks such as dialogue generation, reasoning, and summarization. Built on a transformer architecture, it supports eight languages, including English, Spanish, and Hindi, and can be adapted to additional languages. Trained on 9 trillion tokens, the model excels in instruction following, complex reasoning, and tool use. Its balance of accuracy and efficiency suits text generation in multilingual settings. See the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md). Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/). Context: 131072
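
The per-million-token prices in the table above can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming billing is strictly linear in token count with no minimum charge, rounding, or free tier (the table does not say). The `PRICES` dictionary and the `estimate_cost` function are hypothetical names introduced for illustration and are not part of any provider's API.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "cloudflareworkersai": (0.05, 0.34),
    "cloudflareaigateway": (0.05, 0.34),
    "inference": (0.02, 0.02),
    "deepinfra": (0.02, 0.02),
    "hyperbolic": (0.12, 0.30),
    "openrouter": (0.02, 0.02),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated cost in USD for a single request.

    Assumes simple linear pricing; real bills may differ.
    """
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Compare providers for a hypothetical request: 2,000 prompt tokens, 500 completion tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000, 500):.6f}")
```

Because the listed context and output limits differ by provider, a request that fits one provider's limits may be rejected by another; this sketch does not check for that.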