llama-3.3-70b-cs

llama-3.3-70b-cs

World’s fastest inference for Llama 3.3 70B with Cerebras. The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperforms many of the available open source and closed chat models on common industry benchmarks.

Available at 1 Provider

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description	Free
poe	poe	Input: $7,800.00	Output: -	World’s fastest inference for Llama 3.3 70B with Cerebras. The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperforms many of the available open source and closed chat models on common industry benchmarks.