Meta: Llama 3.3 70B Instruct

Model ID: meta-llama/llama-3.3-70b-instruct

Meta Llama 3.3 is a multilingual large language model (LLM): a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only model is optimized for multilingual dialogue use cases and outperforms many of the available open-source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. [Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)

Pricing per 1M Tokens

Input (Prompt)	$0.10
Output (Completion)	$0.32
Cache Read	Free
Cache Write	Free
Image	N/A
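
To make the per-1M-token prices concrete, here is a minimal sketch of a per-request cost estimate. The prices are copied from the table above; the function name `estimate_cost` and the example token counts are illustrative assumptions, and actual billing may round or count tokens differently.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table above.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.32")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Cache reads and writes are listed as free, so only prompt and
    completion tokens contribute to the total.
    """
    input_cost = Decimal(prompt_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(completion_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token completion.
    print(estimate_cost(2_000, 500))
```

Decimal arithmetic is used here instead of floats so that very small per-request amounts are not distorted by binary rounding.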

Specifications

Context Length	131K
Max Output Tokens	16K
Input Modalities	Text
Output Modalities	Text
Tokenizer	Llama3
Instruct Type	llama3
Top Provider Context	131K
Top Provider Max Output	16K
Moderated	No
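
The context and output limits above can be turned into a simple pre-flight check. This is a sketch under stated assumptions: "131K" and "16K" are read as 131,072 and 16,384 tokens, the completion is assumed to share the context window with the prompt, and `max_completion_budget` is a hypothetical helper, not part of any provider API.

```python
# Assumption: "131K" is read as 128 * 1024 tokens.
CONTEXT_LENGTH = 131_072
# Assumption: "16K" is read as 16 * 1024 tokens.
MAX_OUTPUT_TOKENS = 16_384


def max_completion_budget(prompt_tokens: int) -> int:
    """Return how many completion tokens can still be requested.

    Assumes prompt and completion tokens share one context window, so
    the budget is the smaller of the output cap and the space left
    after the prompt. Returns 0 when the prompt alone fills the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    # A short prompt is limited by the output cap, a long one by the window.
    print(max_completion_budget(1_000))
    print(max_completion_budget(130_000))
```

Token counts depend on the tokenizer (Llama3 for this model), so a prompt should be counted with that tokenizer before applying a check like this.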