Meta: Llama 3.3 70B Instruct

ID: meta-llama/llama-3.3-70b-instruct

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only model is optimized for multilingual dialogue use cases and outperforms many of the available open-source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. [Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)

Pricing per 1M Tokens

Input (Prompt): $0.10
Output (Completion): $0.32
Cache Read: Free
Cache Write: Free
Image: N/A
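
As a rough illustration of how the per-1M-token prices translate into a per-request cost, here is a minimal sketch. It assumes billing is linear in token count and that prompt and completion tokens are charged at the listed input and output rates; the function name `estimate_cost` and the token counts are hypothetical, and cache pricing is ignored because it is listed as free.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing list above.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.32")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes simple linear billing: tokens * price / 1,000,000.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(prompt_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(completion_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 1,500-token completion.
    print(estimate_cost(12_000, 1_500))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.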

Specifications

Context Length: 131K
Max Output Tokens: 16K
Input Modalities: Text
Output Modalities: Text
Tokenizer: Llama3
Instruct Type: llama3
Top Provider Context: 131K
Top Provider Max Output: 16K
Moderated: No
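
The context and output limits interact when sizing a request. The sketch below shows one way to compute how many completion tokens can be requested for a given prompt size. It rests on two assumptions not stated in the listing: that "131K" and "16K" mean 131,072 and 16,384 tokens, and that the context window covers prompt and completion together. The helper name `max_completion_budget` is hypothetical.

```python
# Assumption: the rounded "131K" and "16K" figures are 128 * 1024 and 16 * 1024.
CONTEXT_LENGTH = 131_072
MAX_OUTPUT_TOKENS = 16_384


def max_completion_budget(prompt_tokens: int) -> int:
    """Largest completion size that fits both limits (hypothetical helper).

    Assumes prompt and completion share one context window. Returns 0
    when the prompt alone already fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 131_000, 200_000):
        print(prompt, max_completion_budget(prompt))
```

For short prompts the output cap is the binding limit; only prompts within 16,384 tokens of the full window reduce the available completion budget.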

Last updated: March 23, 2026

First tracked: March 23, 2026